W

Web Light / googleweblight

Operated by GoogleDeveloper Tools
Visit Bot Homepage

Verify Web Light / googleweblight IP Address

Verify if an IP address truly belongs to Google, using official verification methods. Enter both IP address and User-Agent from your logs for the most accurate bot verification.

[This crawler is officially retired as per Google] googleweblight is a Google fetcher used by the now-deprecated Google Web Light service, which provided simplified, faster-loading versions of webpages for slow mobile networks. It requested pages to generate lightweight, transcoded versions optimized for low-bandwidth conditions. Site owners allowed it to ensure better accessibility for users on slow connections. Since Web Light has been discontinued, activity from this user-agent is now rare or legacy in nature. Any remaining traffic is typically minimal and related to leftover systems or outdated client requests rather than active Google services. It ignored the global user agent (*) rule. RobotSense.io verifies Web Light / googleweblight using Google’s official validation methods, ensuring only genuine Web Light / googleweblight traffic is identified.

This bot does not honor Crawl-Delay rule.

User Agent Examples

Mozilla/5.0 (Linux; Android 4.2.1; en-us; Nexus 5 Build/JOP40D) AppleWebKit/535.19 (KHTML, like Gecko; googleweblight) Chrome/38.0.1025.166 Mobile Safari/535.19
Example user agent strings for Web Light / googleweblight

Robots.txt Configuration for Web Light / googleweblight

Robots.txt User-Agent:googleweblight

Use this identifier in your robots.txt User-agent directive to target Web Light / googleweblight.

Recommended Configuration

Our recommended robots.txt configuration for Web Light / googleweblight:

# This bot is officially retired by Google
User-agent: googleweblight
Allow: /

Completely Block Web Light / googleweblight

Prevent this bot from crawling your entire site:

User-agent: googleweblight
Disallow: /

Completely Allow Web Light / googleweblight

Allow this bot to crawl your entire site:

User-agent: googleweblight
Allow: /

Block Specific Paths

Block this bot from specific directories or pages:

User-agent: googleweblight
Disallow: /private/
Disallow: /admin/
Disallow: /api/

Allow Only Specific Paths

Block everything but allow specific directories:

User-agent: googleweblight
Disallow: /
Allow: /public/
Allow: /blog/

Set Crawl Delay

Limit how frequently Web Light / googleweblight can request pages (in seconds):

User-agent: googleweblight
Allow: /
Crawl-delay: 10

Note: This bot does not officially mention about honoring Crawl-Delay rule.

Frequently Asked Questions

What is googleweblight, and why is it visiting my website?
googleweblight was a crawler operated by Google for the now-retired Google Web Light service. The service fetched webpages and generated lightweight, optimized versions designed for users on slow mobile networks or low-bandwidth connections. Historically, visits were triggered when users accessed webpages through Web Light-enabled experiences on mobile devices. Since the service has been discontinued, legitimate crawl activity from this bot is now rare and typically associated with legacy systems, cached requests, or outdated client behavior. Public websites may still occasionally see this User-Agent in server logs, but sustained activity is uncommon.
Is googleweblight a legitimate bot, or is it commonly spoofed?
googleweblight was an official Google fetcher when the Web Light service was active. However, because the service has been retired, any modern traffic using this User-Agent deserves closer scrutiny. User-Agent spoofing is common across well-known Google crawler names because attackers and scrapers sometimes imitate trusted bots to bypass rate limits, security filters, or bot protection systems. User-Agent strings alone cannot verify authenticity, since any automated client can send the same header. You can use Google's recommended methods mentioned below to verify a legitimate visit, or use RobotSense.io API to easily verify googleweblight visits.
How can I verify that a request is really coming from googleweblight?
You can use Google's recommended official methods to verify googleweblight visits, these include: - IP range checks - Reverse DNS → forward DNS Do not use User-Agent based detection as that can be easily spoofed. Alternatively, you can use RobotSense.io API to easily verify googleweblight and all other bots from Google.
Should I allow or block googleweblight on my website?
Because Google Web Light has been discontinued, allowing googleweblight is generally optional for most websites today. Blocking it typically has little or no impact on modern search visibility or user experience. Blocking may make sense when: - The traffic appears suspicious or excessive - Requests target sensitive endpoints or APIs - You want to reduce unnecessary bot traffic - Server resources are limited If verified Google-origin traffic is minimal and non-disruptive, allowing it is usually harmless, but most sites no longer depend on this crawler.
How can I control or block googleweblight using robots.txt or other methods?
You can add a rule in your robots.txt, as given above to control (crawl-delay) or disallow googleweblight. googleweblight honors robots.txt directives. Also, you can use further controls in your WAF, or in RobotSense enforcement settings to manage the bot behavior.
How often does googleweblight crawl websites, and can it impact server performance?
When the Web Light service was active, crawling was primarily request-driven and tied to mobile users accessing optimized pages. The bot fetched content as needed to generate lightweight versions for low-bandwidth environments. Today, crawl activity is typically very limited. Any remaining traffic usually has minimal impact on bandwidth, request rates, or dynamic page generation. On most websites, modern server performance effects from legitimate googleweblight traffic are negligible.
What happens if I block googleweblight? SEO, visibility, and feature impact explained.
Blocking googleweblight generally has little to no effect on modern SEO or Google Search rankings because the underlying Web Light service has been retired. Most websites no longer rely on this crawler for accessibility or content delivery features. Potential impacts are limited and mostly historical: - No significant impact on Google Search indexing - No effect on standard Googlebot crawling - No meaningful impact on modern mobile search visibility - Legacy low-bandwidth page transcoding would no longer function - Minimal or no effect on analytics or developer tools For most sites, blocking this crawler has no noticeable operational consequence today.
Does googleweblight collect, scrape, or use my content for training or reuse?
googleweblight fetched webpage content in order to generate simplified, compressed versions of pages for low-bandwidth mobile users. This included retrieving HTML, images, CSS, and other resources necessary to create optimized page renderings. Its documented purpose was content transcoding and mobile delivery optimization rather than AI training or large-scale content reuse. The crawler could temporarily process or cache page content and metadata to serve lightweight versions, but there is no documented indication that googleweblight was specifically used for machine learning training pipelines. The bot historically processed: - Full page HTML - Metadata and structured content - Images and page resources - Mobile rendering information - Performance optimization data Because the service has been retired, active collection activity from legitimate googleweblight infrastructure is now extremely limited.