Googlebot News
SearchVerify Googlebot News IP Address
Verify if an IP address truly belongs to Google, using official verification methods. Enter both IP address and User-Agent from your logs for the most accurate bot verification.
Googlebot-News is Google’s crawler dedicated to discovering and indexing news content for Google News and Top Stories. The bot focuses on timely, high-quality journalism, scanning article pages, structured data, headlines, timestamps, authorship, and metadata to assess relevance and freshness. Crawling is more frequent than standard Googlebot, reflecting the need for rapid updates. Its role is to ensure accurate, real-time coverage of news sources across Google’s search and news platforms. RobotSense.io verifies Googlebot News using Google’s official validation methods, ensuring only genuine Googlebot News traffic is identified.
User Agent Examples
Googlebot Smartphone: Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/W.X.Y.Z Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Googlebot Desktop: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Googlebot/2.1; +http://www.google.com/bot.html) Chrome/W.X.Y.Z Safari/537.36
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Googlebot/2.1 (+http://www.google.com/bot.html)Robots.txt Configuration for Googlebot News
Googlebot-NewsUse this identifier in your robots.txt User-agent directive to target Googlebot News.
Recommended Configuration
Our recommended robots.txt configuration for Googlebot News:
User-agent: Googlebot-News
Allow: /Completely Block Googlebot News
Prevent this bot from crawling your entire site:
User-agent: Googlebot-News
Disallow: /Completely Allow Googlebot News
Allow this bot to crawl your entire site:
User-agent: Googlebot-News
Allow: /Block Specific Paths
Block this bot from specific directories or pages:
User-agent: Googlebot-News
Disallow: /private/
Disallow: /admin/
Disallow: /api/Allow Only Specific Paths
Block everything but allow specific directories:
User-agent: Googlebot-News
Disallow: /
Allow: /public/
Allow: /blog/Set Crawl Delay
Limit how frequently Googlebot News can request pages (in seconds):
User-agent: Googlebot-News
Allow: /
Crawl-delay: 10Note: This bot officially honors the Crawl-delay directive.
Frequently Asked Questions
- What is Googlebot News, and why is it visiting my website?
- Googlebot News is Google's specialized news crawler used to discover, fetch, and update news articles for Google News, Top Stories, and other news-related search features. It focuses on timely journalism content and evaluates article pages, headlines, publication timestamps, structured data, authorship information, and freshness signals. Visits are typically triggered by newly published articles, updated news coverage, news sitemaps, RSS feeds, or rapidly changing content. Crawl activity is generally more frequent than standard Googlebot because news indexing depends on fast content discovery and near real-time updates. Traffic from this crawler is expected for publicly accessible news and media websites.
- Is Googlebot News a legitimate bot, or is it commonly spoofed?
- Googlebot News is a legitimate crawler officially operated by Google as part of Google's search and news infrastructure. It is specifically designed for news discovery and indexing workflows. Like other major crawlers, its User-Agent may be spoofed by scrapers, malicious bots, or automated tools attempting to bypass bot filtering and security rules. Attackers commonly impersonate trusted crawlers because many websites allow them unrestricted access. User-Agent strings alone are not sufficient for verification. You can use Google's recommended methods mentioned below to verify a legitimate visit, or use RobotSense.io API to easily verify Googlebot News visits.
- How can I verify that a request is really coming from Googlebot News?
- You can use Google's recommended official methods to verify Googlebot News visits, these include: - IP range checks - Reverse DNS → forward DNS Do not use User-Agent based detection as that can be easily spoofed. Alternatively, you can use RobotSense.io API to easily verify Googlebot News and all other bots from Google.
- Should I allow or block Googlebot News on my website?
- Allowing Googlebot News is generally beneficial for publishers that want visibility in Google News, Top Stories, and news-related search features. Timely crawling helps newly published articles appear faster across Google's news ecosystem. Blocking may be appropriate for: - Private or subscriber-only news content - High-load publishing systems during traffic spikes - Internal editorial tools or APIs - Websites that do not want inclusion in Google News Some publishers selectively allow crawling for public article sections while restricting archives, staging systems, or premium content.
- How can I control or block Googlebot News using robots.txt or other methods?
- You can add a rule in your robots.txt, as given above to control (crawl-delay) or disallow Googlebot News. Googlebot News honors robots.txt directives. Also, you can use further controls in your WAF, or in RobotSense enforcement settings to manage the bot behavior.
- How often does Googlebot News crawl websites, and can it impact server performance?
- Googlebot News crawls news websites frequently and in near real time to detect breaking stories, article updates, and fresh reporting. Crawl frequency increases for active publishers with rapidly changing content. For most publishers, impact is moderate and manageable. However, high-volume news websites may notice increased: - Request rates during publishing spikes - Bandwidth usage - Dynamic page rendering load - Database activity from frequently updated article pages Performance impact is most noticeable on large media platforms publishing content continuously throughout the day.
- What happens if I block Googlebot News? SEO, visibility, and feature impact explained.
- Blocking Googlebot News can reduce or eliminate visibility within Google News and related news surfaces. Potential impacts include: - Articles excluded from Google News - Reduced eligibility for Top Stories placement - Slower discovery of breaking news content - Reduced visibility in news-related search experiences Typically unaffected: - Standard web indexing by regular Googlebot - Basic organic rankings outside news features - Direct website traffic Blocking Googlebot News does not necessarily remove pages from normal Google Search, but it can significantly reduce exposure within Google's news ecosystem.
- Does Googlebot News collect, scrape, or use my content for training or reuse?
- Googlebot News collects publicly accessible news content and related metadata to support news indexing and ranking systems. It fetches article text, headlines, timestamps, author information, structured data, images, and metadata associated with news pages. Collected information may be used for: - Google News indexing - Headline and snippet generation - News ranking systems - Freshness analysis - Search and news previews Google stores indexed article content and extracted metadata within its search infrastructure. Public documentation describes Googlebot News primarily as a news indexing crawler rather than a dedicated AI training crawler, although Google broadly applies machine learning systems within search and news ranking technologies.