Pricing
Log inStart free trial→
Try free
Measure

MentionShare Tracking

See your brand mention rate across 9 AI engines daily

Competitor Intelligence

Track share-of-voice vs competitors across all engines

Prompt Performance

Per-query mention rates and 90-day trend lines

Brand Sentiment

Track if AI describes your brand positively or negatively

Optimize

Fix Generator

Generate FAQ blocks, JSON-LD schema, and answer paragraphs

AI SEO Audit

Page-by-page AI readiness scoring with specific fixes

Integrations

Connect GA4, Search Console, Slack, and your data stack

Authority Capture

Build topical authority AI engines trust and cite

Featured

Fix Generator

Generate FAQ blocks, JSON-LD schema, and answer paragraphs — ready to publish in one click.

See how it works →

IntegrationsGA4Search ConsoleSlackAPI

By Team

Marketing Teams

Track AI visibility and generate content at scale

Founders & Startups

Get cited by AI engines from day one — no SEO agency needed

Agencies

Manage client workspaces with white-label PDF reporting

B2B SaaS Companies

Win AI-generated buyer comparisons in your software category

Enterprise

SSO, dedicated support & custom contracts

How Teams Use It

Improve AI Citations

Publish content that trains ChatGPT and Perplexity to recommend you

Prove AI-Driven ROI

Connect AI citations to real traffic and pipeline via GA4

Get a Competitive Edge

Real results from teams dominating AI-generated answers

Managing multiple clients?See Agency plan →

Content

Blog

New

AEO strategies, AI visibility guides, and industry insights

What's New

Updated

Latest product releases, features, and platform updates

AEO Beginner Guide

Free

Free 10-step guide to getting cited in AI search results

Tutorials

Step-by-step video walkthroughs for every feature

Reference

Documentation

Platform guide — features, workflows, and getting started

API Reference

REST API docs, authentication, and code examples

Case Studies

Real results from marketing teams and agencies

Comparisons

TrueCite vs Otterly, Peec AI, and more

Security

Data handling, compliance, and infrastructure

New to AEO?
Read the free guide →About us →
← Back to blog
Technical AEOMay 28, 2026·7 min read

robots.txt and AI Bots: Should You Allow or Block AI Crawlers?

SM
By Sukanta Mohapatra, Founder & CEO · TrueCite · Updated May 28, 2026

AI crawlers from ChatGPT, Perplexity, Claude, and others need to access your site to cite your brand. Here is exactly how to configure robots.txt for maximum AI visibility.

The AI Crawler Question Every Brand Needs to Answer

Your robots.txt file tells web crawlers what they can and cannot access. Traditionally, it was about search engine bots. Now, it is also about AI crawlers from ChatGPT, Perplexity, Claude, and others.

The decision to allow or block AI crawlers has direct consequences for your AEO. Here is what you need to know.

The Major AI Crawlers

CrawlerAI EngineCompany
GPTBotChatGPTOpenAI
ChatGPT-UserChatGPT (browsing)OpenAI
ClaudeBotClaudeAnthropic
PerplexityBotPerplexityPerplexity AI
Google-ExtendedGemini (training)Google
GooglebotGoogle AI OverviewsGoogle
FacebookBotMeta AIMeta
Applebot-ExtendedApple IntelligenceApple
cohere-aiCohereCohere

Why You Should Allow AI Crawlers (In Most Cases)

If you want AI engines to cite your brand, those engines need to be able to read your content. Blocking AI crawlers prevents:

  • ▸ChatGPT from learning about your products and recommendations
  • ▸Perplexity from citing your FAQ pages and product descriptions
  • ▸Claude from referencing your technical content
  • ▸AI engines from updating their knowledge about your brand

The math is simple: if AI bots cannot crawl your site, AI engines cannot cite you. And if AI engines cannot cite you, you have zero presence in the AI research sessions that influence your buyers.

When Blocking AI Crawlers Might Make Sense

There are legitimate reasons to block specific AI crawlers:

  • ▸Proprietary content — if you have valuable content you do not want used for AI training
  • ▸Paywalled content — resources that require payment and should not be freely available to AI
  • ▸Legal restrictions — content with licensing restrictions that prohibit AI training use
  • ▸Competitive intelligence — if competitors use AI tools to mine your content for insights

If these apply to specific sections of your site, use path-specific robots.txt rules rather than blanket blocks.

How to Configure robots.txt for AI Visibility

Allow all AI crawlers (recommended for most brands):

User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: FacebookBot
Allow: /

Allow AI crawlers but block training on specific content:

User-agent: GPTBot
Allow: /
Disallow: /private/
Disallow: /customer-data/

User-agent: Google-Extended
Allow: /
Disallow: /internal/

Block specific AI crawlers (use sparingly):

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

The AI Crawler Permission Score

TrueCite calculates an AI crawler permission score based on how many of the 9 major AI crawlers can access your site. The score affects your overall AEO health:

  • ▸90-100% — All or nearly all AI crawlers allowed. Optimal for AEO.
  • ▸60-89% — Most crawlers allowed. Minor gaps in specific engines.
  • ▸Below 60% — Significant crawler blocks that will limit AI citations.

Beyond robots.txt: Additional AI Access Signals

Meta AI no-index tags

The `<meta name="robots" content="noai, noimageai">` tag signals to some AI systems not to use page content for training. Use selectively — if applied site-wide, it may reduce your AI citation potential.

Crawl-delay directives

If you have a high-traffic site and want to control AI crawler load:

User-agent: GPTBot
Allow: /
Crawl-delay: 10

Sitemap reference in robots.txt

Always include your sitemap in robots.txt — this helps all crawlers, including AI ones, discover your content:

Sitemap: https://yourdomain.com/sitemap.xml

Check Your AI Crawler Status

Use TrueCite's Crawler Check to see:

  • ▸Which AI bots are currently allowed or blocked on your domain
  • ▸Your AI crawler permission score
  • ▸Specific robots.txt rules affecting each AI crawler
  • ▸Recommendations for improving your crawler access configuration

[Check your AI crawler status with TrueCite →](/dashboard/crawlers)

SM
BySukanta Mohapatra

Founder & CEO · TrueCite

Updated May 28, 2026

Related articles

ProductIntroducing AI Speedometer: How AI-Citable Is Your Website?GuideWhat is AEO? ChatGPT, Perplexity & Gemini GuideStrategyWhy ChatGPT Ignores Your Brand (and How to Fix It)
Want to improve your AI visibility? Start with TrueCite for free →
truecite.

When buyers ask AI, your brand is the answer.

Product

  • Pricing
  • Features
  • Integrations
  • API Docs
  • Fix Generator
  • AI SEO Audit

Company

  • About
  • Careers
  • Security
  • Case Studies
  • Comparisons
  • Support
  • Status

Resources

  • Blog
  • Documentation
  • Tutorials
  • AEO Guide
  • AEO Explained
  • GEO Explained

Legal

  • Privacy Policy
  • Terms of Service
  • Cookie Policy
truecite.
© 2026 TrueCite · AI Collective Labs Inc
PrivacyTermsSupport