Problem
AI crawlers (GPTBot, ClaudeBot, ChatGPT-User, PerplexityBot) are likely being blocked by:
- Cloudflare AI Crawl Control: managed rules and/or a managed robots.txt may be overriding the origin robots.txt with Disallow directives
- Vercel AI Bots Managed Ruleset: if set to deny, returns 403 to all AI user agents (with an x-vercel-mitigated: deny header)
While these crawlers are blocked, the site may not appear in ChatGPT responses, Claude answers, Perplexity search results, or Google AI Overviews.
What to do
Cloudflare (dashboard → AI → AI Audit)
- Set "Block AI training bots" to Do not block
- Set "Manage your robots.txt" to Disabled
- On the Crawlers page, selectively block low-value training scrapers (Bytespider, CCBot) but keep GPTBot, ClaudeBot, ChatGPT-User, PerplexityBot set to Allow
Vercel (Firewall → Managed Rulesets)
- Change AI Bots Managed Ruleset from deny to log
Codebase
- Add public/llms.txt (llmstxt.org) for an AI-readable site description
- Verify robots.txt has no AI-specific Disallow directives
- Verify sitemap includes all public pages
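The robots.txt check in the list above could be scripted roughly as follows. This is a minimal sketch, not a full robots.txt parser: the function name find_ai_disallows is hypothetical, the bot list is the four crawlers named in this issue, and it flags any non-empty Disallow rule in a group that names one of them.

```shell
#!/usr/bin/env bash
# Sketch: list Disallow rules that apply to the AI crawlers named in this issue.
# find_ai_disallows is a hypothetical helper; it reads robots.txt on stdin.

find_ai_disallows() {
  awk '
    BEGIN {
      FS = ":"
      # Lowercased bot names, padded with spaces so index() matches whole names.
      bots = " gptbot claudebot chatgpt-user perplexitybot "
    }
    {
      sub(/#.*/, "")                          # drop trailing comments
      key = tolower($1); gsub(/[ \t]/, "", key)
      val = $2;          gsub(/^[ \t]+|[ \t\r]+$/, "", val)

      if (key == "user-agent") {
        # A User-agent line after rules starts a new group.
        if (in_rules) { n = 0; in_rules = 0 }
        agents[++n] = val
      } else if (key == "disallow" || key == "allow") {
        in_rules = 1
        if (key == "disallow" && val != "") {
          for (i = 1; i <= n; i++)
            if (index(bots, " " tolower(agents[i]) " "))
              print agents[i] ": Disallow " val
        }
      }
    }
  '
}

# Run against a file only when one is passed, e.g. ./check-robots.sh public/robots.txt
if [ "$#" -ge 1 ]; then
  find_ai_disallows < "$1"
fi
```

Empty output means no group targeting those four bots carries a Disallow rule. Rules under "User-agent: *" are deliberately ignored here, since they are not AI-specific.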
Verify
curl -sI -A "Mozilla/5.0 (compatible; GPTBot/1.0)" https://<your-domain>/
# Should return 200, not 403
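To repeat that check for all four crawlers at once, a small loop along these lines may help. It is a sketch under assumptions: classify_status and check_bots are hypothetical helper names, and the user-agent strings are simplified stand-ins in the same style as the command above, not the vendors' exact strings. A spoofed user agent only approximates how a real, IP-verified crawler is treated.

```shell
#!/usr/bin/env bash
# Sketch: probe a URL with several AI crawler user-agent strings and
# report the HTTP status for each. Helper names are hypothetical.

# Map an HTTP status code to a short verdict.
classify_status() {
  case "$1" in
    2??) echo "ok" ;;
    3??) echo "redirect" ;;
    403) echo "blocked" ;;
    *)   echo "other" ;;
  esac
}

# Send one request per bot and print: bot name, status code, verdict.
check_bots() {
  local url="$1"
  local bot code
  for bot in GPTBot ClaudeBot ChatGPT-User PerplexityBot; do
    code=$(curl -s -o /dev/null -w '%{http_code}' \
      -A "Mozilla/5.0 (compatible; ${bot}/1.0)" "$url")
    printf '%-15s %s %s\n' "$bot" "$code" "$(classify_status "$code")"
  done
}

# Run only when a URL is passed, e.g. ./check-bots.sh https://<your-domain>/
if [ "$#" -ge 1 ]; then
  check_bots "$1"
fi
```

A "blocked" verdict for any bot suggests one of the managed rules above is still active. A "redirect" usually means the URL needs its final scheme or host.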
Source: mean-weasel/bleep-that-shit (audit done in #620)