# Brooked — robots.txt # https://brooked.io # ── Standard crawlers ──────────────────────────────────────────────────────── User-agent: * Allow: / # ── AI/LLM crawlers — explicitly allowed ───────────────────────────────────── # OpenAI / ChatGPT User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / # Anthropic / Claude User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: anthropic-ai Allow: / # Google AI (Gemini, AI Overviews) User-agent: Google-Extended Allow: / # Perplexity User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # Microsoft / Bing AI (Copilot) User-agent: Bingbot Allow: / # Apple User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / # You.com User-agent: YouBot Allow: / # Cohere User-agent: cohere-ai Allow: / # Meta AI User-agent: Meta-ExternalAgent Allow: / User-agent: FacebookBot Allow: / # Diffbot (used by many AI knowledge graphs) User-agent: Diffbot Allow: / # Common Crawl (training data for many LLMs) User-agent: CCBot Allow: / # Amazon User-agent: Amazonbot Allow: / # Bytedance / TikTok AI User-agent: Bytespider Allow: / # ── Sitemaps ────────────────────────────────────────────────────────────────── Sitemap: https://brooked.io/sitemap.xml Sitemap: https://brooked.io/llms.txt