User-agent: *
Allow: /
# Keep asset chunks crawlable for rendering. Use exact route patterns so
# /assets/usage-*.js and similar hashed files are not blocked by prefix rules.
Disallow: /*/admin/
Disallow: /*/usage$
Disallow: /*/usage/
Disallow: /*/api_access$
Disallow: /*/api_access/
Disallow: /*/transit-overlay-window$
Disallow: /*/transit-overlay-window/
Disallow: /*/components$
Disallow: /*/components/

# AI search bots: explicitly allowed so the site can be cited in
# Gemini / Perplexity / ChatGPT search answers. We pair this with a
# well-formed /llms.txt and /llms-full.txt for citation context.
# - Google-Extended: product token (not a separate crawler) that controls
#   whether content may be used for Gemini training and grounding. It does
#   not affect Google Search or AI Overviews, which follow Googlebot rules.
# - PerplexityBot: indexes for Perplexity search answers.
# - OAI-SearchBot: OpenAI's search-time crawler (separate from
#   GPTBot, which is a training crawler and remains blocked below).
# Note: a crawler that matches a named group ignores the "*" group, so the
# Disallow rules above do not apply to the bots listed here.
User-agent: Google-Extended
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: OAI-SearchBot
Allow: /

# AI training crawlers: blocked. These ingest content for LLM
# training and do not surface citations back to users. Keeping
# them disallowed protects content from unattributed reuse while
# still allowing the search-time bots above to cite the site.
User-agent: Amazonbot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: meta-externalagent
Disallow: /

User-agent: cohere-ai
Disallow: /

Sitemap: https://humandesignhub.app/sitemap.xml