robots.txt ✦ THE DOORMAN
Controls who gets in and where they can go. One of the oldest files on the web, upgraded for AI. It tells well-behaved crawlers and AI bots which sections of your site they may access (the rules are advisory, not enforced), and points them toward your AI-specific readiness files.
Read by: All web crawlers and AI bots
Update: When site structure changes
Think of it like…
“Imagine a library where some shelves are open to visitors and others are for staff only. robots.txt is the sign at the entrance that tells each visitor which sections they are free to browse.”
What AI systems do with this file
- Check it before crawling any page on your site
- Follow Allow and Disallow rules to determine which pages they may crawl
- Discover your AI-specific files through Sitemap references
- Learn where to find your llms.txt, ai-sitemap.xml, and other readiness files
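
The behavior in the list above can be sketched with Python's standard `urllib.robotparser` module. This is a minimal illustration, not a definitive setup: the domain `example.com`, the bot name `ExampleAIBot`, and every path and sitemap URL are assumptions made up for the example. The more specific Allow line is placed before the broader Disallow line so that first-match parsers (such as Python's) and longest-match parsers reach the same answer.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the domain, bot name, paths,
# and sitemap URLs are all illustrative assumptions.
ROBOTS_TXT = """\
User-agent: *
Allow: /admin/public/
Disallow: /admin/

User-agent: ExampleAIBot
Disallow: /drafts/

Sitemap: https://example.com/sitemap.xml
Sitemap: https://example.com/ai-sitemap.xml
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text the way a polite crawler would before fetching pages."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# A generic crawler falls under the "*" group.
print(rules.can_fetch("SomeCrawler", "https://example.com/blog/post"))          # True
print(rules.can_fetch("SomeCrawler", "https://example.com/admin/settings"))     # False
print(rules.can_fetch("SomeCrawler", "https://example.com/admin/public/help"))  # True

# A bot with its own group follows that group instead of "*".
print(rules.can_fetch("ExampleAIBot", "https://example.com/drafts/post"))       # False

# Sitemap references are how crawlers discover further files (Python 3.8+).
print(rules.site_maps())
```

Note that a bot matching its own User-agent group ignores the `*` group entirely, so any rule meant for every crawler has to be repeated in each named group.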
Deploy at: /robots.txt
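
Because crawlers look for the file only at the root of a host, the location a crawler will request can be derived from any page URL. The helper below is a small sketch of that derivation; the function name `robots_url` and the example URL are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host, replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("https://example.com/blog/post?id=1"))  # https://example.com/robots.txt
```

A file placed in a subdirectory, such as `/blog/robots.txt`, is not consulted, and each subdomain needs its own copy.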
