Build a robots.txt file with allow/disallow rules, sitemap, and one-click AI-bot blocking (GPTBot, ClaudeBot, Google-Extended, and more).
Off = block everything except listed allows
Stop these bots from scraping your content for LLM training.
# Generated by onlinecorners.com
User-agent: *
Allow: /
Disallow: /admin/
Disallow: /api/
Disallow: /private/
User-agent: GPTBot
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: Applebot-Extended
Disallow: /
Important
Robots.txt is a request, not enforcement. Reputable bots respect it; malicious scrapers will ignore it. Pair with server-level blocking (Cloudflare WAF, rate limits) for real protection.