diff options
| author | Fuwn <[email protected]> | 2025-05-02 03:53:56 -0700 |
|---|---|---|
| committer | Fuwn <[email protected]> | 2025-05-02 03:53:56 -0700 |
| commit | 7b5abdacf9ae1bbb68d6ca5c45df4381ff80dc0e (patch) | |
| tree | ff83daf1edb36463016e11d3a9d88383186bd7c8 /content | |
| parent | feat(the_daily): Add Time Machine blog post (diff) | |
| download | locus-7b5abdacf9ae1bbb68d6ca5c45df4381ff80dc0e.tar.xz locus-7b5abdacf9ae1bbb68d6ca5c45df4381ff80dc0e.zip | |
feat(robots.txt): Add additional rules from Anubis
Diffstat (limited to 'content')
| -rw-r--r-- | content/meta/robots.txt | 57 |
1 files changed, 45 insertions, 12 deletions
diff --git a/content/meta/robots.txt b/content/meta/robots.txt index 5be4278..3f8af65 100644 --- a/content/meta/robots.txt +++ b/content/meta/robots.txt @@ -1,35 +1,68 @@ +User-agent: AI2Bot User-agent: AdsBot-Google +User-agent: Ai2Bot-Dolma User-agent: Amazonbot -User-agent: anthropic-ai +User-agent: Applebot User-agent: Applebot-Extended +User-agent: Brightbot 1.0 User-agent: Bytespider User-agent: CCBot User-agent: ChatGPT-User -User-agent: ClaudeBot User-agent: Claude-Web -User-agent: cohere-ai +User-agent: ClaudeBot +User-agent: Cotoyogi +User-agent: Crawlspace User-agent: Diffbot +User-agent: DuckAssistBot User-agent: FacebookBot +User-agent: Factset_spyderbot +User-agent: FirecrawlAgent User-agent: FriendlyCrawler +User-agent: GPTBot User-agent: Google-Extended User-agent: GoogleOther -User-agent: GPTBot +User-agent: GoogleOther-Image +User-agent: GoogleOther-Video +User-agent: ICC-Crawler +User-agent: ISSCyberRiskCrawler +User-agent: ImagesiftBot +User-agent: Kangaroo Bot +User-agent: Meta-ExternalAgent +User-agent: Meta-ExternalFetcher +User-agent: NovaAct +User-agent: OAI-SearchBot +User-agent: Operator +User-agent: PanguBot +User-agent: Perplexity-User +User-agent: PerplexityBot +User-agent: PetalBot +User-agent: Scrapy +User-agent: SemrushBot-OCOB +User-agent: SemrushBot-SWA +User-agent: Sidetrade indexer bot +User-agent: TikTokSpider +User-agent: Timpibot +User-agent: VelenPublicWebCrawler +User-agent: Webzio-Extended +User-agent: YouBo +User-agent: YouBot +User-agent: aiHitBot +User-agent: anthropic-ai +User-agent: cohere-ai +User-agent: cohere-training-data-crawler +User-agent: ia_archiver +User-agent: iaskspider/2.0 User-agent: img2dataset +User-agent: imgproxy +User-agent: meta-externalagent +User-agent: meta-externalfetcher User-agent: omgili User-agent: omgilibot User-agent: peer39_crawler User-agent: peer39_crawler/1.0 -User-agent: PerplexityBot -User-agent: YouBot Disallow: / User-agent: * Disallow: /x Disallow: /proxy -User-agent: DataForSeoBot -Disallow: /x -Disallow: /proxy - -User-agent: ia_archiver -Disallow: |