Thursday September 4, 2025
Artificial intelligence (AI) bots now scrape websites to train models and answer questions directly, often without sending traffic back to the original sources. Our new AI audit feature puts control in your hands: you decide which AI bots are allowed on your website, and which aren’t.
The new era of web crawlers
For years, website owners welcomed bots, also known as crawlers, from search engines, because they improved visibility and helped monetize traffic.
AI crawlers are now changing that model. They collect content not just for indexing, but to deliver instant answers directly inside AI tools.
For many businesses, this creates new opportunities. Mentions in AI responses can extend reach and attract new audiences.
For others, especially those that monetize through visits, it creates challenges. If an AI tool provides the full answer, only a few readers will click through. No traffic means no income.
Be in charge with AI audit
With AI audit, the choice is yours. Keep the crawlers that bring value. Block the ones that don’t. Adjust your settings anytime.
The feature is available to web hosting and cloud hosting clients using our content delivery network (CDN). You can find it in your website’s dashboard under Performance → CDN → AI Audit.

From there, you can:
- See which AI crawlers visit your site
- Monitor crawler activity and requests over time
- Block the bots you don’t want, while keeping the ones you do
Note: Blocking a crawler prevents future access, but doesn’t erase what it has already collected.
Our analysis: How AI bots are crawling the web
How active are these AI crawlers? To find out, we analyzed logs from 5 million client websites.
Here’s what we discovered:
- Hundreds of crawler groups visit our clients’ websites
- About a third of the most active crawlers are linked to AI providers
- Crawlers from Google, Meta, and OpenAI are the most active, reaching between 60% and 85% of websites daily
- Bots from Apple, ByteDance (TikTok), and Huawei (Petalsearch) typically crawl 20–25% of sites
- Anthropic (Claude), Amazon, Perplexity, and other AI crawlers visit only 2–5%
Don’t let the lower percentages fool you: crawlers rotate targets and can build a near-complete picture of the web within weeks.
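The article does not say how these percentages were computed. As a rough illustration only, a daily "share of sites reached" figure could be derived by counting the distinct sites each crawler requested and dividing by the total number of sites. The function name, the record layout, and the sample data below are all hypothetical.

```python
from collections import defaultdict


def daily_coverage(records, total_sites):
    """Return {crawler: percentage of sites it visited at least once}.

    `records` is an iterable of (site, crawler) pairs for a single day.
    This layout is an assumption made for the sketch, not a real log format.
    """
    sites_by_crawler = defaultdict(set)
    for site, crawler in records:
        # A set keeps each site once, so repeat visits are not double-counted.
        sites_by_crawler[crawler].add(site)
    return {
        crawler: 100 * len(sites) / total_sites
        for crawler, sites in sites_by_crawler.items()
    }


# Hypothetical one-day sample covering a pool of four sites.
sample = [
    ("site-a.example", "Googlebot"),
    ("site-b.example", "Googlebot"),
    ("site-c.example", "Googlebot"),
    ("site-a.example", "Googlebot"),  # repeat visit, counted once
    ("site-a.example", "Amazonbot"),
]
coverage = daily_coverage(sample, total_sites=4)
```

Running the same calculation over several weeks of records, rather than one day, would show how a crawler with low daily coverage can still reach most sites over time.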

To make crawler activity clearer, here are some examples of common AI bots:
- Amazonbot (Amazon) – Indexes web content for various purposes, including support for Amazon Alexa
- Applebot (Apple) – Crawled data helps power Apple’s features, including the company’s AI assistant Siri
- Openai-GPTBot (OpenAI) – Crawls content that may be used in training generative AI foundation models
- Googlebot (Google) – Google’s main crawler. While its official description doesn’t mention AI training, public data and evidence suggest it may be used for that purpose
- Meta-Externalagent (Meta) – Crawls the web for use cases such as training AI models or improving products
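Crawlers like the ones listed above usually identify themselves in the User-Agent request header. The following is a minimal sketch of how a request might be matched to a provider and checked against a block list. It is not how AI audit is implemented; the token table, function names, and sample User-Agent strings are assumptions made for illustration.

```python
# Lowercase substrings assumed to appear in each crawler's User-Agent header.
AI_CRAWLER_TOKENS = {
    "amazonbot": "Amazon",
    "applebot": "Apple",
    "gptbot": "OpenAI",
    "googlebot": "Google",
    "meta-externalagent": "Meta",
}


def identify_crawler(user_agent):
    """Return the provider name for a known crawler, or None if unrecognized."""
    ua = user_agent.lower()
    for token, provider in AI_CRAWLER_TOKENS.items():
        if token in ua:
            return provider
    return None


def should_block(user_agent, blocked_providers):
    """Return True when the request comes from a provider on the block list."""
    return identify_crawler(user_agent) in blocked_providers


# Hypothetical policy: block Meta's crawler, allow everything else.
blocked = {"Meta"}
decision = should_block("meta-externalagent/1.1", blocked)
```

Because a User-Agent header can be set to any value by the client, matching on it alone is only a first filter; a production setup would typically add further verification.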
Ready to take control?
The internet is changing fast, but your website is still your home. With AI audit, our CDN extension, you decide who can access your content, aligning visibility with your business goals.
Take control of your content today with AI audit.
And if you want to improve visibility in AI-generated responses, consider adopting generative engine optimization (GEO) strategies: create well-structured content, improve website loading speed, and add an llms.txt file to make it easier for AI tools to navigate your site.
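For the llms.txt suggestion, here is a small sketch that assembles such a file. It assumes the layout described in the community llms.txt proposal (a title heading, a short blockquote summary, then sections of annotated links), which is a proposal rather than a ratified standard. The function name, site details, and URLs are hypothetical.

```python
def build_llms_txt(title, summary, sections):
    """Build llms.txt content as a string.

    `sections` maps a section heading to a list of (name, url, note) tuples.
    The Markdown layout follows the llms.txt proposal as assumed here.
    """
    lines = [f"# {title}", "", f"> {summary}", ""]
    for heading, links in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for name, url, note in links:
            lines.append(f"- [{name}]({url}): {note}")
        lines.append("")
    # Trim trailing blank lines and end the file with a single newline.
    return "\n".join(lines).rstrip() + "\n"


# Hypothetical site content.
text = build_llms_txt(
    "Example Site",
    "Guides on web hosting.",
    {"Docs": [("Getting started", "https://example.com/start.md", "Setup guide")]},
)
```

The resulting text would be saved as llms.txt at the root of the site.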