Live demo — seeded data for voltaic.dev·Start free →
All agents

CCBot

Verified

Common Crawl · Model training

Operator docs

Builds the open Common Crawl corpus — the dataset behind many LLM training runs. Blocking CCBot removes you from most open training data.

Visits · 30d

948

First seen

Feb 26

Last seen

9m ago

Respects robots.txt

Yes

Verification

UA pattern + reverse DNS (operator domain)

Daily visits

last 30 days
37227May 13May 28Jun 11
Most-read pagesReads
/docs/quickstart178
/docs/api/meters115
/docs/api/tariffs108
/docs/sdks/python95
/docs/api/carbon90
/74
/pricing50
/customers38
/pricing-202436
/blog/carbon-api-launch31
Built withProductOS
Live demo — Agentlens