Blocking AI crawlers on the fediverse

cecep@fedia.io · 10 months ago

CameronDev@programming.dev · 10 months ago

But robots.txt is not a legal document — and 30 years after its creation, it still relies on the good will of all parties involved

You can ask nicely, they can (and will) ignore it.

lad@programming.dev · 10 months ago

Also, I’ve already seen complaints about AI companies scraping everything ignoring robots.txt

And we would block the obedient and useful crawlers while doing no harm to malicious