Comment on The Open-Source Software Saving the Internet From AI Bot Scrapers
who@feddit.org 1 week ago
Sorry; I shouldn’t have written Cloudflare specifically. The CAPTCHA page now contains scripts from Google, not Cloudflare. I have corrected my comment.
How do you know this?
Because a couple of months ago, archive.is/archive.today started showing me CAPTCHA pages instead of the archived articles whenever I use Firefox with scripts disabled. The current page contains scripts hosted by Google, which I won’t enable, so I can’t read the archived articles.
What about ghostarchive.org?
I haven’t used that site enough to have a consistent picture of what it’s doing. When I tried it a few minutes ago, it directed me to a CAPTCHA wall for submitting an article, but not for reading. I’ll try to remember to look at it again periodically, to be able to answer this question in the future.
remington@beehaw.org 1 week ago
Thanks. I appreciate the info and effort.