Comment on The Open-Source Software Saving the Internet From AI Bot Scrapers
remington@beehaw.org 1 day agoUnfortunately, archive.is has moved behind Cloudflare, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.
How do you know this?
What about ghostarchive.org?
who@feddit.org 1 day ago
Sorry; I shouldn’t have written Cloudflare specifically. The CAPTCHA page now contains scripts from Google, not Cloudflare. I have corrected my comment.
Because a couple months ago, archive.is/archive.today started showing me CAPTCHA pages instead of the archived articles when I use Firefox with scripts disabled. The current page contains scripts hosted by Google, which I won’t enable, so I can’t read the archived articles.
I haven’t used that site enough to have a consistent picture of what it’s doing. When I tried it a few minutes ago, it directed me to a CAPTCHA wall for submitting an article, but not for reading. I’ll try to remember to look at it again periodically, to be able to answer this question in the future.
remington@beehaw.org 1 day ago
Thanks. I appreciate the info and effort.