To be honest with you, I refuse on moral grounds. 404 are independent and do good work. You’ve already linked a pay wall bypass in the comments, if anyone would like to find it, it’s not hard to scroll.
Comment on The Open-Source Software Saving the Internet From AI Bot Scrapers
remington@beehaw.org 3 weeks ago
Would you edit your post and add the following archive link to the body, please?
who@feddit.org 3 weeks ago
Unfortunately, archive.is has moved behind Cloudflare, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.
I suggest this archive link instead:
web.archive.org/…/the-open-source-software-saving…
remington@beehaw.org 3 weeks ago
How do you know this?
What about ghostarchive.org?
who@feddit.org 3 weeks ago
Sorry; I shouldn’t have written Cloudflare specifically. The CAPTCHA page now contains scripts from Google, not Cloudflare. I have corrected my comment.
Because a couple months ago, archive.is/archive.today started showing me CAPTCHA pages instead of the archived articles when I use Firefox with scripts disabled. The current page contains scripts hosted by Google, which I won’t enable, so I can’t read the archived articles.
I haven’t used that site enough to have a consistent picture of what it’s doing. When I tried it a few minutes ago, it directed me to a CAPTCHA wall for submitting an article, but not for reading. I’ll try to remember to look at it again periodically, to be able to answer this question in the future.
remington@beehaw.org 3 weeks ago
Thanks. I appreciate the info and effort.