Comment on The Open-Source Software Saving the Internet From AI Bot Scrapers

remington@beehaw.org 1 year ago

Would you edit your post and add the following archive link to the body, please?

archive.is/VcoE1

who@feddit.org 1 year ago
Unfortunately, archive.is has moved behind Cloudflare, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.

I suggest this archive link instead:

web.archive.org/…/the-open-source-software-saving…
- remington@beehaw.org 1 year ago
  
  > Unfortunately, archive.is has moved behind Cloudflare, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.
  
  How do you know this?
  
  What about ghostarchive.org?
  - who@feddit.org 1 year ago
    Sorry; I shouldn’t have written Cloudflare specifically. The CAPTCHA page now contains scripts from Google, not Cloudflare. I have corrected my comment.
    
    > How do you know this?
    
    Because a couple months ago, archive.is/archive.today started showing me CAPTCHA pages instead of the archived articles when I use Firefox with scripts disabled. The current page contains scripts hosted by Google, which I won’t enable, so I can’t read the archived articles.
    
    > What about ghostarchive.org?
    
    I haven’t used that site enough to have a consistent picture of what it’s doing. When I tried it a few minutes ago, it directed me to a CAPTCHA wall for submitting an article, but not for reading. I’ll try to remember to look at it again periodically, to be able to answer this question in the future.
    - remington@beehaw.org 1 year ago
      Thanks. I appreciate the info and effort.
    
sabreW4K3@lazysoci.al 1 year ago
To be honest with you, I refuse on moral grounds. 404 are independent and do good work. You’ve already linked a paywall bypass in the comments; if anyone would like to find it, it’s not hard to scroll.
- remington@beehaw.org 1 year ago
  OK. Fair enough.