Comment on The Open-Source Software Saving the Internet From AI Bot Scrapers
who@feddit.org 1 day ago
She told me she’s […] also thinking about a version that doesn’t require JavaScript, which some privacy-minded disable in their browsers.
As someone who is keenly aware of the privacy and security problems that come with allowing web scripts, I hope she prioritizes this soon. It’s really disappointing to find sites that were readable without javascript suddenly inaccessible since adopting Anubis. The more sites that do this, the more people are pushed toward enabling scripts by default, exposing them to a great many trackers and web exploits that would otherwise be blocked.
exu@feditown.com 12 hours ago
There’s an option using some very new HTML tag, but it’s not the default.
anubis.techaro.lol/docs/admin/…/metarefresh
who@feddit.org 9 hours ago
Interesting. Judging by that option’s name, it seems to refer to use of the HTML
<meta>
tag to refresh a page.developer.mozilla.org/en-US/docs/…/http-equiv
Neither this tag nor using it for refresh is new at all. I don’t think I’ve seen it used to detect bots, though. I wonder what Anubis is doing here.
JohnEdwa@sopuli.xyz 3 hours ago
It’s simply checking if the connection is from an actual browser, as a scraper pretending to be one won’t actually refresh the page as instructed. It’s going to buy some time, but like the rest of Anubis in general, it will only work until the scrapers get modified to work around it.