Even its hidden behind a password?
Comment on AI Training Slop
frog@feddit.uk 1 month agoThey do, but even if they didn’t AI companies are going take them anyway. Bots make up 50% of internet traffic. AI companies have ignored robot.txt entries. Anything publicly available, even if it’s behind a password, is accessible since companies like Reddit sell that information.
KurtVonnegut@mander.xyz 1 month ago
frog@feddit.uk 1 month ago
Like private subreddits or private messages.
KurtVonnegut@mander.xyz 1 month ago
Ah when stuff is behind a password but not encrypted and still on their servers. Yes.
frog@feddit.uk 1 month ago
Correct.
Ledericas@lemm.ee 1 month ago
Reddit is about to make that somewhat more “public”, I heard they are changing the pm and DMs to a chat system
Blackmist@feddit.uk 1 month ago
If it’s on a billionaire’s computer, and they can read it, then yes. They’ll sell it, no questions asked.
E2E encrypted data is probably OK, as long as that person didn’t save it somewhere and upload it to a cloud backup.
HK65@sopuli.xyz 1 month ago
I’ve read a study that claimed ads were 50% of traffic by data volume.
Is anyone actually still using the internet, or is it all ad networks sending crap to bots?
frog@feddit.uk 1 month ago
This is my source : Forbes.
The source of the article is Imperva 2024 Bad Bot Report, but I cannot download the report. I do not know how they measured traffic. In this age of social media, I am going to guess it is by data volume and site visits.
kautau@lemmy.world 1 month ago
Here’s the report:
files.catbox.moe/bm9n2c.pdf