Even if it’s hidden behind a password?
Comment on AI Training Slop
frog@feddit.uk 2 days ago
They do, but even if they didn’t, AI companies are going to take them anyway. Bots make up 50% of internet traffic. AI companies have ignored robots.txt entries. Anything publicly available, even if it’s behind a password, is accessible since companies like Reddit sell that information.
KurtVonnegut@mander.xyz 2 days ago
frog@feddit.uk 2 days ago
Like private subreddits or private messages.
KurtVonnegut@mander.xyz 1 day ago
Ah, when stuff is behind a password but not encrypted and still on their servers. Yes.
frog@feddit.uk 1 day ago
Correct.
Ledericas@lemm.ee 1 day ago
Reddit is about to make that somewhat more “public”. I heard they are changing PMs and DMs to a chat system.
Blackmist@feddit.uk 1 day ago
If it’s on a billionaire’s computer, and they can read it, then yes. They’ll sell it, no questions asked.
E2E encrypted data is probably OK, as long as that person didn’t save it somewhere and upload it to a cloud backup.
HK65@sopuli.xyz 2 days ago
I’ve read a study that claimed ads were 50% of traffic by data volume.
Is anyone actually still using the internet, or is it all ad networks sending crap to bots?
frog@feddit.uk 2 days ago
This is my source: Forbes.
The article’s source is the Imperva 2024 Bad Bot Report, but I cannot download the report. I do not know how they measured traffic. In this age of social media, I am going to guess it is by data volume and site visits.
kautau@lemmy.world 2 days ago
Here’s the report:
files.catbox.moe/bm9n2c.pdf