Even its hidden behind a password?
Comment on AI Training Slop
frog@feddit.uk 1 year agoThey do, but even if they didn’t AI companies are going take them anyway. Bots make up 50% of internet traffic. AI companies have ignored robot.txt entries. Anything publicly available, even if it’s behind a password, is accessible since companies like Reddit sell that information.
KurtVonnegut@mander.xyz 1 year ago
frog@feddit.uk 1 year ago
Like private subreddits or private messages.
KurtVonnegut@mander.xyz 1 year ago
Ah when stuff is behind a password but not encrypted and still on their servers. Yes.
frog@feddit.uk 1 year ago
Correct.
Ledericas@lemm.ee 1 year ago
Reddit is about to make that somewhat more “public”, I heard they are changing the pm and DMs to a chat system
Blackmist@feddit.uk 1 year ago
If it’s on a billionaire’s computer, and they can read it, then yes. They’ll sell it, no questions asked.
E2E encrypted data is probably OK, as long as that person didn’t save it somewhere and upload it to a cloud backup.
HK65@sopuli.xyz 1 year ago
I’ve read a study that claimed ads were 50% of traffic by data volume.
Is anyone actually still using the internet, or is it all ad networks sending crap to bots?
frog@feddit.uk 1 year ago
This is my source : Forbes.
The source of the article is Imperva 2024 Bad Bot Report, but I cannot download the report. I do not know how they measured traffic. In this age of social media, I am going to guess it is by data volume and site visits.
kautau@lemmy.world 1 year ago
Here’s the report:
files.catbox.moe/bm9n2c.pdf