AI doesn’t produce data suitable for training AI. It’s a huge problem when AI generated slop makes its way into the training set because it generally degrades the quality of the model. Like a photocopy of a photocopy.
So where is all the data its trained on to surpass most people come from? Do you think they’re curating what they feed it based on IQ scores or something? Verifying accuracy, competency, etc? Or are you aware they just turn on the reddit/stackoverflow/github/etc. scrapers and start pumping them full of unfiltered 100% pure grade A internet bullshit?
thebestaquaman@lemmy.world 14 hours ago
That’s exactly what people have done for millennia though. It’s literally the reason you have heat in the winter and aren’t living in a cave.