Comment on If AI spits out stuff it's been trained on
OhNoMoreLemmy@lemmy.ml 1 week agoThis is one of those things where both are likely to be true. All webscale datasets have a problem with porn and csam, and it’s like that people wanting to generate csam use their own fine tuned models.
Here’s an example story. …stanford.edu/…/investigation-finds-ai-image-gene… and it’s very likely that this was the tip of the iceberg, and there’s more csam still in these datasets.