Comment on OpenAI says it’s “impossible” to create useful AI models without copyrighted material
maynarkh@feddit.nl 1 year agoThe two big arguments are:
- Substantial reproduction of the original work, you can get back substantial portions of the original work from an AI model’s output.
- The AI model replaces the use of the original work. In short, a work that uses copyrighted material under fair use can’t be a replacement for the initial work.
intensely_human@lemm.ee 1 year ago
Have you confirmed this yourself?
chaos@beehaw.org 1 year ago
www.cnn.com/2024/01/08/tech/…/index.html
The thing is, it doesn’t really matter if you have to “manipulate” ChatGPT into spitting out training material word-for-word, the fact that it’s possible at all is proof that, intentionally or not, that material has been encoded into the model itself. That might still be fair use, but it’s a lot weaker than the original argument, which was that nothing of the original material really remains after training, it’s all synthesized and blended with everything else to create something entirely new that doesn’t replicate the original.
FaceDeer@kbin.social 1 year ago
You said:
If an AI is trained on a huge number of NYT articles and you're only able to get it to regurgitate one of them, that's not a "substantial portion of the original work." That's a minuscule portion of the original work.