That would be terrible because they are both some of the best academic publishers in the humanities.
Comment on Dreams come true
ininewcrow@lemmy.ca 2 months ago
How many of these books will just be totally garbage nonsense just so they could fulfill a rearranged quota.
Now the LLM are filled with a good amount of nonsense.
poplargrove@lemmy.world 2 months ago
Tolookah@discuss.tchncs.de 2 months ago
Just use the llm to make the books that the llm then uses, what could go wrong?
runner_g@lemmy.blahaj.zone 2 months ago
Someone’s probably already coined the term, but I’m going to call it LLM inbreeding.
Naz@sh.itjust.works 2 months ago
I suggested this term in academic circles, as a joke.
I also suggested hallucinations ~3-6 years ago only to find out it was ALSO suggested in the 1970s.
Inbreeding, lol
anzo@programming.dev 2 months ago
There was some research article applying this 70s computer science concept to LLMs. It was published in Nature and hit major news outlets. Basically they further trained GPT on its output for a couple generations, until the model degraded terribly. Sounded obvious to me, but seeing it happen on the www is painful nonetheless…
chicken@lemmy.dbzer0.com 2 months ago
The real term is synthetic data
itslilith@lemmy.blahaj.zone 2 months ago
but it amounts to about the same
Benn@lemm.ee 2 months ago
It’s quite similar to another situation known as data incest
thesporkeffect@lemmy.world 2 months ago
Soylent AI? Auto-infocannibalism
rickyrigatoni@lemm.ee 2 months ago
It can only go right because corporations must be punished for trying to replace people with machines.