Besides, the article is about image gen AI, not LLMs.
I can almost guarantee that hundred billion params LLMs are not trained on that, and are trained on the whole web scraped to the furthest extent.
The only sane and ethical solution going forward is to force to opensource all LLMs. Use the datasets generated by humanity - give back to humanity.
Mika@sopuli.xyz 2 days ago
onslaught545@lemmy.zip 1 day ago
That’s an LLM, buddy.
null@lemmy.nullspace.lol 1 day ago
Image Gen AI is an LLM?
onslaught545@lemmy.zip 1 day ago
Yes, it is. LLMs do more than just text generation.
Mika@sopuli.xyz 1 day ago
Article directly complains about AI artwork. You know what LLM even means?
onslaught545@lemmy.zip 1 day ago
Yes, I do. I also know that multimodal LLMs are what generate AI artwork.
unconfirmedsourcesDOTgov@lemmy.sdf.org 1 day ago
What do you think the letters LLM stand for, pal?
Skullgrid@lemmy.world 2 days ago
Jesus fucking christ. There are SO GODDAMN MANY open source LLMs, even from fucking scumbags like facebook. I get that there’s subtleties to the argument on the ProAI vs AntiAI side, but you guys just screech and scream.
github.com/eugeneyan/open-llms
vrighter@discuss.tchncs.de 11 hours ago
there are barely any. I can’t name a single one offhand. Open weights means absolutely nothing about the actual source of those weights.
6nk06@sh.itjust.works 1 day ago
Where are the sources? All I see is binary files.
Mika@sopuli.xyz 2 days ago
Lol, ofc meta, they have the biggest bigdata out there, full of private data.
Most of the opensources are recompilations of existing opensource LLMs.
And the page you’ve listed is <10b mostly, bar LLMs with huge financing, and generally either copropate or Chinese behind them.