Comment on Bill proposed to outlaw downloading Chinese AI models.

<- View Parent
p03locke@lemmy.dbzer0.com ⁨6⁩ ⁨days⁩ ago

Nobody releases training data. It’s too large and varied. The best I’ve seen was the LAION-2B set that Stable Diffusion used, and that’s still just a big collection of links. Even that isn’t going to fit on a GitHub repo.

Besides, improving the model means using the model as a base and implementing new training data. Specialize, specialize, specialize.

source
Sort:hotnewtop