Comment on Why is AI Pornifying Asian Women?
Even_Adder@lemmy.dbzer0.com 10 months ago
It’s really hard getting dark skin sometimes. A lot of the time it’s not even just the model, LoRAs and Textual Inversions make the skin lighter again so you have to try even harder. It’s going to take conscious effort from people to tune models that are inclusive. With the way media is biased right now, I feel like it’s going to take a lot of effort.
jarfil@beehaw.org 10 months ago
“Inclusive models” would need to be larger.
Right now people seem to prefer smaller quantized models, with whatever set of even smaller LoRAs on top, that make them output what they want… and only include more generic elements in the base model.
Muehe@lemmy.ml 10 months ago
[citation needed]
To my understanding the problem is not model size but that the models reproduce biases in the training material. Alignment is currently a manual process after the initial unsupervised learning phase, often done by click-workers (Reinforcement Learning from Human Feedback, RLHF), and aimed at coaxing the model towards more “politically correct” outputs. But by that point the damage is already done, since the bias is encoded in the model weights and will resurface in the outputs, either at random or if you “jailbreak” enough.
In the context of the OP, if your training material has a high volume of sexualised depictions of Asian women the model will reproduce that in its outputs. Which is also the argument the article makes. So what you need for more inclusive models is essentially a de-biased training set designed with that specific purpose in mind.
I’m glad to be corrected here, especially if you have any sources to look at.
Even_Adder@lemmy.dbzer0.com 10 months ago
I wouldn’t mind. I’m here for it.
jarfil@beehaw.org 10 months ago
Are you ready to run a 100B-parameter FP64 model? Or even a 10B FP32 one?
Over time, I wouldn’t be surprised if 500B INT8 models became commonplace with neuromorphic RAM, but there’s still some time for that to happen.
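For a rough sense of scale, the sizes named above can be turned into storage numbers with simple arithmetic: parameter count times bytes per parameter. This is only a back-of-the-envelope sketch. The function name `weight_footprint_gb` and the lookup table are made up for illustration, and it counts the weights alone, so activations, caches, and any runtime overhead are ignored.

```python
# Bytes needed to store one parameter at each numeric precision.
BYTES_PER_PARAM = {"FP64": 8, "FP32": 4, "FP16": 2, "INT8": 1}


def weight_footprint_gb(num_params: float, precision: str) -> float:
    """Storage for the weights alone, in GB (10**9 bytes).

    Hypothetical helper for illustration; ignores activations and overhead.
    """
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    # The three configurations mentioned in the comment above.
    for params, precision in [(100e9, "FP64"), (10e9, "FP32"), (500e9, "INT8")]:
        size = weight_footprint_gb(params, precision)
        print(f"{params / 1e9:.0f}B {precision}: {size:.0f} GB")
```

Under those assumptions the three cases come to about 800 GB, 40 GB, and 500 GB of weights respectively.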
Even_Adder@lemmy.dbzer0.com 10 months ago
You don’t need that many concepts; 4 GB checkpoints work just fine.