Abliteration involves fine-tuning a language model to bypass built-in refusal mechanisms that prevent the model from generating responses to potentially harmful or sensitive prompts. Source
ericjmorey@beehaw.org 2 months ago
The shared repo doesn’t look like fine-tuning. It just looks like prompts.
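(For reference: abliteration as usually described, e.g. in the refusal-direction work by Arditi et al. that the term comes from, is a direct weight edit rather than gradient fine-tuning: a "refusal direction" is estimated from activation differences between refused and ordinary prompts, then projected out of the weights. A minimal sketch of that idea, with made-up shapes and random stand-in data instead of real model activations:)

```python
import numpy as np

# Stand-in activations: rows are per-prompt residual-stream vectors
# collected at one layer. Shapes and data here are assumptions, not
# real model outputs.
acts_harmful = np.random.randn(64, 512)   # prompts the model refuses
acts_harmless = np.random.randn(64, 512)  # ordinary prompts

# Estimate the "refusal direction" as the normalized difference of
# mean activations between the two prompt sets.
r = acts_harmful.mean(axis=0) - acts_harmless.mean(axis=0)
r /= np.linalg.norm(r)

# Ablate: project the direction out of a weight matrix that writes
# into the residual stream, so the layer can no longer write along r.
W = np.random.randn(512, 512)             # hypothetical weight matrix
W_abliterated = W - np.outer(r, r) @ W    # (I - r r^T) W
```

No gradient updates are involved, which is consistent with the point above: a repo containing only prompts would be neither fine-tuning nor abliteration.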