Comment on Grok got a Nazi patch
PonyOfWar@pawb.social 17 hours agoAt the very least shouldn’t it contain notations about why it’s wrong?
I mean it might. In both screenshots it’s clearly visible that parts of the text are cut off. Why should we trust Twitter neonazis?
njm1314@lemmy.world 16 hours ago
You’re suggesting notes are at the end of the cutoff sections but not at the end of the ones we can see? Cuz there should be notes on the ones we can see. Unless you’re suggesting points one two four and five are correct…
PonyOfWar@pawb.social 16 hours ago
So let’s assume the AI actually does have safety checks and will not display holocaust denial arguments without pointing out why they’re wrong. Maybe initially it will put notes directly after the arguments. But no problem! Just tell it to list the denialist lies first and the clarifications after. Take some screenshots of just the first paragraphs and boom - you have screenshots showing the AI denying the holocaust.
My point is that it’s easy to manipulate AI output in a variety of ways to make it show whatever you want. That’s not even taking into consideration the possibility of just editing the HTML, which can be done in seconds. Once again, why should we trust a nazi?
auraithx@piefed.social 15 hours ago
All frontier models have safety checks that mean they won’t display these arguments regardless of prompt.