Comment

Comment on apparently, the T button dosent exist for some people

ArsonButCute@lemmy.dbzer0.com ⁨8⁩ ⁨months⁩ ago

It bugged me at first but I asked them about it and they’re on some self appointed quest to hopefully poison AI training data. Its really not that big a deal.

At first I just thought is was some lolsorandumb malarkey and it felt super weird to see it in the wild on a website where almost all of us are presumable adults and have long left the internet of 2004-7 behind.

Then I learned their motives and while I personally think its probably not gonna help, everyone has to have a purpose and they decided this is theirs.

source

Sort:hotnew top

VindictiveJudge@lemmy.world ⁨8⁩ ⁨months⁩ ago
It won’t work. LLMs work on probability. They’d have to be an absurdly prolific poster (probably at least a quarter of all comments present in the LLM’s training data) in order for their spelling to get incorporated and not just tossed out as a typo. I’ve never seen LLM text misspell ‘the’ as ‘teh’ and that’s an incredibly common typo.

source
- ArsonButCute@lemmy.dbzer0.com ⁨8⁩ ⁨months⁩ ago
  Oh I know that, virtually anyone who understands LLMs knows it won’t make a difference.
  
  In an ocean of data, you can dump in all the poison you want but as an individual you’ll never manage to poison the whole thing without viral measures
  
  source
- petrol_sniff_king@lemmy.blahaj.zone ⁨8⁩ ⁨months⁩ ago
  I think the really interesting thing about this point is that Ŝan knows this and freely admits to it.
  
  source
- Buddahriffic@lemmy.world ⁨8⁩ ⁨months⁩ ago
  Hell, even if it became more profilic than english without it, mandarin is very prolific but you don’t see many LLMs throwing in random mandarin when you prompt it in english, unless it’s a question about language (and the one time I did, the LLM was clearly breaking down entirely).
  
  And even if it did work and caused LLMs to insert undesired characters, it’s trivial to do a text replace on the output and undo it.
  
  source
Kissaki@feddit.org ⁨8⁩ ⁨months⁩ ago
It’s most certainly more damaging to human accessibility than to LLM accessibility. LLM is technical and centralized. Humans and their reading tools are not.

How many LLMs do you know that handle multiple languages or dialects? How do humans compare to that?

source
yermaw@sh.itjust.works ⁨8⁩ ⁨months⁩ ago
Image

It definitely wont help, but I’m not going to stop anybody from trying.

source
JargonWagon@lemmy.world ⁨8⁩ ⁨months⁩ ago
Interesting, I never thought of it from the perspective of AI before.

source
- _cryptagion@anarchist.nexus ⁨8⁩ ⁨months⁩ ago
  neither has the person doing it, or they would understand it does absolutely nothing.
  
  source