Biggest threat to humanity
AGI achieved đ¤
Submitted â¨â¨3⊠â¨weeks⊠ago⊠by â¨cyrano@lemmy.dbzer0.com⊠to â¨[deleted]âŠ
https://lemmy.dbzer0.com/pictrs/image/7efced45-504a-4177-a992-a5a2ce0e8b6f.webp
Comments
VirgilMastercard@reddthat.com â¨3⊠â¨weeks⊠ago
idiomaddict@lemmy.world â¨3⊠â¨weeks⊠ago
I know thereâs no logic, but itâs funny to imagine itâs because itâs pronounced Mrs. Sippy
jaybone@lemmy.zip â¨3⊠â¨weeks⊠ago
And if it messed up on the other word, we could say because itâs pronounced Louisianer.
sp3ctr4l@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
I was gonna say something similar, I have heard a LOT of people pronounce Mississippi as if it does have an R in it.
merc@sh.itjust.works â¨3⊠â¨weeks⊠ago
How do you pronounce âMrsâ so that thereâs an ârâ sound in it?
cyrano@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
It is going to be funny those implementation of LLM in accounting software
RedstoneValley@sh.itjust.works â¨3⊠â¨weeks⊠ago
Itâs funny how people always quickly point out that an LLM wasnât made for this, and then continue to shill it for use cases it wasnât made for either (The âintelligenceâ part of AI, for starters)
UnderpantsWeevil@lemmy.world â¨3⊠â¨weeks⊠ago
LLM wasnât made for this
Thereâs a thought experiment that challenges the concept of cognition, called The Chinese Room. What it essentially postulates is a conversation between two people, one of whom is speaking Chinese and getting responses in Chinese. And the first speaker wonders âDoes my conversation partner really understand what Iâm saying or am I just getting elaborate stock answers from a big library of pre-defined replies?â
The LLM is literally a Chinese Room. And one way we can know this is through these interactions. The machine isnât analyzing the fundamental meaning of what Iâm saying, it is simply mapping the words Iâve input onto a big catalog of responses and giving me a standard output. In this case, the problem the machine is running into is a legacy meme about people miscounting the number of "r"s in the word Strawberry. So â2â is the stock response it knows via the meme reference, even though a much simpler and dumber machine that was designed to handle this basic input question could have come up with the answer faster and more accurately.
When you hear people complain about how the LLM âwasnât made for thisâ, what theyâre really complaining about is their own shitty methodology. They build a glorified card catalog. A device that can only take inputs, feed them through a massive library of responses, and sift out the highest probability answer without actually knowing what the inputs or outputs signify cognitively.
Even if you want to argue that having a natural language search engine is useful (damn, wish we had a tool that did exactly this back in August of 1996, amirite?), the implementation of the current iteration of these tools is dogshit because the developers did a dogshit job of sanitizing and rationalizing their library of data.
Imagine asking a librarian âWhat was happening in Los Angeles in the Summer of 1989?â and that person fetching you back a stack of history textbooks, a stack of Sci-Fi screenplays, a stack of regional newspapers, and a stack of Iron-Man comic books all given equal weight? Imagine hearing the plot of the Terminator and Escape from LA intercut with local elections and the Loma Prieta earthquake.
Thatâs modern LLMs in a nutshell.
jsomae@lemmy.ml â¨3⊠â¨weeks⊠ago
Youâve missed something about the Chinese Room. The solution to the Chinese Room riddle is that it is not the person in the room but rather the room itself that is communicating with you. The fact that thereâs a person there is irrelevant, and they could be replaced with a speaker or computer terminal.
Put differently, itâs not an indictment of LLMs that they are merely Chinese Rooms, but rather one should be impressed that the Chinese Room is so capable despite being a completely deterministic machine.
If one day we discover that the human brain works on much simpler principles than we once thought, would that make humans any less valuable? It should be deeply troubling to us that LLMs can do so much while the mathematics behind them are so simple. Arguments that because LLMs are just scaled-up autocomplete they surely canât be very good at anything are not comforting to me at all.
shalafi@lemmy.world â¨3⊠â¨weeks⊠ago
You might just love Blind Sight. Here, theyâre trying to decide if an alien life form is sentient or a Chinese Room:
âTell me more about your cousins,â Rorschach sent.
âOur cousins lie about the family tree,â Sascha replied, âwith nieces and nephews and Neandertals. We do not like annoying cousins.â
âWeâd like to know about this tree.â
Sascha muted the channel and gave us a look that said Could it be any more obvious? âIt couldnât have parsed that. There were three linguistic ambiguities in there. It just ignored them.â
âWell, it asked for clarification,â Bates pointed out.
âIt asked a follow-up question. Different thing entirely.â
Bates was still out of the loop. Szpindel was starting to get it, though⌠.
RedstoneValley@sh.itjust.works â¨3⊠â¨weeks⊠ago
Thatâs a very long answer to my snarky little comment :) I appreciate it though. Personally, I find LLMs interesting and Iâve spent quite a while playing with them. But after all they are like you described, an interconnected catalogue of random stuff, with some hallucinations to fill the gaps. They are NOT a reliable source of information or general knowledge or even safe to use as an âassistantâ. The marketing of LLMs as being fit for such purposes is the problem. Humans tend to turn off their brains and to blindly trust technology, and the tech companies are encouraging them to do so by making false promises.
frostysauce@lemmy.world â¨3⊠â¨weeks⊠ago
(damn, wish we had a tool that did exactly this back in August of 1996, amirite?)
Wait, what was going on in August of '96?
outhouseperilous@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Yes but have you considered that it agreed with me so now i need to defend it to the death against you horrible apes, no matter the allegation or terrain?
Knock_Knock_Lemmy_In@lemmy.world â¨3⊠â¨weeks⊠ago
a much simpler and dumber machine that was designed to handle this basic input question could have come up with the answer faster and more accurately
The human approach could be to write a (python) program to count the number of characters precisely.
When people refer to agents, is this what they are supposed to be doing? Is it done in a generic fashion or will it fall over with complexity?
merc@sh.itjust.works â¨3⊠â¨weeks⊠ago
Imagine asking a librarian âWhat was happening in Los Angeles in the Summer of 1989?â and that person fetching you ⌠Thatâs modern LLMs in a nutshell.
I agree, but I think youâre still being too generous to LLMs. A librarian who fetched all those things would at least understand the question. An LLM is just trying to generate words that might logically follow the words you used.
IMO, one of the key ideas with the Chinese Room is that thereâs an assumption that the computer / book in the Chinese Room experiment has infinite capacity in some way. So, no matter what symbols are passed to it, it can come up with an appropriate response. But, obviously, while LLMs are incredibly huge, they can never be infinite. As a result, they can often be âfooledâ when theyâre given input that semantically similar to a meme, joke or logic puzzle. The vast majority of the training data that matches the input is the meme, or joke, or logic puzzle. LLMs canât reason so they canât distinguish between âthis is just a rephrasing of that memeâ and âthis is similar to that meme but distinct in an important wayâ.
Leet@lemmy.zip â¨3⊠â¨weeks⊠ago
Can we say for certain that human brains arenât sophisticated Chinese roomsâŚ
REDACTED@infosec.pub â¨3⊠â¨weeks⊠ago
There are different types of Artificial intelligences. Counter-Strike 1.6 bots, by definition, were AI. They even used deep learning to figure out new maps.
ouRKaoS@lemmy.today â¨3⊠â¨weeks⊠ago
If you want an even older example, the ghosts in Pac-Man could be considered AI as well.
BarrelAgedBoredom@lemm.ee â¨3⊠â¨weeks⊠ago
Itâs marketed like its AGI, so we should treat it like AGI to show that it isnât AGI. Lots of people buy the bullshit
Knock_Knock_Lemmy_In@lemmy.world â¨3⊠â¨weeks⊠ago
AGI is only a benchmark because it gets OpenAI out of a contract with Microsoft when it occurs.
merc@sh.itjust.works â¨3⊠â¨weeks⊠ago
You can even drop the âaâ and âgâ. There isnât even âintelligenceâ here. Itâs not thinking, itâs just spicy autocomplete.
merc@sh.itjust.works â¨3⊠â¨weeks⊠ago
then continue to shill it for use cases it wasnât made for either
The only thing it was made for is âspicy autocompleteâ.
jsomae@lemmy.ml â¨3⊠â¨weeks⊠ago
Turns out spicy autocomplete can contribute to the bottom line. Capitalism :(
SoftestSapphic@lemmy.world â¨3⊠â¨weeks⊠ago
Maybe they should call it what it is
Machine Learning algorithms from 1990 repackaged and sold to us by marketing teams.
outhouseperilous@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Hey now, thatâs unfair and queerphobic.
These models are from 1950, with juiced up data sets. Alan turing personally sid a lot of work on them, before he cracked the math and figured out they were shit and would always be shit.
jsomae@lemmy.ml â¨3⊠â¨weeks⊠ago
Machine learning algorithm from 2017, scaled up a few orders of magnitude so that it finally more or less works, then repackaged and sold by marketing teams.
Gladaed@feddit.org â¨3⊠â¨weeks⊠ago
Fair point, but a big part of âintelligenceâ tasks are memorization.
BussyCat@lemmy.world â¨3⊠â¨weeks⊠ago
Computers for all intents are purposes have perfect recall so since it was trained on a large data set it would have much better intelligence. But in reality what we consider intelligence is extrapolating from existing knowledge which is what âAIâ has shown to be pretty shit at
outhouseperilous@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
I would say more âblackpillingâ, i genuinely donât believe most humans are people anymore.
besselj@lemmy.ca â¨3⊠â¨weeks⊠ago
burgerpocalyse@lemmy.world â¨3⊠â¨weeks⊠ago
teamwork makes the teamwork makes the teamwork makes the teamwork makes the teamwork makes the teamwork makes the teamwork makes the
Emi@ani.social â¨3⊠â¨weeks⊠ago
The end is never the end The end is never the end The end is never the end The end is never the end The end is never the end The end is never the end The end is never the end The end is never the end
sp3ctr4l@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
weamwork is my new favorite word, ahahah!
kungen@feddit.nu â¨3⊠â¨weeks⊠ago
Youâre asking about a double-U. Double means two. I think AI reasoned completely correct.
UrPartnerInCrime@sh.itjust.works â¨3⊠â¨weeks⊠ago
cashsky@sh.itjust.works â¨3⊠â¨weeks⊠ago
What is that font broâŚ
UrPartnerInCrime@sh.itjust.works â¨3⊠â¨weeks⊠ago
Its called sweetpea and my sweatpea picked it out for me. How dare I stick with something my girl picked out for me.
But the fact that you actually care what font someone else uses is sad
lordbritishbusiness@lemmy.world â¨3⊠â¨weeks⊠ago
One of the interesting things I notice about the âreasoningâ models is their responses to questions occasionally include what my monkey brain perceives as âsassâ.
I wonder sometimes if they recognise the trivialness of some of the prompts they answer, and subtilly throw shade.
Oneâs going to respond to this with âclever monkey! đ Have a banana đ.â
ynthrepic@lemmy.world â¨3⊠â¨weeks⊠ago
Nice Rs.
nyamlae@lemmy.world â¨3⊠â¨weeks⊠ago
Is this ChatGPT o3-pro?
UrPartnerInCrime@sh.itjust.works â¨3⊠â¨weeks⊠ago
ChatGPT 4o
ICastFist@programming.dev â¨3⊠â¨weeks⊠ago
Now ask how many asses there are in assassinations
notdoingshittoday@lemmy.zip â¨3⊠â¨weeks⊠ago
LodeMike@lemmy.today â¨3⊠â¨weeks⊠ago
Man AI is ass at this
*laugh track*
Rin@lemm.ee â¨3⊠â¨weeks⊠ago
qx128@lemmy.world â¨3⊠â¨weeks⊠ago
I really like checking these myself to make sure itâs true. I WAS NOT DISAPPOINTED!
(Total Rs is 8. But the LOGIC ChatGPT pulls out is âŚâŚ. remarkable!)
Zacryon@feddit.org â¨3⊠â¨weeks⊠ago
âLet me know if youâd like help counting letters in any other fun words!â
Oh well, these newish calls for engagement sure take on ridiculous extents sometimes.
scholar@lemmy.world â¨3⊠â¨weeks⊠ago
ipitco@lemmy.super.ynh.fr â¨3⊠â¨weeks⊠ago
Try with o4-mini-high. Itâs made to think like a human by checking its answer and doing step by step, rather than just kinda guessing one like here
AnUnusualRelic@lemmy.world â¨3⊠â¨weeks⊠ago
What is this devilry?
LanguageIsCool@lemmy.world â¨3⊠â¨weeks⊠ago
How many times do I have to spell it out for you chargpt? S-T-R-A-R-W-B-E-R-R-Y-R
MrLLM@ani.social â¨3⊠â¨weeks⊠ago
We gotta raise the bar, so they keep struggling to make it âbetterâ
My attempt
0000000000000000 0000011111000000 0000111111111000 0000111111100000 0001111111111000 0001111111111100 0001111111111000 0000011111110000 0000111111000000 0001111111100000 0001111111100000 0001111111100000 0001111111100000 0000111111000000 0000011110000000 0000011110000000
Btw, I refuse to give my money to AI bros, so I donât have the âlatest and greatestâipitco@lemmy.super.ynh.fr â¨3⊠â¨weeks⊠ago
Tested on ChatGPT o4-mini-high
0 0 0 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 0 0 1 1 1 0 0 0 0 0 0 0 1 1 1 0 0 0 0 1 1 1 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0
It sent me this
Korhaka@sopuli.xyz â¨3⊠â¨weeks⊠ago
I asked it how many Ts are in names of presidents since 2000. It said 4 and stated that âObamaâ contains 1 T.
TheOakTree@lemm.ee â¨3⊠â¨weeks⊠ago
Toebama
jsomae@lemmy.ml â¨3⊠â¨weeks⊠ago
People who think that LLMs having trouble with these questions is evidence one way or another about how good or bad LLMs are just donât understand tokenization. This is not a big-picture problem that indicates LLMs is deeply incapable. You may hate AI but that doesnât excuse being ignorant about how it works.
untorquer@lemmy.world â¨3⊠â¨weeks⊠ago
These sorts of artifacts wouldnât be a huge issue except that AI is being pushed to the general public as an alternative means of learning basic information. The meme example is obvious to someone with a strong understanding of English but learners and children might get an artifact and stamp it in their memory, working for years off bad information. Not a problem for a few false things every now and then, thatâs unavoidable in learning. Thousands accumulated over long term use, however, and your understanding of the world will be coarser, like the Swiss cheese with voids so large it canât hold itself up.
DmMacniel@feddit.org â¨3⊠â¨weeks⊠ago
We are fecking doomed!
abfarid@startrek.website â¨3⊠â¨weeks⊠ago
I get the meme aspect of this. But just to be clear, it was never fair to judge LLMs for specifically this. The LLM doesnât even see the letters in the words, as every word is broken down into tokens, which are numbers. I suppose with a big enough corpus of data it might eventually extrapolate which words have which letter from texts describing these words, but normally it shouldnât be expected.
loomy@lemy.lol â¨3⊠â¨weeks⊠ago
I donât get it
bitjunkie@lemmy.world â¨3⊠â¨weeks⊠ago
Deep reasoning is not needed to count to 3.
LMurch@thelemmy.club â¨3⊠â¨weeks⊠ago
AI is amazing, weâre so fucked.
/s
sheetzoos@lemmy.world â¨3⊠â¨weeks⊠ago
Honey, AI just did something new. Itâs time to move the goalposts again.
hornyalt@lemmynsfw.com â¨3⊠â¨weeks⊠ago
âA guy insteadâ
jsomae@lemmy.ml â¨3⊠â¨weeks⊠ago
When we see LLMs struggling to demonstrate an understanding of what letters are in each of the tokens that it emits or understand a word when there are spaces between each letter, we should compare it to a human struggling to understand a word written in IPA format (/sĘtĘ Éz ðɪs/) even though we can understand the word spoken aloud perfectly fine.
Echo5@lemmy.world â¨3⊠â¨weeks⊠ago
Maybe OP was low on the priority list for computing power? Idk how this stuff works
slaacaa@lemmy.world â¨3⊠â¨weeks⊠ago
Singularity is here
ZILtoid1991@lemmy.world â¨3⊠â¨weeks⊠ago
Reality:
The AI was trained to answer 3 to this question correctly.
Wait until the AI gets burned on a different question. Skeptics will rightfully use it to criticize LLMs for just being stochastic parrots, until LLM developers teach their models to answer it correctly, then the AI bros will use it as a proof it becoming âmore and more human likeâ.
RizzoTheSmall@lemm.ee â¨3⊠â¨weeks⊠ago
o3-pro? Damn, thatâs an expensive goof
cyrano@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Next step how many r in Lollapalooza
Image
sexy_peach@feddit.org â¨3⊠â¨weeks⊠ago
Incredible
And009@lemmynsfw.com â¨3⊠â¨weeks⊠ago
Agi lost
Qwazpoi@lemmy.world â¨3⊠â¨weeks⊠ago
Image
cyrano@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Tried it with o3 maybe it needs time to think đ
eager_eagle@lemmy.world â¨3⊠â¨weeks⊠ago
which model is it? I have a similar answer with 3.5, but 4o replies correctly
Image
altkey@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Apparently, this robot is japanese.
jballs@sh.itjust.works â¨3⊠â¨weeks⊠ago
Iâm going to hell for laughing at that
sp3ctr4l@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Obligatory âlore dumpâ on the word lollapalooza:
That word was a common term in the 1930s/40s American lingo that meant⌠essentially a very raucous, lively party.
Note/Rant on the meaning of this term
The current merriam webster and dictionary.com definitions of this term meaning âan outstanding or exceptional or extreme thingâ are wrong, they are too broad. While historical usage varied, it almost always appeared as a noun describing a gathering of many people, one that was so lively or spectacular that you would be exhausted after attending it. When it did not appear as a noun describing a lively party, it appeared as a term for some kind of action that would cause you to be bamboozled, discombobulated⌠similar to âthat was a real humdinger of a blahblahâ or âthat blahblah was a real doozyâ
So⌠in WW2, in the Pacific theatre⌠many US Marines were often engaged in brutal, jungle combat, and they adopted a system of basically verbal identification challenge checks if they noticed someone creeping up on their foxholes at night.
An example of this system used in the European theatre, I believe by the 101st and 82nd airborne, was the challenge âThunder!â to which the correct response was âFlash!â.
In the Pacific theatre⌠the Marines adopted a challenge / response system⌠where the correct response was âLolapaloozaââŚ
Because native born Japanese speakers are taught a phoneme that is roughly in between and ârâ and an âlâ ⌠and they very often struggle to say âLolapaloozaâ without a very noticable accent, unless theyâve also spent a good deal of time learning spoken English (or some other language with distinct âlâ and ârâ phonemes), which very few Japanese did in the 1940s.
::: racist and nsfw historical example of this
www.ep.tc/howtospotajap/howto06.html
:::
Now, some people will say this is a total myth, others will say it is not.
My Grandpa who served in the Pacific Theatre during WW2 told me it did happen, though he was Navy and not a Marine⌠but the stories about this Iâve always heard that say it did happen, they all say it happened with the Marines.
My Grandpa is also another source for what âlolapaloozaâ actually means.
resipsaloquitur@lemmy.world â¨3⊠â¨weeks⊠ago
en.wikipedia.org/wiki/Shibboleth
Iâve heard âsquirrelâ was used to trap Germans.
ICastFist@programming.dev â¨3⊠â¨weeks⊠ago
It does make sense to use a phoneme the enemy dialect lacks as a verbal check. Makes me wonder if there were any in the Pacific Theatre that decided for âLickâ and âLollipopâ.
altkey@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Iâm still puzzled by the idea of what mess this war was if at times you had someone still not clearly identifiable, but that close you can do a sheboleth check on them, and that at any moment you or the other could be shot dead.
Also, the current conflict of Russia vs Ukraine seems to invent ukrainian âпаНŃниŃаâ as a check, but as I had no connection to actual ukrainians and their UAF, I canât say if thatâs not entirely localized to the internet.
cyrano@lemmy.dbzer0.com â¨3⊠â¨weeks⊠ago
Thanks for sharing
don@lemm.ee â¨3⊠â¨weeks⊠ago
u delet ur account rn
Mwa@thelemmy.club â¨3⊠â¨weeks⊠ago
Image
With Reasoning (this is QWEN on hugginchat it says there is Zero)