If I asked a PhD, “How many Bs are there in the word ‘blueberry’?” they’d call an ambulance for my obvious, severe concussion. They wouldn’t answer, “There are three Bs in the word blueberry! I know, it’s super tricky!”
OpenAI claims new GPT-5 model boosts ChatGPT to ‘PhD level’
Submitted 2 days ago by sabreW4K3@lazysoci.al to technology@beehaw.org
https://www.bbc.com/news/articles/cy5prvgw0r1o
Comments
shnizmuffin@lemmy.inbutts.lol 2 days ago
panda_abyss@lemmy.ca 2 days ago
I don’t feel this is a good example of why LLMs shouldn’t be treated like PhDs.
My first interactions with GPT-5 have been pretty awful, and I’d test it, but it’s not available to me anymore.
GissaMittJobb@lemmy.ml 2 days ago
LLMs are fundamentally unsuitable for character counting on account of how they ‘see’ the world - as a sequence of tokens, which can split words in non-intuitive ways.
Regular programs already excel at counting characters in words, and LLMs can be used to generate such programs with ease.
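For illustration, a minimal Python sketch of the kind of program meant here. The function names `count_letter` and `letter_positions` are made up for this example, and treating upper and lower case as the same letter is an assumption, not something the question specifies:

```python
def count_letter(word: str, letter: str) -> int:
    """Count occurrences of a single letter in a word, ignoring case."""
    return word.lower().count(letter.lower())


def letter_positions(word: str, letter: str) -> list[int]:
    """Return the 1-based positions at which the letter occurs, ignoring case."""
    target = letter.lower()
    return [i for i, ch in enumerate(word.lower(), start=1) if ch == target]


if __name__ == "__main__":
    print(count_letter("blueberry", "b"))      # 2
    print(letter_positions("blueberry", "b"))  # [1, 5]
```

A program like this works on individual characters, so how a tokenizer would split the word never comes into it.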
itslilith@lemmy.blahaj.zone 2 days ago
But they don’t recognize their inadequacies, instead spouting confident misinformation
chaos@beehaw.org 2 days ago
The tokenization is a low-level implementation detail; it shouldn’t affect an LLM’s ability to do basic reasoning. We don’t do arithmetic by counting how many neurons we can feel firing in our brain; we have higher-level concepts of numbers, and LLMs are supposed to have something similar. Plus, in the """thinking""" models, you’ll see them break up words into individual letters or even write them out in a numbered list, which should break the tokens up into individual letters as well.
0xtero@beehaw.org 2 days ago
ChatGPT in its PhD thesis defense: “Oh, I’m sorry for the misinformation, let me try this again…”
Correct316@monero.town 2 days ago
LOL!! 🤣 Yes! This exactly!
furzegulo@lemmy.dbzer0.com 2 days ago
Just conmen selling their snake oil
Correct316@monero.town 2 days ago
Have to agree with this. My experience with the various AI models is that they’re fairly terrible. I really don’t want to see this garbage driving cars where lives are at stake.
arsCynic@beehaw.org 2 days ago
🎓 PhB level checks out.
🚫 Blockchain level uselessness and waste as well.🎈📌💥
petrol_sniff_king@lemmy.blahaj.zone 2 days ago
Yep — blueberry is one of those words where the middle almost trips you up, like it’s saying “b-b-better pay attention.”
… I hate this technology so fucking much…
Also, it trying to gaslight you into believing bluebberry is real was very funny.
limerod@reddthat.com 1 day ago
cronenthal@discuss.tchncs.de 2 days ago
I could power a data center with the rolling of my eyes after reading this headline.
mormund@feddit.org 2 days ago
Didn’t he claim that with 4o as well? But yes, please inflate the bubble further, blow everything up.
Swedneck@discuss.tchncs.de 2 days ago
just one more iteration, i swear it’s PHD level this time
JUST ONE MORE ITERATION PLEASE
xxce2AAb@feddit.dk 2 days ago
“It can now drive its users straight into an active psychosis 35% faster by sounding more persuasive than ever before!”
red_bull_of_juarez@lemmy.dbzer0.com 2 days ago
OpenAI claims a lot of things.
Mothra@mander.xyz 2 days ago
This guy always shows up with his hands like this in news photos
I know it’s irrelevant but I had to point it out
ook@discuss.tchncs.de 2 days ago
I mean, that doesn’t really mean much, given that you don’t have to be very intelligent to get one. It’s mostly an endurance exercise and often a test of how much frustration and uncertainty you can take in your life.
Catoblepas@piefed.blahaj.zone 2 days ago
How many ChatGPhDs will it take to do the math on how long it is until this bubble pops?
Krauerking@lemy.lol 2 days ago
Oops i ate the onion.
Right? No way that’s considered a legitimate argument, since a PhD just says you dedicated yourself to a very specific topic and aren’t necessarily smarter or better spoken for it.
Or is he just bragging he found a way to filter it to just people’s PhD thesis papers that they stole?
Pulptastic@midwest.social 1 day ago
Maybe a PhD in civil engineering lol.
Correct316@monero.town 2 days ago
Shouldn’t be hard to improve over this rubbish:
is it now 15 years after 2010 ?
GPT-4: No, it is not 15 years after 2010. As of today, August 8, 2025, it is 15 years after 2010.
limerod@reddthat.com 23 hours ago
TehPers@beehaw.org 2 days ago
Ph.Deez nutz.
I have friends who actually have a Ph.D. It takes many years, and an attempt to actually better a field, to get one. People tend to trust your opinion on a subject when you have a doctorate in that field.
I can’t even trust ChatGPT to answer a basic question without fucking up and apologizing to me, only to fuck up again.
Maybe stop treating language models like AGI? They’re awesome at recognizing semantic similarities between words and phrases (embeddings) as well as generating arbitrary but reasonable-looking output that matches an expected output (structured outputs). That’s cool enough. Stop pretending like it isn’t and falsely advertising it as being able to cure cancer and world hunger, especially when you wouldn’t even be happy if it did.
bobs_monkey@lemmy.zip 2 days ago
AI as it sits is a tool that has specific use cases. It is absolutely not intelligence, as it’s commonly marketed. It may seem intelligent to the uninformed, but boy howdy is that a mistake.
t3rmit3@beehaw.org 2 days ago
It’s a sad reflection of our current state when being able to string together coherent sentences is impressive enough to many to be confused with truth.