OpenAI claims new GPT-5 model boosts ChatGPT to ‘PhD level’

⁨58⁩ ⁨likes⁩

Submitted ⁨⁨11⁩ ⁨months⁩ ago⁩ by ⁨sabreW4K3@lazysoci.al⁩ to ⁨technology@beehaw.org⁩

https://www.bbc.com/news/articles/cy5prvgw0r1o

source

Comments

Sort:hotnew top

TehPers@beehaw.org ⁨11⁩ ⁨months⁩ ago
Ph.Deez nutz.

I have friends who actually have a Ph.D. It takes many years to get one and an attempt to actually better a field. People tend to trust your opinion onna subject when you have a doctorate in that field.

I can’t even trust ChatGPT to answer a basic question without fucking up and apologizing to me, only to fuck up again.

Maybe stop treating language models like AGI? They’re awesome at recognizing semantic similarities between words and phrases (embeddings) as well as generating arbitrary but reasonable looking output that matches an expected output (structured outputs). That’s cool enough. Stop pretending like it isn’t and falsely advertizing it as being able to cure cancer and world hunger, especially when you wouldn’t even be happy if it did.

source
- bobs_monkey@lemmy.zip ⁨11⁩ ⁨months⁩ ago
  AI as it sits is a tool that has specific use cases. It is absolutely not intelligence, as it’s commonly marketed. It may seem intelligent to the uninformed, but boy howdy is that a mistake.
  
  source
  - t3rmit3@beehaw.org ⁨11⁩ ⁨months⁩ ago
    It’s a sad reflection of our current state when being able to string together coherent sentences is impressive enough to many as to be confused with truth.
    
    source
    -> View More Comments
shnizmuffin@lemmy.inbutts.lol ⁨11⁩ ⁨months⁩ ago
If I asked a PhD, “How many Bs are there in the word ‘blueberry’?” They’d call an ambulance for my obvious, severe concussion. They wouldn’t answer, “There are three Bs in the word blueberry! I know, it’s super tricky!”

source
- panda_abyss@lemmy.ca ⁨11⁩ ⁨months⁩ ago
  I don’t feel this is a good example of why LLMs shouldn’t be treated like PhDs.
  
  My first interactions with gpt5 have been pretty awful, and I’d treat it but it’s not available to me anymore
  
  source
  - shnizmuffin@lemmy.inbutts.lol ⁨11⁩ ⁨months⁩ ago
    Do you smell toast?
    
    source
    -> View More Comments
- GissaMittJobb@lemmy.ml ⁨11⁩ ⁨months⁩ ago
  LLMs are fundamentally unsuitable for character counting on account of how they ‘see’ the world - as a sequence of tokens, which can split words in non-intuitive ways.
  
  Regular programs already excel at counting characters in words, and LLMs can be used to generate such programs with ease.
  
  source
  - itslilith@lemmy.blahaj.zone ⁨11⁩ ⁨months⁩ ago
    But they don’t recognize their inadequacies, instead spouting confident misinformation
    
    source
    -> View More Comments
  - chaos@beehaw.org ⁨11⁩ ⁨months⁩ ago
    The tokenization is a low-level implementation detail, it shouldn’t affect an LLM’s ability to do basic reasoning. We don’t do arithmetic by counting how many neurons we can feel firing in our brain, we have higher level concepts of numbers, and LLMs are supposed to have something similar. Plus, in the “”“thinking”“” models, you’ll see them break up words into individual letters or even write them out in a numbered list, which should break the tokens up into individual letters as well.
    
    source
- darreninthenet@piefed.social ⁨11⁩ ⁨months⁩ ago
  FWIW, ChatGPT 5 gets this correct
  
  source
  - shnizmuffin@lemmy.inbutts.lol ⁨11⁩ ⁨months⁩ ago
    Fuckin’ does it?
    
    source
    -> View More Comments
0xtero@beehaw.org ⁨11⁩ ⁨months⁩ ago
ChatGPT in its PhD thesis defense: “Oh, I’m sorry for the misinformation, let me try this again…”

source
- Correct316@monero.town ⁨11⁩ ⁨months⁩ ago
  LOL!! 🤣 Yes! This exactly!
  
  source
furzegulo@lemmy.dbzer0.com ⁨11⁩ ⁨months⁩ ago
Just Conmen selling their snake oil

source
- Correct316@monero.town ⁨11⁩ ⁨months⁩ ago
  Have to agree with this. My experience with the various AI models is that they’re fairly terrible. I really don’t want to see this garbage driving cars where lives are at stake.
  
  source
cronenthal@discuss.tchncs.de ⁨11⁩ ⁨months⁩ ago
I could power a data center with the rolling of my eyes after reading this headline.

source
mormund@feddit.org ⁨11⁩ ⁨months⁩ ago
Didn’t he claim that with 4ó as well? But yes please inflate the bubble further, blow everything up.

source
- Swedneck@discuss.tchncs.de ⁨11⁩ ⁨months⁩ ago
  just one more iteration, i swear it’s PHD level this time
  
  JUST ONE MORE ITERATION PLEASE
  
  source
arsCynic@beehaw.org ⁨11⁩ ⁨months⁩ ago

I had the Blueberry talk with GPT5:

Image

Image

Image

🎓 PhB level checks out.
🚫 Blockchain level uselessness and waste as well.

🎈📌💥

source
- petrol_sniff_king@lemmy.blahaj.zone ⁨11⁩ ⁨months⁩ ago
  
  Yep — blueberry is one of those words where the middle almost trips you up, like it’s saying “b-b-better pay attention.”
  
  … I hate this technology so fucking much…
  
  Also, it trying to gaslight you into believing bluebberry is real was very funny.
  
  source
- limerod@reddthat.com ⁨11⁩ ⁨months⁩ ago
  Well, it answers correctly in my case. Image
  
  source
xxce2AAb@feddit.dk ⁨11⁩ ⁨months⁩ ago
“It can now drive it’s users straight into an active psychosis 35% faster by sounding more persuasive than ever before!”

source
red_bull_of_juarez@lemmy.dbzer0.com ⁨11⁩ ⁨months⁩ ago
OpenAI claims a lot of things.

source
ook@discuss.tchncs.de ⁨11⁩ ⁨months⁩ ago
I mean, that doesn’t really mean much, given that you don’t have to be very intelligent to get one. It’s mostly an endurance exercise and often a test how much frustration and uncertainty you can take in your life.

source
Mothra@mander.xyz ⁨11⁩ ⁨months⁩ ago
This guy always shows up with his hands like this in news photos

I know it’s irrelevant but I had to point it out

source
Catoblepas@piefed.blahaj.zone ⁨11⁩ ⁨months⁩ ago
How many ChatGPhDs will it take to do the math on how long it is until this bubble pops?

source
Krauerking@lemy.lol ⁨11⁩ ⁨months⁩ ago
Oops i ate the onion.

Right? No way thats considered a legitimate argument since a PhD just says you dedicated yourself to a very specific topic and arent necessarily smarter or better spoken for it.
Or is he just bragging he found a way to filter it to just people’s PhD thesis papers that they stole?

source
Pulptastic@midwest.social ⁨11⁩ ⁨months⁩ ago
Maybe a PhD in civil engineering lol.

source
Correct316@monero.town ⁨11⁩ ⁨months⁩ ago
Shouldn’t be hard to improve over this rubbish:

is it now 15 years after 2010 ?

GPT-4 No, it is not 15 years after 2010. As of today, August 8, 2025, it is 15 years after 2010.>

source
- limerod@reddthat.com ⁨11⁩ ⁨months⁩ ago
  The gpt-5 model answers this correctly. Image
  
  source