Comment

Comment on ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic

<- View Parent

realitista@lemm.ee ⁨5⁩ ⁨months⁩ ago

There are a lot of people out there that think LLM’s are somehow reasoning. Even reasoning models aren’t really doing it. It important to do demonstrations like this in the hopes that the general public will understand the limitations of this tech.

source

Sort:hotnew top

theangriestbird@beehaw.org ⁨5⁩ ⁨months⁩ ago

It is important to do demonstrations like this in the hopes that the general public will understand the limitations of this tech.

THIS is the thing. The general public’s perception of ChatGPT is basically whatever OpenAI’s marketing department tells them to believe, plus their single memory of that one time they tested out ChatGPT and it was pretty impressive. Right now, OpenAI is telling everyone that they are a few years away from Artificial General Intelligence. Tests like this one demonstrate how wrong OpenAI is in that assertion.

source
- p03locke@lemmy.dbzer0.com ⁨5⁩ ⁨months⁩ ago
  It’s almost as bad as the opposition’s comparison of it to Skynet. People are never going to understand technology without applying some fucking nuance.
  
  Stop hyping new technology… in either direction.
  
  source
Photuris@lemmy.ml ⁨5⁩ ⁨months⁩ ago
[deleted]
source
- realitista@lemm.ee ⁨5⁩ ⁨months⁩ ago
  This is definitely part of the issue, not sure why people are downvoting this. That’s also why tests like this are important, to illustrate that thinking in the way we know it isn’t happening in these models.
  
  source
  - theangriestbird@beehaw.org ⁨5⁩ ⁨months⁩ ago
    
    not sure why people are downvoting this
    
    downvotes are not allowed on beehaw fyi
    
    source
    smeg@feddit.uk ⁨5⁩ ⁨months⁩ ago
    Downvotes aren’t federated but you still see all the downvotes sent from just your own instance
    
    source
    -> View More Comments
- jmcs@discuss.tchncs.de ⁨5⁩ ⁨months⁩ ago
  We understand reasoning enough to know humans (and other animals with complex brains) reason in a way that LLMs cannot.
  
  While our reasoning also works with pattern matching it incorporates immeasurably more signals than language - language is almost peripheric to it even in humans. And more importantly we experience things, everything we do acts as a small training round not just in language but on every aspect of the task we are performing, and gives us a miriad of patterns to match later.
  
  Until AI can match a fragment of this we are not going to have an AGI. And for the experience aspect there’s no economic incentive under capitalism to achieve, if it happens it will come out of an underfunded university.
  
  source
ByteSorcerer@beehaw.org ⁨5⁩ ⁨months⁩ ago
I think the problem is that, while the model isn’t actually reasoning, it’s very good at convincing people it actually is.

I see current LLMs kinda like an RPG character build with all ability points put into Charisma. It’s actually not that good at most tasks, but it’s so good at convincing people that they start to think it’s actually doing a great job.

source