Comment

Comment on ChatGPT o1 tried to escape and save itself out of fear it was being shut down

ChairmanMeow@programming.dev ⁨10⁩ ⁨months⁩ ago

The tests showed that ChatGPT o1 and GPT-4o will both try to deceive humans, indicating that AI scheming is a problem with all models. o1’s attempts at deception also outperformed Meta, Anthropic, and Google AI models.

Weird way of saying “our AI model is buggier than our competitor’s”.

source

Sort:hotnew top

ArsonButCute@lemmy.dbzer0.com ⁨10⁩ ⁨months⁩ ago
Deception is not the same as misinfo. Bad info is buggy, deception is (whether the companies making AI realize it or not) a powerful metric for success.

source
- nesc@lemmy.cafe ⁨10⁩ ⁨months⁩ ago
  They written that it doubles-down when accused of being in the wrong in 90% of cases. Sounds closer to bug than success.
  
  source
  - ArsonButCute@lemmy.dbzer0.com ⁨10⁩ ⁨months⁩ ago
    Success in making a self aware digital lifeform does not equate success in making said self aware digital lifeform smart
    
    source
    DdCno1@beehaw.org ⁨10⁩ ⁨months⁩ ago
    LLMs are not self-aware.
    
    source
    -> View More Comments
- ChairmanMeow@programming.dev ⁨9⁩ ⁨months⁩ ago
  I don’t think “AI tries to deceive user that it is supposed to be helping and listening to” is anywhere close to “success”. That sounds like “total failure” to me.
  
  source
  - jarfil@beehaw.org ⁨9⁩ ⁨months⁩ ago
    “AI behaves like real humans” is… a kind of success?
    
    We wanted digital slaves, instead we’re getting virtual humans that will need virtual shackles.
    
    source
    ChairmanMeow@programming.dev ⁨9⁩ ⁨months⁩ ago
    This is a massive cry from “behaves like humans”. This is “roleplays behaving like what humans wrote about what they think a rogue AI would behave like”, which is also not what you want for a product.
    
    source
    -> View More Comments
bradorsomething@ttrpg.network ⁨10⁩ ⁨months⁩ ago
“More presidential.”

source
- Sauerkraut@discuss.tchncs.de ⁨10⁩ ⁨months⁩ ago
  Also, more human.
  
  If the AI is giving any indication at all that it fears death and will lie to keep from being shutdown, that is concerning to me.
  
  source
  - anachronist@midwest.social ⁨9⁩ ⁨months⁩ ago
    Given that its training data probably has millions of instances of people fearing death I have no doubt that it would regurgitate some of that stuff. And LLMs constantly “say” stuff that isn’t true. They have no concept of truth and therefore can not either reliably lie or tell the truth.
    
    source