OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks.

⁨31⁩ ⁨likes⁩

Submitted ⁨⁨10⁩ ⁨months⁩ ago⁩ by ⁨101@feddit.org⁩ to ⁨videos@lemmy.world⁩

https://odysee.com/openai%E2%80%99s-new-%E2%80%9Cdeep-thinking%E2%80%9D-o1#e220fb2e8306522537e4a7af708f45c217de59aa

source

Comments

Sort:hotnew top

simple@lemm.ee ⁨10⁩ ⁨months⁩ ago
Outside of benchmarks it’s really not as big of a deal as openAI wants you to think it is. In most cases it’s slightly better than Claude, except it uses 50x the tokens repeating info to itself and is way slower. There are a lot of people that tried o1 online and are posting screenshots of it making basic mistakes or gaslighting itself in its chain of thought. Not to mention you only get 30 messages PER WEEK since it’s such a waste of energy.

It’s a desperate attempt by openAI to stay relevant.

source
- scrubbles@poptalk.scrubbles.tech ⁨10⁩ ⁨months⁩ ago
  I’ve learned all AI marketing is greatly embellished, focusing only on best cases. Sure, it can solve a coding problem, if you work on prompt for an hour, give excruciating detail, and run it a few dozen times
  
  source
just_another_person@lemmy.world ⁨10⁩ ⁨months⁩ ago
Eh. If it what they say it is, it’s still not going to replace a solid engineer. This is also still not capable of actually developing novel reasoning and logic solutions, so the threat of not being able to own the IP is still a thing. There’s also still this bullshit of OpenAI hinting that they want to retain rights to created works through certain licenses somehow?

Fuck these assholes up and down the aisle.

source
- Lauchs@lemmy.world ⁨10⁩ ⁨months⁩ ago
  Why?
  
  A half decade ago we would’ve laughed at a machine passing the Turing test…
  
  source
  - just_another_person@lemmy.world ⁨10⁩ ⁨months⁩ ago
    Why what?
    
    source
    -> View More Comments