In a quite unexpected turn of events, it is claimed that OpenAI’s ChatGPT “got absolutely wrecked on the beginner level” while playing Atari Chess.
Who the hell thought this was “unexpected”?
What’s next? ChatGPT vs. Microwave to see which can make instant oatmeal the fastest? 😂
30p87@feddit.org 4 months ago
Anyone even believing that a generic word auto completer would beat classic algorithms wherever possible probably belongs into a psychiatry.
realitista@lemm.ee 4 months ago
There are a lot of people out there that think LLM’s are somehow reasoning. Even reasoning models aren’t really doing it. It important to do demonstrations like this in the hopes that the general public will understand the limitations of this tech.
theangriestbird@beehaw.org 4 months ago
THIS is the thing. The general public’s perception of ChatGPT is basically whatever OpenAI’s marketing department tells them to believe, plus their single memory of that one time they tested out ChatGPT and it was pretty impressive. Right now, OpenAI is telling everyone that they are a few years away from Artificial General Intelligence. Tests like this one demonstrate how wrong OpenAI is in that assertion.
Photuris@lemmy.ml 4 months ago
ByteSorcerer@beehaw.org 4 months ago
I think the problem is that, while the model isn’t actually reasoning, it’s very good at convincing people it actually is.
I see current LLMs kinda like an RPG character build with all ability points put into Charisma. It’s actually not that good at most tasks, but it’s so good at convincing people that they start to think it’s actually doing a great job.
jjjalljs@ttrpg.network 4 months ago
I think I remember some doge goon asking online about using an LLM to parse JSON. Many people don’t understand things.
Photuris@lemmy.ml 4 months ago
MadMadBunny@lemmy.ca 4 months ago
That’s too much critical thinking for most people