Comment on OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks.

<- View Parent
stephen01king@lemmy.zip ⁨1⁩ ⁨month⁩ ago

I think you’re completely wrong by still comparing skills that have no relation to each other. What’s the similarity between driving and coding that would require an LLM to be need to do one before you can believe it can do the other? Explain that leap in logic properly before you continue with your argument.

An LLM is designed to output text. Expecting them to drive to prove their ability to output code is like expecting them to dance to prove their ability to produce poems. It’s inability to do an unrelated skill has no bearing on it’s ability to do a different one. You’re basically judging a fish on its ability to walk on land, and using that as the basis to judge its ability to swim.

source
Sort:hotnewtop