Look up the definition of the word cynical. It means, more or less, asserting that no one is motivated by sincere integrity. Accusing some specific people of lacking integrity, while holding up others as good examples of integrity that everyone should aspire to, is the opposite of cynicism.
He doesn’t address very much the idea that DeepSeek “distilled” their model from OpenAI’s model and others specifically because that is just a rumor with very minimal evidence for it.
OpenAI has reportedly found “evidence” that DeepSeek used OpenAI’s models to train its rivals, according to the Financial Times, although it failed to make any formal allegations, though it did say that using ChatGPT to train a competing model violates its terms of service. David Sacks, the investor and Trump Administration AI and Crypto czar, says “it’s possible” that this occurred, although he failed to provide evidence.
Personally, I genuinely want OpenAI to point a finger at DeepSeek and accuse it of IP theft, purely for the hypocrisy factor. This is a company that exists purely from the wholesale industrial larceny of content produced by individual creators and internet users, and now it’s worried about a rival pilfering its own goods?
Cry more, Altman, you nasty little worm.
The “rumors” you say he discusses about novel ways the Chinese researchers found to outperform OpenAI are based on an extremely detailed look at their paper and their code, as interpreted by experts. The thing you’re upset he doesn’t discuss is based on rumors. He doesn’t discuss it, except to note that it’s just a rumor but would be funny if it’s true, because he is not doing what you accuse him of.
If you’re upset that he was mean to Sam Altman, so much so that you simply don’t care if he also goes deep into a lot of important details and cares about integrity enough to hate a lot on people who don’t have it, then say so. The things you are accusing him of doing are not true, though, and pretty easy to disprove if you can look honestly at his work.
theComposer@beehaw.org 3 days ago
But it’s not just that “they effectively trained their model using OpenAI’s model”. The point Ed goes on to make is why hasn’t OpenAI done the same thing? The marvel of DeepSeek is how much more efficient it is, whereas Big Tech keeps insisting that they need ever bigger data centers.
masterspace@lemmy.ca 2 days ago
They HAVE done that. It’s one of the techniques they use to produce things like o1 mini models and the other mini models that run on device.
But that’s not a valid technique for creating new foundation models, just for creating refined versions of existing models. You would never have been able to create for instance, an o1 model from Chat PT 3.5 using distillation.