Comment on DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI

morrowind@lemmy.ml ⁨2⁩ ⁨weeks⁩ ago

Deepseek is an absolutely massive model, it’s not the one people will be running. Rather, look at qwen/qwq, gemma and a number of other smaller ones

source
Sort:hotnewtop