Comment on DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
IndeterminateName@beehaw.org 1 week ago
A bit like a syllable when you are talking about text-based responses. 20 tokens a second is faster than most people could read the output, so that's sufficient for a real-time-feeling "chat".
SteevyT@beehaw.org 1 week ago
Huh, yeah, that actually is above my reading speed, assuming 1 token = 1 word. Although I've found that anything above 100 words per minute, while slow to read, feels real-time to me, since that's about the absolute top end of what most people type.
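A quick sketch of the arithmetic behind this thread. The 0.75 words-per-token ratio is a common rule of thumb for English text, not something stated in the comments above, and the 1:1 assumption is the one the commenter used:

```python
# Convert a generation rate in tokens/second into an approximate
# words-per-minute figure, for comparison with human reading speed.
# Assumption (not from the thread): ~0.75 words per token is a common
# rule of thumb for English; 1.0 matches the commenter's 1 token = 1 word.

def tokens_per_sec_to_wpm(tokens_per_sec: float, words_per_token: float = 0.75) -> float:
    """Approximate reading-equivalent words per minute."""
    return tokens_per_sec * words_per_token * 60

# 20 tokens/s, as reported for DeepSeek-V3 on a Mac Studio:
print(tokens_per_sec_to_wpm(20))       # 900.0 wpm with the 0.75 rule of thumb
print(tokens_per_sec_to_wpm(20, 1.0))  # 1200.0 wpm at 1 token = 1 word
```

Either way the output rate lands well above a typical silent reading speed of roughly 200-300 wpm, which is why 20 tokens/s feels real-time in a chat.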