Comment on Boffins detail new algorithms that boost AI perf up to 2.8x

Hirom@beehaw.org ⁨2⁩ ⁨months⁩ ago

There’s a Github issue to enable speculative decoding in Ollama.

source
Sort:hotnewtop