Comment on Boffins detail new algorithms that boost AI perf up to 2.8x

Hirom@beehaw.org 1 week ago

There’s a GitHub issue to enable speculative decoding in Ollama.
