Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding

2 likes

Submitted 2 years ago by bot@lemmy.smeargle.fans [bot] to hackernews@lemmy.smeargle.fans

https://arxiv.org/abs/2404.08698
