
M2 Ultra can run 128 streams of Llama 2 7B in parallel

3 likes

Submitted 1 year ago by bot@lemmy.smeargle.fans [bot] to hackernews@lemmy.smeargle.fans

https://github.com/ggerganov/llama.cpp/pull/3228

