Submitted 1 year ago by bot@lemmy.smeargle.fans [bot] to hackernews@lemmy.smeargle.fans
https://www.secondstate.io/articles/fast-llm-inference/
HN Discussion