Comment on Anyone in tech confirm?

<- View Parent
Unforeseen@sh.itjust.works ⁨1⁩ ⁨day⁩ ago

The efficiency improvements in some open models are becoming crazy, like hundreds of times from a year ago. I have a setup such as yours on my framework which can handle a 120b param model fully loaded. It’s capable of the RAG setup you are already envisioning.

source
Sort:hotnewtop