Technically, the technology is open to the public, but regular people cannot afford to implement it.
What makes Large Language Models even barely functional is scaling up the data and processing power behind each of several smaller models with specialized tasks. One model creates output from input, another checks it for accuracy, and a third polices it for things that are not allowed.
So unless you’ve got a datacenter and three high-powered servers with top-grade cooling systems and a military-grade power supply, fat fucking chance.
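For what it's worth, the three-stage setup described above (one model generates, one checks, one polices) can be sketched roughly like this. This is only a toy illustration: the function names, the blocked-word list, and the rejection messages are all made up for the example, and each function is a trivial stand-in for what would be a real model.

```python
# Toy sketch of a generate -> check -> police pipeline.
# Every function here is a hypothetical stand-in for a real model.

BLOCKED_WORDS = {"forbidden"}  # assumed example policy list


def generate(prompt: str) -> str:
    """Stage 1: create output from input (stand-in for the main model)."""
    return f"Answer to: {prompt}"


def check_accuracy(prompt: str, draft: str) -> bool:
    """Stage 2: toy accuracy check; passes only if the draft refers to the prompt."""
    return prompt in draft


def is_allowed(draft: str) -> bool:
    """Stage 3: toy policy filter; rejects drafts containing blocked words."""
    return not any(word in draft.lower() for word in BLOCKED_WORDS)


def pipeline(prompt: str) -> str:
    """Run the three stages in order and return the draft or a rejection note."""
    draft = generate(prompt)
    if not check_accuracy(prompt, draft):
        return "[rejected: failed accuracy check]"
    if not is_allowed(draft):
        return "[rejected: not allowed]"
    return draft


if __name__ == "__main__":
    print(pipeline("hello"))
    print(pipeline("something forbidden"))
```

The expensive part in practice would be whatever sits behind each of those three functions, not the glue code that chains them together.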
Linkerbaan@lemmy.world 8 months ago
Hugging Face, usually. Mixtral, recently released by Mistral AI, is a pretty good model that’s not very big.