Comment on Are there any AI services that don't work on stolen data?
Pamasich@kbin.earth 1 week ago
Switzerland announced a new LLM project which might be of interest here.
Here's a German article on it. If you're okay with a Reddit link, here's a translation.
Some points on it:
- fully open source in its entirety — source code, model weights, and training data will all be publically released.
- licensed under Apache 2.0
- compliant with Swiss data protection laws, copyright law, and the EU AI act
- respects crawler opt-outs on websites
While nothing there explicitly says the data is ethically sourced, we'll be able to tell based on the opensource training data, and I assume copyright law takes care of stuff like books being used (though idk if the AI has a way to determine the license of web content, or if it fully relies on opt-outs there).