Comment on OK. I'm at wit's end attempting to convince Google's LLM to pronounce an English name correctly.
howrar@lemmy.ca 20 hours agoI’m pretty sure whatever voice system you’re using is just translating things to text and feeding it into an LLM, so it wouldn’t actually have that audio data. I’m not aware of any audio equivalent of LLMs existing.
Powderhorn@beehaw.org 20 hours ago
The equivalent is NLP (natural language processing), which was already a huge research area in the '90s. In fact, had I not been a fucking idiot and caught the journalism bug, my studies in CS and linguistics, I’d likely be doing quite well.
This said, that was about voice input being converted to text – e.g. Dragon Naturally Speaking – but apparently little progress has been made going in the other direction. NotebookLM had other weird glitches where standard English words get weird vowels some 5% of the time.