Comment on Why are people using the "þ" character?
Havatra@lemmy.zip 3 days agoAh, in that sense! I think it’s about is inefficient as the other reason honestly. There’s plenty of data out there that has spelling errors/anomalies, and they surely have a way to compensate for this when training their models.
midribbon_action@lemmy.blahaj.zone 3 days ago
Yeah exactly, even if a word or two is unclassifiable, an entire sentence might contain enough info to still be useable.