Comment on Why would a UTF-8 MySQL backup contain invalid UTF-8 characters?

<- View Parent
bjoern_tantau@swg-empire.de ⁨2⁩ ⁨weeks⁩ ago

There is no such thing* as a UTF-8 file. It’s just text encoded in some way. It’s only a UTF-8 file if everything is encoded as UTF-8 which it’s evidently not.

You can even tell MySQL to export perfectly valid UTF-8 text encoded as ISO 8859-1 to import into a UTF-8 table without any troubles (maybe apart from stuff that could not be encoded in ISO 8859-1).

*Yes, technically there could be a BOM at the beginning but almost no tool uses that and most get confused by it. And it would still not force any data written to it to be UTF-8.

source
Sort:hotnewtop