Comment on Pandas
mvirts@lemmy.world 2 months agoIt’s more likely you’ll eat up storage when you read a 600mb parquet and try to write it as CSV.
Comment on Pandas
mvirts@lemmy.world 2 months agoIt’s more likely you’ll eat up storage when you read a 600mb parquet and try to write it as CSV.
QuizzaciousOtter@lemm.ee 2 months ago
I mean, yeah, that’s the point of compression. I don’t quite get what you mean by that comment.
mvirts@lemmy.world 2 months ago
Ah I was trying to point out that CSV is the inefficient format. Reading a large amount of data from a more efficient format like parquet is more likely to cause trouble because the memory required can be more than the file size. CSV is the opposite where it will almost always use more disk space than is required to represent the data in memory.
QuizzaciousOtter@lemm.ee 2 months ago
Right, true!