I don't mean to disparage pandas, which is a library that does a lot of things fairly well. But as an API for data manipulation I find it very verbose and it doesn't mesh with a "functional" way of thinking about applying transformations.

Generally, I've even preferred Spark to pandas, though it's hardly less verbose. Coming from R, pandas is much slower than data.table and nowhere near as slick and discoverable as dplyr. Its system of indices is a pain I'd rather not deal with at all (and, indeed, I can't think of another data frame library that relies on them). I hate finding CSVs that other data scientists have created from pandas, because they invariably include the index ...
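
For what it's worth, the fix on the writing side is a single keyword; a minimal sketch, with a made-up frame and file name:

    import pandas as pd

    df = pd.DataFrame({"a": [1, 2, 3]})
    df.to_csv("out.csv", index=False)  # no index column in the output file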

Handles time series really well, though.

Recently I've been using polars (https://github.com/pola-rs/polars). As an API I much, much prefer it to pandas, and it's a lot faster. Comes at the cost of not using numpy under the hood, so you can't just toss a polars data frame into a sklearn model.
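
The workaround is an explicit conversion via .to_numpy(); a rough sketch, with toy data, using sklearn just as an example consumer:

    import polars as pl
    from sklearn.linear_model import LogisticRegression

    df = pl.DataFrame({"x": [0.0, 1.0, 2.0, 3.0], "y": [0, 0, 1, 1]})
    X = df.select("x").to_numpy()  # materialize a numpy array for sklearn
    y = df["y"].to_numpy()
    model = LogisticRegression().fit(X, y)
    print(model.predict(X))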



Agreed on your major points.

That being said:

> I hate finding CSVs that other data scientists have created from pandas, because they invariably include the index ...

This is also the default in R, which writes out row numbers (as if I have ever needed them). To be fair, it's gotten better since people stopped putting important information in rownames.

Polars looks interesting, thanks for the recommendation!


> I hate finding CSVs that other data scientists

Ideally you should be using the Parquet format, which is binary and preserves column types and indexes: df.to_parquet(<file>) to write, df = pd.read_parquet(<file>) to read.

You can sidestep a lot of problems by simply avoiding text files.
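
A minimal round trip, assuming pyarrow (or fastparquet) is installed as the Parquet engine:

    import pandas as pd

    df = pd.DataFrame({"a": [1, 2, 3]}, index=pd.Index([10, 20, 30], name="id"))
    df.to_parquet("data.parquet")              # binary, typed, keeps the index
    restored = pd.read_parquet("data.parquet")
    assert restored.index.name == "id"         # index survives, unlike with CSV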



