2 Ways to Randomly Sample Rows from a large CSV file

In this post, we will be learning how to randomly sample/select rows from a large CSV file that is either taking too long to load as a Pandas dataframe or can’t load at all. The key idea is to not to load the whole file as a Pandas dataframe. Instead, we use skiprows argument in… Continue reading 2 Ways to Randomly Sample Rows from a large CSV file