• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar

Python and R Tips

Learn Data Science with Python and R

  • Home
  • Python
  • Pandas
    • Pandas 101
  • tidyverse
    • tidyverse 101
  • R
  • Linux
  • About
    • Privacy Policy
You are here: Home / Archives for Pandas 101

Pandas 101

2 Ways to Randomly Sample Rows from a large CSV file

July 10, 2022 by cmdlinetips

In this post, we will be learning how to randomly sample/select rows from a large CSV file that is either taking too long to load as a Pandas dataframe or can’t load at all. The key idea is to not to load the whole file as a Pandas dataframe. Instead, we use skiprows argument in […]

Filed Under: Pandas 101, Python Tips Tagged With: Python sample a large CSV file

Pandas Cumulative Sum by Group

July 8, 2022 by cmdlinetips

Cumulative sum by a group in Pandas

In this tutorial as part of our Pandas 101 series, we will learn how to compute cumulative sum of a column based on values from a grouping column in Pandas dataframe. Pandas cumsum() function can compute cumulative sum over a DataFrame, In this example we are interested getting cumulative sum of just one column by […]

Filed Under: Pandas 101, Python Tips Tagged With: Cumulative sum by group, Pandas cumsum()

Pandas pipe function in Pandas: performing PCA

June 15, 2022 by cmdlinetips

Pandas Pipe method to perform PCA

Pandas pipe function can help us chain together functions that takes either dataframe or series as input. In this introductory tutorial, we will learn how to use Pandas pipe method to simplify code for data analysis. We start with a dataframe as input and do a series of analysis such that that each step takes […]

Filed Under: Pandas 101, Python Tips Tagged With: Pandas pipe method, PCA example using Pandas Pipe Method

How to Select Columns/Rows by substring match in Pandas

May 24, 2022 by cmdlinetips

In this post, we will learn how to select columns of a Pandas dataframe or a rows of a dataframe based on substring match in Pandas. We will use Pandas filter() function with argument “like” to select columns/rows, whose names partially match with a string of interest. Let us load the necessary modules. We are […]

Filed Under: Pandas 101, Python Tips Tagged With: Pandas select based on substring match

How to randomly sample letters in Python

April 15, 2022 by cmdlinetips

In this tutorial, we will learn how to randomly sample from letters or alphabets. Python’s random module has number of functions to generate random numbers from different distribution. We will first randomly sample single letter using random module’s choice() function and then randomly sample multiple letters using random module’s choices() function. Let us first load […]

Filed Under: Pandas 101 Tagged With: Python random.choice(), Python random.choices(), Python string.ascii_letters, randomly sample letters in Python

  • « Go to Previous Page
  • Page 1
  • Page 2
  • Page 3
  • Page 4
  • Interim pages omitted …
  • Page 13
  • Go to Next Page »

Primary Sidebar

Subscribe to Python and R Tips and Learn Data Science

Learn Pandas in Python and Tidyverse in R

Tags

Altair Basic NumPy Book Review Data Science Data Science Books Data Science Resources Data Science Roundup Data Visualization Dimensionality Reduction Dropbox Dropbox Free Space Dropbox Tips Emacs Emacs Tips ggplot2 Linux Commands Linux Tips Mac Os X Tips Maximum Likelihood Estimation in R MLE in R NumPy Pandas Pandas 101 Pandas Dataframe Pandas Data Frame pandas groupby() Pandas select columns Pandas select_dtypes Python Python 3 Python Boxplot Python Tips R rstats R Tips Seaborn Seaborn Boxplot Seaborn Catplot Shell Scripting Sparse Matrix in Python tidy evaluation tidyverse tidyverse 101 Vim Vim Tips

RSS RSS

  • How to convert row names to a column in Pandas
  • How to resize an image with PyTorch
  • Fashion-MNIST data from PyTorch
  • Pandas case_when() with multiple examples
  • An Introduction to Statistical Learning: with Applications in Python Is Here
  • 10 Tips to customize ggplot2 title text
  • 8 Plot types with Matplotlib in Python
  • PCA on S&P 500 Stock Return Data
  • Linear Regression with Matrix Decomposition Methods
  • Numpy’s random choice() function

Copyright © 2026 · Lifestyle Pro on Genesis Framework · WordPress · Log in

Go to mobile version