• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar

Python and R Tips

Learn Data Science with Python and R

  • Home
  • Python
  • Pandas
    • Pandas 101
  • tidyverse
    • tidyverse 101
  • R
  • Linux
  • About
    • Privacy Policy

How to randomly sample letters in Python

April 15, 2022 by cmdlinetips

In this tutorial, we will learn how to randomly sample from letters or alphabets. Python’s random module has number of functions to generate random numbers from different distribution. We will first randomly sample single letter using random module’s choice() function and then randomly sample multiple letters using random module’s choices() function. Let us first load […]

Filed Under: Pandas 101 Tagged With: Python random.choice(), Python random.choices(), Python string.ascii_letters, randomly sample letters in Python

How to Replace Multiple Column Names of a Dataframe with tidyverse

March 1, 2022 by cmdlinetips

Of late, I am renaming column names of a dataframe a lot, in different flavors, in R using tidyverse. And every time I have to google it up :). Just came across, a really neat trick from Shannon Pileggi on twitter to replace multiple column names using deframe() function and !!! splice operator. Here is […]

Filed Under: R, R Tips, tidyverse 101 Tagged With: rename column names rename_with(), rename column names using deframe and !!!, rename multiple column names tidyverse

How to lump factors in Pandas

January 26, 2022 by cmdlinetips

Sometimes you would like to collapse least frequent values of a factor or character variable in to a new category “Other”. In R forcats library has a suit of functions for lumping the variables. This post contains a Pandas solution that can lump factors or values in three common ways. First, we will see how […]

Filed Under: Pandas 101, Python Tagged With: Pandas lump least frequent values, Pandas lump values present in small numbers

Barplots and Countplot with Seaborn’s catplot

January 20, 2022 by cmdlinetips

Seaborn countplot with bars in descending order

Love it or hate it, barplots are often useful in a quick exploratory data analysis to understand the variables in a dataset. In this post, we will see multiple examples on how to make barplots/countplot using Seaborn’s catplot() function. A couple of years ago Seaborn introduced catplot() function that provides a common framework to make […]

Filed Under: Python, Python Tips, Seaborn Tagged With: Seaborn barplot, Seaborn Catplot, Seaborn countplot

How to Replace NAs with column mean or row means with tidyverse

January 15, 2022 by cmdlinetips

Replace NAs with Column/Row Mean

Just a quick rstat post on a simple imputation approach here for the future self. SVD/PCA is one of the first things I do for analyzing any new high dimensional data. Often such data are messy and have some missing values. Depending on the situation, I often resort to removing the rows with missing data […]

Filed Under: R, R Tips, tidyverse 101 Tagged With: replace NA with column mean

  • « Go to Previous Page
  • Page 1
  • Interim pages omitted …
  • Page 10
  • Page 11
  • Page 12
  • Page 13
  • Page 14
  • Interim pages omitted …
  • Page 74
  • Go to Next Page »

Primary Sidebar

Subscribe to Python and R Tips and Learn Data Science

Learn Pandas in Python and Tidyverse in R

Tags

Altair Basic NumPy Book Review Data Science Data Science Books Data Science Resources Data Science Roundup Data Visualization Dimensionality Reduction Dropbox Dropbox Free Space Dropbox Tips Emacs Emacs Tips ggplot2 Linux Commands Linux Tips Mac Os X Tips Maximum Likelihood Estimation in R MLE in R NumPy Pandas Pandas 101 Pandas Dataframe Pandas Data Frame pandas groupby() Pandas select columns Pandas select_dtypes Python Python 3 Python Boxplot Python Tips R rstats R Tips Seaborn Seaborn Boxplot Seaborn Catplot Shell Scripting Sparse Matrix in Python tidy evaluation tidyverse tidyverse 101 Vim Vim Tips

RSS RSS

  • How to convert row names to a column in Pandas
  • How to resize an image with PyTorch
  • Fashion-MNIST data from PyTorch
  • Pandas case_when() with multiple examples
  • An Introduction to Statistical Learning: with Applications in Python Is Here
  • 10 Tips to customize ggplot2 title text
  • 8 Plot types with Matplotlib in Python
  • PCA on S&P 500 Stock Return Data
  • Linear Regression with Matrix Decomposition Methods
  • Numpy’s random choice() function

Copyright © 2026 · Lifestyle Pro on Genesis Framework · WordPress · Log in

Go to mobile version