Python and R Tips

dplyr groupby() and summarize(): Group By One or More Variables

August 31, 2020 by cmdlinetips

dplyr, is a R package provides that provides a great set of tools to manipulate datasets in the tabular form. dplyr has a set of core functions for “data munging”,including select(),mutate(), filter(), groupby() & summarise(), and arrange(). dplyr’s groupby() function is the at the core of Hadley Wickham’ Split-Apply-Combine paradigm useful for most common data […]

dplyr filter(): Filter/Select Rows based on conditions

August 21, 2020 by cmdlinetips

dplyr, R package that is at core of tidyverse suite of packages, provides a great set of tools to manipulate datasets in the tabular form. dplyr has a set of useful functions for “data munging”, including select(), mutate(), summarise(), and arrange() and filter(). And in this tidyverse tutorial, we will learn how to use dplyr’s […]

dplyr arrange(): Sort/Reorder by One or More Variables

August 11, 2020 by cmdlinetips

dplyr, R package part of tidyverse suite of packages, provides a great set of tools to manipulate datasets in the tabular form. dplyr has a set of core functions for “data munging”,including select(),mutate(), filter(), summarise(), and arrange(). And in this tidyverse tutorial, we will learn how to use dplyr’s arrange() function to sort a data […]

Pandas value_counts: How To Get Counts of Unique Variables in a Dataframe?

August 6, 2020 by cmdlinetips

As part of exploring a new data, often you might want to count unique values of one or more columns in a dataframe. Pandas value_counts() can get counts of unique values of columns in a Pandas dataframe. Starting from Pandas version 1.1.0, we can use value_counts() on a Pandas Series and dataframe as well. In […]

How To Compare Two Dataframes with Pandas compare?

August 2, 2020 by cmdlinetips

In this post, we will learn how to compare two Pandas dataframes and summarize their differences using Pandas compare() function. Sometimes you may have two similar dataframes and would like to know exactly what those differences are between the two data frames. Starting from Pandas 1.1.0 version, Pandas has a new function compare() that lets […]