In this post we will see examples of how to drop rows of a dataframe based on values of one or more columns in Pandas. Pandas drop function makes it really easy to drop rows of a dataframe using index number or index names. We can use Pandas drop function to drop rows and columns […]
Pandas groupby: 13 Functions To Aggregate
Pandas groupby function enables us to do “Split-Apply-Combine” data analysis paradigm easily. Basically, with Pandas groupby, we can split Pandas data frame into smaller groups using one or more variables. Pandas has a number of aggregating functions that reduce the dimension of the grouped object. In this post will examples of using 13 aggregating function […]
How To Drop Duplicate Rows in Pandas?
In this post, we will learn how to drop duplicate rows in a Pandas dataframe. We will use Pandas drop_duplicates() function to can delete duplicated rows with multiple examples. One of the common data cleaning tasks is to make a decision on how to deal with duplicate rows in a data frame. If the whole […]
Introduction to Linear Regression in Python
Linear regression is one of the most commonly used statistical technique to understand relationship between two quantitative variables (in the simplest case). Simple linear regression models relationship between two variables X and Y, where X and Y are vectors with multiple values. For example, X could how well each country is doing economically, like GDP […]
tidyr 1.0.0 is here. pivot_longer & pivot_wider replace spread & gather
tidyr version 1.0.0 is here with a lot of new changes. tidyr has been around for about five years and it has finally tidyr has reached version 1.0.0. There are four big changes in the new version of tidyr. One of the biggest changes is the new functions pivot_longer() and pivot_wider() for reshaping tabular dataserts. […]