In this post, we will learn how to filter Pandas dataframe by column values. More specifically, we will subset a pandas dataframe based on one or more values of a specific column. In this tutorial, we will see SIX examples of using Pandas dataframe to filter rows or select rows based values of a column(s). […]
Python
Pandas GroupBy: Introduction to Split-Apply-Combine
In a classic paper published at 2011, Hadley Wickham asked What do we do when we analyze data? What are common actions and what are common mistakes? And then went ahead to spell it out one of the most common strategies, Split-Apply-Combine, that is used in common data analysis. Intuitively, while solving a big problem, […]
JupyterLab is Here: First Impressions
JupyterLab, the next-generation web-based user interface for Python and R from Project Jupyter. It is still a beta release, but stable for daily use. One of the cool features of JupyterLab is that it is the go to browser based app for classic Jupyter Notebook, file browser for your computer files, text editor and a […]
What Does *args and **kwargs Mean in Python?
If you are new Python and saw the use of *args and **kwargs as function arguments and wondered what those *-thingies are, you are not alone. Typically when you write functions, you will have specific number and types of arguments the function can take as input. However, the more Python code you write, you might […]
How To Randomly Select Rows in Pandas?
Creaating unbiased training and testing data sets are key for all Machine Learning tasks. Pandas’ sample function lets you randomly sample data from Pandas data frame and help with creating unbiased sampled datasets. It is a great way to get downsampled data frame and work with it. In this post, we will learn three ways […]