Python package managers such as Anaconda and pip have made working with Python across different operating systems much simpler. However, if you work with them long enough, you are likely to run into odd installation problems. One such problem is that even after you have installed a package, you won’t be able to import it in the Jupyter […]
Data Science at the Command Line: Free Online Book
Data Science at the Command Line by Jeroen Janssens, one of the few books that focus on using command-line tools for data science, is now available for free online at https://www.datascienceatthecommandline.com/. The book teaches how to use the power of the command line to do data science more effectively. It covers over 80 command-line tools […]
3 Ways to Read a File and Skip Initial Comments in Python
Reading a text file line by line is a common task when dealing with a big text file. Often, you are not interested in the first few lines and want to skip them and work with the rest of the file. The initial few lines of the text file that you want to […]
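As a minimal sketch of the idea in this excerpt, the snippet below skips leading comment lines and keeps everything after them. It uses `itertools.dropwhile` from the standard library, which is one possible approach and not necessarily one of the three the post describes. The `#` comment prefix, the helper name `skip_initial_comments`, and the in-memory sample data are assumptions made for illustration.

```python
import io
from itertools import dropwhile


def skip_initial_comments(lines, prefix="#"):
    """Return an iterator over lines, starting at the first line
    that does not begin with the comment prefix (hypothetical helper)."""
    return dropwhile(lambda line: line.startswith(prefix), lines)


# An in-memory stand-in for a real file; open("some_file.txt") would work the same way.
sample = io.StringIO(
    "# header comment\n"
    "# another comment\n"
    "a,b\n"
    "# not skipped: comes after the first data line\n"
    "1,2\n"
)

remaining = [line.rstrip("\n") for line in skip_initial_comments(sample)]
print(remaining)
```

Only the comment lines at the top are dropped; a line starting with `#` later in the file is kept, because `dropwhile` stops filtering as soon as the first non-comment line appears.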
How to Get Unique Values from a Column in Pandas Data Frame?
In this tutorial, we will learn how to get the unique values of a column in a Pandas dataframe using two approaches. We will first use the Pandas unique() function to get the unique values of a column, and then use the Pandas drop_duplicates() function to do the same. Pandas unique() function To Get Unique values […]
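The two approaches named in this excerpt can be sketched roughly as follows. The dataframe and its `continent` column are made-up sample data, not taken from the tutorial.

```python
import pandas as pd

# Hypothetical sample data with repeated values in one column.
df = pd.DataFrame({"continent": ["Asia", "Europe", "Asia", "Africa", "Europe"]})

# Approach 1: Series.unique() returns the distinct values in order of first appearance.
unique_values = df["continent"].unique()

# Approach 2: drop_duplicates() keeps the first occurrence of each value
# and returns a dataframe rather than an array.
unique_rows = df[["continent"]].drop_duplicates()
unique_list = unique_rows["continent"].tolist()

print(list(unique_values))
print(unique_list)
```

The main practical difference is the return type: `unique()` gives back an array of values, while `drop_duplicates()` gives back a dataframe that keeps the original row index of each first occurrence.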
Why can’t ggplot2 use %>% instead of “+”?
If you have ever mixed up the pipe operator %>% from dplyr (magrittr) with ggplot2’s “+” while piping dplyr commands into ggplot2, you are not alone. Depending on where you used “%>%” in place of “+”, you might get confusing errors like Error: Mapping must be created by `aes()` or `aes_()` Error in as.vector(x, “character”) : cannot coerce type […]

