Often, while working with a pandas DataFrame, you might have a column of categorical values (strings or characters) and want to find the frequency count of each unique element in that column. Pandas’ value_counts() makes it easy to get these counts. Let us get started with an example from a real-world data set. Load gapminder […]
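A minimal sketch of what value_counts() does on a categorical column. The toy data and the column name "continent" are assumptions made up for illustration, not the gapminder data the post loads.

```python
import pandas as pd

# Hypothetical toy data; "continent" is an illustrative column name.
df = pd.DataFrame(
    {"continent": ["Asia", "Europe", "Asia", "Africa", "Asia", "Europe"]}
)

# Frequency count of each unique value, sorted from most to least frequent.
counts = df["continent"].value_counts()

# The same counts expressed as fractions of the total number of rows.
proportions = df["continent"].value_counts(normalize=True)

print(counts)
print(proportions)
```

By default the result is a Series indexed by the unique values, so a single count can be looked up with counts["Asia"].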
Installing Python 3 from Python 2 with Anaconda
If you have already installed Anaconda with Python 2.7 and have finally decided to take the plunge into Python 3, congrats. You don’t have to start fresh. You can easily move to Python 3 by using the Anaconda package manager to create a new environment for Python 3. Note that this virtual environment is completely […]
Slide Decks and Packages in Tweets from 2018 rstudio::conf
The 2018 RStudio conference, one of the interesting conferences for anyone interested in R and RStudio, just ended over the weekend. In case you missed it, Twitter was abuzz with interesting bytes from the conference, including cool new R packages that were presented and slides from the talks. Here is a compilation of tweets containing slides, […]
How to Install Packages from the Jupyter Notebook?
Python package managers, like Anaconda and pip, have made working with Python across different operating systems much simpler. However, if you work with them long enough, you are likely to encounter weird installation problems. One such problem is that even if you have installed a package, you won’t be able to import it in the Jupyter […]
Data Science at the Command Line: Free Online Book
Data Science at the Command Line by Jeroen Janssens, one of the few books that focus on using command line tools to learn data science, is now available for free online at https://www.datascienceatthecommandline.com/. The book shows how the command line can help you do data science more effectively. It covers over 80 command line tools […]