Installing Python 3 from Python 2 with Anaconda

If you have already installed Anaconda 2.7 and finally decided to take a plunge into Python 3 and want to install Python 3. Congrats. You don’t have to start fresh. You can easily upgrade to Python 3 using Anaconda package manager by creating new environment for Python 3. Note that this virtual environment is completely […]

Slide Decks and Packages in Tweets from 2018 rstudio::conf

2018 RStudio conference, one of the interesting conferences for anyone interested in R and RStudio just ended over the weekend. In case you missed it, twitter was abuzz with interesting bytes from the conference, including cool new R package that was presented and slides of the talks. Here is a compilation of tweets containing slides, […]

How to Install Packages from the Jupyter Notebook?

Python package managers, like Anaconda and pip, have made our life much simpler working with Python in different operating systems. However, if you work long enough, you are likely to encounter weird installation problems. One such problem is even if you have installed a package, you won’t be able to import it in the Jupyter […]

Data Science at the Command Line: Free Online Book

Data Science at the Command Line by Jeroen Janssens, one of the few books focuses on the use of command line tools to learn Data Science is available now for free online at https://www.datascienceatthecommandline.com/. The book teaches the power of command line to do data science more effectively. It covers over 80 command line tools […]

3 Ways to Read a File and Skip Initial Comments in Python

Reading a text file line by line is one of the common activities you do while dealing with a big text file. Often, you are not interested in initial few lines and want to skip them and work with rest of the file. The initial few lines of the text file that you want to […]

How to Get Unique Values from a Column in Pandas Data Frame?

Often while working with a big data frame in pandas, you might have a column with string/characters and you want to find the number of unique elements present in the column. Pandas library in Python easily let you find the unique values. Let us get started with some examples from a real world data set. […]

Why can’t ggplot2 use %>% instead of “+”?

Have you ever mixed up dplyr’s pipe operator or magittr %>% with ggplot’s “+” while piping dplyr commands with ggplot2, you are not alone. Depending on where you misused “%>%” for “+”, you might get confusing errors like Error: Mapping must be created by `aes()` or `aes_()` Error in as.vector(x, “character”) : cannot coerce type […]

Interested in Data Science? Follow DataFramed Podcast from Datacamp

Datacamp, one of the leading data science education portal with over 100 courses on variety of data science aspects in both Python and R, has launched a new podcast named “DataFramed“. The main theme of the podcast DataFramed is, you guessed it, data science. DataFramed is a weekly data science podcast hosted by DataCamp’s Hugo Bowne-Anderson. […]

How To Add a New Column to Using a Dictionary in Pandas Data Frame ?: Pandas Tutorial

Pandas library in Python has a really cool function called map that lets you manipulate your pandas data frame much easily. Pandas’ map function lets you add a new column with values from a dictionary if the data frame has a column matching the keys in the dictionary. Adding a New Column Using keys from […]

6 Most Useful dplyr Commands to Manipulate a Data Frame in R

dplyr is one of the R packages developed by Hadley Wickham to manipulate data stored in data frames. Data frame is a two-dimensional data structure, where each column can contain a different type of data, like numerical, character and factors. In case you wondered the meaning of the word “dplyr”, it is like “pliers” for […]