Often you may want to collapse two or multiple columns in a Pandas data frame into one column. For example, you may have a data frame with data for each year as columns and you might want to get a new column which summarizes multiple columns. One may need to have flexibility of collapsing columns […]
How To Convert a Column to Row Name/Index in Pandas?
Pandas has a method set_index to covert a column in Pandas dataframe into rowname or row index. Let us see an example of converting a column name into rowname in Pandas. Let us load pandas as “pd”. Let us use real-world gapminder data from vega_datasets. Convert a Column to Row Name Let us convert the […]
Short videos to Learn Basics of Probability and Statistics
Basic concepts in Probability and statistics are at the heart of Data Science. And there is no better person than Prof.Joe Blitzstein to learn Probability and statistics. Joe has come up with amazing short videos explaining the basic concepts for his new course. If you are data science beginner or a veteran, Joe’s short videos […]
ggplot2 Version 3.0.0 Brings Tidy Evaluation to ggplot
RStudio has unveiled major updates to ggplot2 with new version 3.0.0. The new ggplot2 version is available on CRAN about two weeks ago. ggplot2 3.0.0 was originally announced as ggplot2 2.3.0, but big updates made RStudio to bump the version number to 3.0.0. One of the biggest additions in the new version is that ggplo2 […]
Publication Quality Graphics in #rstats
The visualization guru, Edward Tufte, known for all things visualization, tweeted that #rstats alone is not good enough for phublication quality graphics. He claimed “Publication-quality work requires: R + Adobe Illustrator + reasoning about words on graphics + respect for audience/readers/viewers “. #Rstats coders and users just can’t do words on graphics and typography. Proof: […]


