R for Data Science book by Garrett Grolemund and Hadley Wickham is the best book for doing data science with tidyverse. tidyverse, the meta-package, has loads of useful packages like tidyr, dplyr, and ggplot2 to make your life as data scientist easy. Last fall, tidyr package got a big update with version 1.0.0. Until now, […]
Pandas 1.0.0 is Here: Top New Features of Pandas You Should Know
Pandas 1.0.0 is ready for prime time now. Pandas project has come a long way since the early release of Pandas version 0.4 in 2011. It had contributions from 2 developers including Wes Kinney then, now Pandas has over 300 contributors. The latest version of Pandas can be installed from standard package managers like Anaconda, […]
Slides from RStudio Conference 2020 #rstudioconf
RStudio Conference 2020, one of the biggest R/data science conferences ended this week. Among many interesting things at the conference, one of the biggest announcements was that “RStudio has become a Public Benefit Corporation“. This year’s RStudio conference was at San Francisco and next year RStudio conference 2021 will be at Orlando, Florida. For all […]
Data Science with R and Python- A Round Up: January 2020
Here you go with the first post for the year on “Data Science with R and Python Round Up”. The new year resolution is that to continue the monthly round up. This roundup is an attempt to compile interesting news, Python, R blog posts on anything related to data, data science, ML and AI. Hopefully […]
Heatmaps with Seaborn’s ClusterMap
Just recently stumbled on to Seaborn’s ClusterMap function for making heatmaps. Till now relied on Seaborn’s heatmap function for making simple heatmaps with Seaborn heatmap() function and using pheatmap package in R for anything bit complex. Seaborn’s Clustermap function is great for making simple heatmaps and hierarchically-clustered heatmaps with dendrograms on both rows and/or columns. […]