Visualizing data as a heatmap is a great data exploration technique for high-dimensional data. Sometimes you would like to visualize the correlation as a heatmap instead of the raw data to understand the relationship between the variables in your data. In this post we will see examples of visualizing a correlation matrix as a heatmap in […]
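As a rough illustration of the idea in this excerpt, here is a minimal sketch that computes a correlation matrix with Pandas and draws it as a heatmap with Matplotlib. The toy data, the column names `x`, `y`, `z`, and the output file name are all made up for the example, and the full post may well use a different library or dataset.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Hypothetical toy data: "y" depends on "x" by construction, "z" is independent noise.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
df = pd.DataFrame({
    "x": x,
    "y": 2 * x + rng.normal(scale=0.5, size=200),
    "z": rng.normal(size=200),
})

# Pairwise Pearson correlation between the numeric columns.
corr = df.corr()

# Draw the correlation matrix as a heatmap; fix the color scale to [-1, 1].
fig, ax = plt.subplots(figsize=(4, 4))
im = ax.imshow(corr.to_numpy(), cmap="coolwarm", vmin=-1, vmax=1)
ax.set_xticks(range(len(corr.columns)))
ax.set_xticklabels(corr.columns)
ax.set_yticks(range(len(corr.index)))
ax.set_yticklabels(corr.index)
fig.colorbar(im, ax=ax, label="Pearson correlation")
fig.savefig("correlation_heatmap.png", bbox_inches="tight")
```

Fixing the color scale to the range -1 to 1 keeps the colors comparable across datasets, since a diverging colormap then always maps zero correlation to its midpoint.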
Python
Pandas 1.0.0 is Here: Top New Features of Pandas You Should Know
Pandas 1.0.0 is ready for prime time now. The Pandas project has come a long way since the early release of Pandas version 0.4 in 2011. Back then it had contributions from 2 developers, including Wes McKinney; now Pandas has over 300 contributors. The latest version of Pandas can be installed from standard package managers like Anaconda, […]
11 Tips to Make Plots with Pandas
The Python Pandas library is well known for its amazing data munging capabilities. However, a somewhat underused feature of Pandas is its plotting capabilities. Yes, one can make better visualizations with Matplotlib, Seaborn, or Altair. However, Pandas plotting capabilities can be extremely handy when you are in exploratory data analysis mode and want to quickly […]
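To give a flavor of what quick plotting straight from a DataFrame looks like, here is a small sketch using `DataFrame.plot`. The data, the column names, and the output file name are invented for illustration and are not taken from the post itself.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import pandas as pd

# Hypothetical toy data for a quick exploratory plot.
df = pd.DataFrame({
    "height_cm": [150, 160, 165, 172, 180, 188],
    "weight_kg": [52, 58, 63, 70, 79, 90],
})

# One call on the DataFrame produces a Matplotlib Axes object.
ax = df.plot(kind="scatter", x="height_cm", y="weight_kg", title="Height vs weight")
ax.figure.savefig("height_vs_weight.png", bbox_inches="tight")
```

Because the call returns a regular Matplotlib `Axes`, the plot can still be customized afterwards with the usual Matplotlib methods.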
Altair 4.0 is here: Barplots, Scatter Plots with Regression Line and Boxplots
Altair 4.0 is here with a lot of new features. Altair is one of the newest data visualization libraries in Python using a grammar of interactive graphics. Altair is one of my favorites. It was not long ago, but I still remember the first time I saw an Altair plot, a chart in “Altair-speak”, and was pretty impressed […]
Three New Interesting Books on Data Science/Machine Learning
Three new practical books on Data Science/Machine Learning have been published recently. All three are introductory-level books that take a hands-on approach to teaching the Machine Learning and Data Science toolbox. All three books are great additions for learning Machine Learning with minimal math. Hands-on Machine Learning with R Hands-on Machine Learning with R by Brad Boehmke […]