Three new practical books on data science and machine learning have been published recently. All three are introductory-level books that take a hands-on approach to teaching machine learning and the data science toolbox, and all three are great resources for learning machine learning with minimal math. Hands-on Machine Learning with R by Brad Boehmke […]
9 Tips to Make Better Scatter Plots with ggplot2 in R
The scatter plot is one of the most common data visualization methods for understanding the relationship between two quantitative variables. When there is a strong association between two variables, you can easily see the relationship in a scatter plot. However, when the relationship is subtle, it may be tricky to see. In this post we will see 9 […]
How to Highlight Data Points with Colors and Text in Python
Sometimes you might want to highlight a few select data points on a scatter plot. Often you want to show those points in a different color from the rest of the data. Other times you want to show select data points in a different color and also annotate them with text. In […]
Introduction to Linear Regression in R
Linear regression is one of the most commonly used statistical methods. Linear modeling and linear regression help us understand the relationship between multiple variables. In the simplest case, linear regression is about understanding the relationship between two variables, X and Y. One of the ways to understand linear regression is that we have the observed […]
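As a rough illustration of the simplest case mentioned above (one X, one Y), here is a minimal sketch of an ordinary least-squares fit. It is written in plain Python rather than R, and the function name `fit_simple_linear_regression` and the toy data are hypothetical, chosen only for this example; they are not taken from the post itself.

```python
from statistics import mean


def fit_simple_linear_regression(x, y):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x.

    Hypothetical helper for illustration only.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, with at least 2 points")

    x_bar, y_bar = mean(x), mean(y)

    # Sum of squared deviations of x, and sum of cross-deviations of x and y.
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x must contain at least two distinct values")
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Toy data (an assumption for this sketch): points lying exactly on y = 1 + 2x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [3.0, 5.0, 7.0, 9.0, 11.0]

intercept, slope = fit_simple_linear_regression(x, y)
print(f"intercept={intercept:.2f}, slope={slope:.2f}")
```

With real, noisy data the fitted line will not pass through every point; the slope then summarizes the average change in Y per unit change in X.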
Data Science From Scratch 2nd Edition: Book Review
The second edition of Data Science from Scratch: First Principles with Python by Joel Grus is here (since the summer of 2019). The first edition of the book came out about 4-5 years ago, when data science as a field was nascent and most Python use was still on version 2.7. There are two aspects to learn data […]