Ever wondered what the most useful R packages for doing data science are? You don’t have to wonder anymore: RStudio has collated a list of the most useful R packages in its GitHub repository titled “RStartHere”. How did RStudio come up with the list? RStudio used the iconic data science workflow […]
R Tips
How to Make a Boxplot in R with ggplot2?
One of the many strengths of R is the tidyverse packages and the ability to make great-looking plots easily. The boxplot, or box-and-whisker plot, introduced by John Tukey, is great for visualizing data from multiple groups or distributions. A boxplot allows you to display the data together with an efficient summary of the data using min, […]
Reticulate: an RStudio package to use both R and Python from R Markdown
Do you love working with Python, but just can’t get enough of ggplot, R Markdown, or the tidyverse packages? You are not alone: many people love both R and Python and use them all the time. Now RStudio has made the reticulate package, which offers an awesome set of tools for interoperability between Python and R. One […]
How To Plot Ridgeline Plots in R?
Ridgeline plots are a great way to visualize changes in multiple distributions or histograms over either time or space. For a brief time, they were called joyplots. The ggridges package from UT Austin professor Claus Wilke lets you make ridgeline plots in combination with ggplot. Here is how Claus describes the ridgeline plot with a brief […]
How To Generate Random Numbers from Probability Distributions in R?
Understanding probability distributions, and how to simulate random numbers from a specific probability distribution, is very useful for understanding probability and using it effectively in data science. Here we will look at how to simulate/generate random numbers from the 9 most commonly used probability distributions in R and visualize the 9 probability distributions […]