Basic concepts in Probability and statistics are at the heart of Data Science. And there is no better person than Prof.Joe Blitzstein to learn Probability and statistics. Joe has come up with amazing short videos explaining the basic concepts for his new course. If you are data science beginner or a veteran, Joe’s short videos […]
ggplot2 Version 3.0.0 Brings Tidy Evaluation to ggplot
RStudio has unveiled major updates to ggplot2 with new version 3.0.0. The new ggplot2 version is available on CRAN about two weeks ago. ggplot2 3.0.0 was originally announced as ggplot2 2.3.0, but big updates made RStudio to bump the version number to 3.0.0. One of the biggest additions in the new version is that ggplo2 […]
Publication Quality Graphics in #rstats
The visualization guru, Edward Tufte, known for all things visualization, tweeted that #rstats alone is not good enough for phublication quality graphics. He claimed “Publication-quality work requires: R + Adobe Illustrator + reasoning about words on graphics + respect for audience/readers/viewers “. #Rstats coders and users just can’t do words on graphics and typography. Proof: […]
How To Split A Column or Column Names in Pandas and Get Part of it?
Often you may want to create a new variable either from column names of a pandas data frame or from one of the columns of the data frame. Just like Python, Pandas has great string manipulation abilities that lets you manipulate strings easily. Let us see an example of using Pandas to manipulate column names […]
Plotnine: A Python library to use ggplot2 in Python
Python’s visualization options can be difficult to grasp, if you recently moved to use Python after using R for a while. There are a lot of options. Multiple options for visualizations are good. Especially, mature matplotlib, Seaborn, Bokeh and the new library like Altair are great to learn and use for data visualization in Python. […]