One of the good things of being active in twitter is that you get exposed to interesting content. The bad thing about it is that you retweet with the goal of finding it later and read it. However, you never find it or read it afterwards. So, here is a new attempt to catch up […]
31 Days of #rstats Resources Advent
R community is amazing with sharing excellent learning resources for free. And it seems never ending!. If you are like me, you might have missed a lot of “#rstats resource advent” daily tweets containing fantastic R resources from RStudio’s Mara Averick (@dataandme). Not to worry, you have all the #rstats resource advent tweets from Day […]
Altair 4.0 is here: Barplots, Scatter Plots with Regression Line and Boxplots
Altair 4.0 is here with a lot of new features. Altair is one of the newest data visualization libraries in Python using a grammar of interactive graphics. Altair is one of my favorites. Not a long ago, but still remember the first time I saw an Altair plot, a chart in “Altair-speak” and pretty impressed […]
Introduction to Probabilistic Programming with PyStan
Stan, developed by a team led by Andrew Gelman, is one of the leading languages to do probabilistic computing. The core of probabilistic computing lies in Bayesian statistics. Stan gets its name in honor of Stanislaw Ulam, co-inventor of the Monte Carlo method, the computational engine behind all Bayesian computing. Stan is C++ package providing […]
How To Discretize/Bin a Variable in Python with NumPy and Pandas?
Sometimes you may have a quantitative variable in your data set and you might want to discretize it or bin it or categorize it based on the values of the variable. For example, let us say you have measurements of height and want to discretize it such that it is 0 or 1 depending on […]

