Altair 4.0 is here with a lot of new features. Altair is one of the newest data visualization libraries in Python using a grammar of interactive graphics. Altair is one of my favorites. Not a long ago, but still remember the first time I saw an Altair plot, a chart in “Altair-speak” and pretty impressed […]
Introduction to Probabilistic Programming with PyStan
Stan, developed by a team led by Andrew Gelman, is one of the leading languages to do probabilistic computing. The core of probabilistic computing lies in Bayesian statistics. Stan gets its name in honor of Stanislaw Ulam, co-inventor of the Monte Carlo method, the computational engine behind all Bayesian computing. Stan is C++ package providing […]
How To Discretize/Bin a Variable in Python with NumPy and Pandas?
Sometimes you may have a quantitative variable in your data set and you might want to discretize it or bin it or categorize it based on the values of the variable. For example, let us say you have measurements of height and want to discretize it such that it is 0 or 1 depending on […]
10 Tricks for tidyverse in R
Just happened to come across this tweet about David Robinson’s talk on “Ten Tremendous Tricks for Tidyverse”. It looked like a fantastic and useful talk. These ten tricks involve tidyverse functions one may not have heard of or thought of using in a scenario. The first four tidyverse tips is about counting and summarizing, next […]
Data Science Books on Black Friday Deals 2019
Amazon is running a limited time book deal on Black Friday 2019, in which you can get three books for the price of two books. (The offer is still available as of 7am (EST).) Basically if you buy three books from a list, you don’t pay for the book with the least price. Are you […]