NumPy or Numerical Python is one of the packages in Python for all things computing with numerical values. Learning NumPy makes one’s life much easier to compute with multi-dimensional arrays and matrices. A huge collection of very useful mathematical functions available to operate on these arrays these arrays makes it one of the powerful environment […]
How To Separate a Column into Multiple Rows with in R?
I just came across a useful little function in tidyr called separate_rows(). Often you may have a data frame with a column containing multiple information concatenated together with a delimiter. For example, we might have data frame with members of a family in a column separated by a delimiter. Here is a pictorial representation of […]
How To Reshape Pandas Dataframe with melt and wide_to_long()?
Reshaping data frames into tidy format is probably one of the most frequent things you would do in data wrangling. In this post, we will learn how to use Pandas melt() function and wide_long_long() function to reshape Pandas dataframe in wide form to long tidy form. A data frame is tidy when it satisfies the […]
Introduction to Sparse Matrices in R
Often you may deal with large matrices that are sparse with a few non-zero elements. In such scenarios, keeping the data in full dense matrix and working with it is not efficient. A better way to deal with such sparse matrices is to use the special data structures that allows to store the sparse data […]
Book Review: Fundamentals of Data Visualization
Finally got a chance to write down quick thoughts on Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures by Claus Wilke. ICYMI, Fundamentals of Data Visualization is a fantastic book on data visualization that was developed openly, freely available and just recently the physical book is available for purchase. I have […]