I finally got a chance to look at the third edition of Sebastian Raschka’s Python Machine Learning, which focuses on machine learning and deep learning with Python, scikit-learn, and TensorFlow 2. It is a big book and has been around for a while on ML/DL time scales. I always wanted to check it out. Thanks to the […]
tidyr’s pivot_longer(): Reshape Wide Data to Long/Tidy Data
One of the most common activities in data analysis is reshaping data from one form to another. For human eyes and data collection, it is often easier to work with data in wide form. For analysis, however, it is usually more convenient to have the data in tidy/long form. tidyr, […]
How To Reshape Tidy Data to Wide Data with pivot_wider() from tidyr
Reshaping data from one form to another is one of the most common data munging activities. tidyr, an R package that is part of the tidyverse, provides core functions to manipulate datasets in wide or long form. In this post, we will see examples of one of tidyr’s core functions, pivot_wider(), to convert data in long tidy […]
New Version of Patchwork is Here: Inset a plot inside a plot
A new version of the R package Patchwork is available on CRAN. If you are not familiar with Patchwork, it is an R package developed by the “one and only” Thomas Lin Pedersen, and it makes it possible to combine multiple plots made in R, with either ggplot2 or base R, into a single graphic. As […]
How to Compute Summary Statistics Across Multiple Columns in R
dplyr’s group_by() function lets you group a dataframe by one or more variables and compute summary statistics on the other variables in the dataframe using the summarize() function. Sometimes you might want to compute summary statistics, such as the mean or median, on multiple columns. A naive approach is to compute summary statistics by manually doing […]