In this quick post we will see 14 simple tips that will help get started dealing with date as variable instead of a string. We will see tips on how to create a date variable in few different ways and get components of date variable using the R package lubdridate. The R package lubdridate was […]
7 Dimensionality Reduction Techniques by Examples in Python
Dimension Reduction techniques are one of the most useful methods in unsupervised learning of high dimensional datasets. In this post, we will learn how to use Python to perform 7 most commonly used dimensionality reduction techniques by example, PCA: Principal Component Analysis SVD: Singular Value Decomposition ICA: Independent Component Analysis NMF: Non-negative Matrix Factorization FA: […]
6 Tips to change column positions with dplyr’s relocate()
Sometimes you want to change the order of columns in a dataframe. dpyr’s relocate() function makes it easy to move one or more columns to new positions easily by using the same syntax as select() function. In this post we will see 7 tips to change column order or column position using dplyr’s relocate(). Let […]
6 Dimensionality Reduction Techniques in R (with Examples)
Dimension Reduction techniques are one of the most useful methods in unsupervised learning of high dimensional datasets. In this post, we will learn how to use R to perform 6 most commonly used dimensionality reduction techniques, PCA: Principal Component Analysis SVD: Singular Value Decomposition ICA: Independent Component Analysis NMF: Non-negative Matrix Factorization tSNE UMAP We […]
2 Ways to Randomly Sample Rows from a large CSV file
In this post, we will be learning how to randomly sample/select rows from a large CSV file that is either taking too long to load as a Pandas dataframe or can’t load at all. The key idea is to not to load the whole file as a Pandas dataframe. Instead, we use skiprows argument in […]