• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar

Python and R Tips

Learn Data Science with Python and R

  • Home
  • Python
  • Pandas
    • Pandas 101
  • tidyverse
    • tidyverse 101
  • R
  • Linux
  • About
    • Privacy Policy

Empirical cumulative distribution function (ECDF) in Python

May 17, 2019 by cmdlinetips

Histograms are a great way to visualize a single variable. One of the problems with histograms is that one has to choose the bin size. With a wrong bin size your data distribution might look very different. In addition to bin size, histograms may not be a good option to visualize distributions of multiple variables […]

Filed Under: ECDF, Python ECDF Tagged With: ECDF in Python

How To Randomly Add NaN to Pandas Dataframe?

May 12, 2019 by cmdlinetips

In this post we will see an example of how to introduce missing value, i.e. NaNs randomly in a data frame uusisng Pandas. Sometimes while testing a method, you might want to create a Pandas dataframe with NaNs randomly distributed. Here wee show how to do it. Let us load the packages we need Let […]

Filed Under: add random NaNs in Pandas, Pandas mask Tagged With: add random NaNs in Pandas, Pandas mask

How To Highlight Select Data Points with ggplot2 in R?

May 9, 2019 by cmdlinetips

The power of ggplot2 lies in making it easy to make great plots and in easily tweaking it to the one wants. Sometimes, one might want to highlight certain data points in a plot in different color. Here we will see an example of highlighting specific data points in a plot. Let us first load […]

Filed Under: ggplot2, highlight data in ggplot2, R, R Tips, tidyverse 101 Tagged With: highlight data in ggplot2, highlight select points in R

How to Implement Pandas Groupby operation with NumPy?

May 8, 2019 by cmdlinetips

Pandas’ GroupBy function is the bread and butter for many data munging activities. Groupby enables one of the most widely used paradigm “Split-Apply-Combine”, for doing data analysis. Sometimes you will be working NumPy arrays and may still want to perform groupby operations on the array. Just recently wrote a blogpost inspired by Jake’s post on […]

Filed Under: Groupby with NumPy, NumPy, Pandas 101, Pandas Groupby using NumPy, Python Tips Tagged With: Groupby with NumPy, Pandas Groupby using NumPy, pandas groupby()

Implementing K-means clustering in Python from Scratch

May 5, 2019 by cmdlinetips

K-means Cluster Assignment: Third Iteration

K-means clustering is one of the commonly used unsupervised techniques in Machine learning. K-means clustering clusters or partitions data in to K distinct clusters. In a typical setting, we provide input data and the number of clusters K, the k-means clustering algorithm would assign each data point to a distinct cluster. In this post, we […]

Filed Under: K-means clustering, K-means clustering example Python, K-means clustering in Python, Machine Learning Tagged With: K-means clustering, K-means clustering in Python, Machine Learning, Machine Learninhg

  • « Go to Previous Page
  • Page 1
  • Interim pages omitted …
  • Page 39
  • Page 40
  • Page 41
  • Page 42
  • Page 43
  • Interim pages omitted …
  • Page 74
  • Go to Next Page »

Primary Sidebar

Subscribe to Python and R Tips and Learn Data Science

Learn Pandas in Python and Tidyverse in R

Tags

Altair Basic NumPy Book Review Data Science Data Science Books Data Science Resources Data Science Roundup Data Visualization Dimensionality Reduction Dropbox Dropbox Free Space Dropbox Tips Emacs Emacs Tips ggplot2 Linux Commands Linux Tips Mac Os X Tips Maximum Likelihood Estimation in R MLE in R NumPy Pandas Pandas 101 Pandas Dataframe Pandas Data Frame pandas groupby() Pandas select columns Pandas select_dtypes Python Python 3 Python Boxplot Python Tips R rstats R Tips Seaborn Seaborn Boxplot Seaborn Catplot Shell Scripting Sparse Matrix in Python tidy evaluation tidyverse tidyverse 101 Vim Vim Tips

RSS RSS

  • How to convert row names to a column in Pandas
  • How to resize an image with PyTorch
  • Fashion-MNIST data from PyTorch
  • Pandas case_when() with multiple examples
  • An Introduction to Statistical Learning: with Applications in Python Is Here
  • 10 Tips to customize ggplot2 title text
  • 8 Plot types with Matplotlib in Python
  • PCA on S&P 500 Stock Return Data
  • Linear Regression with Matrix Decomposition Methods
  • Numpy’s random choice() function

Copyright © 2026 · Lifestyle Pro on Genesis Framework · WordPress · Log in

Go to mobile version