Stats with Python: Unbiased Variance
January 17, 2021 | 7 min readWhat is unbiased sample variance? Why divide by n-1? With a little programming with Python, it's easier to understand.
Under the sea, in the hippocampus's garden...
What is unbiased sample variance? Why divide by n-1? With a little programming with Python, it's easier to understand.
Let's re-inplement face swapping in 10 minutes! This post shows a naive solution using a pre-trained CNN and OpenCV.
Lightweight GAN has opened the way for generating fine images with ~100 training samples and affordable computing resources. This post presents "This Sushi Does Not Exist" and how I built it with GAE.
If you want to use a custom loss function with a modern GBDT model, you'll need the first- and second-order derivatives. This post shows how to implement them, using LightGBM as an example
This post introduces how to sample groups from a dataset, which is helpful when you want to avoid data leakage.
This post compares the behaviors of different feature importance measures in tricky situations.
This post introduces the Pandas method of `query`, which allows us to query dataframes in an SQL-like manner.
This post introduces PFRL, a new reinforcement learning library, and uses it to learn to play the Slime Volleyball game on Colaboratory.
This post summarizes how to group data by some variable and draw boxplots on it using Pandas and Seaborn.
Double descent is one of the mysteries of modern machine learning. I reproduced the main results of the recent paper by Nakkiran et al. and posed some questions that occurred to me.
Have you ever confused Pandas methods `loc`, `at`, and `iloc` with each other? It's no more confusing when you have this table in mind.
How does Google's PageRank work? Its theory and algorithm are explained, followed by numerical experiments.
Want to generate realistic images with a single GPU? This post demonstrates how to downsize StyleGAN2 with slight performance degradation.