Under the sea, in the hippocampus's garden...
This post introduces how to sample groups from a dataset, which is helpful when you want to avoid data leakage.
This post introduces the Pandas method of `query`, which allows us to query dataframes in an SQL-like manner.
This post summarizes how to group data by some variable and draw boxplots on it using Pandas and Seaborn.
Have you ever confused Pandas methods `loc`, `at`, and `iloc` with each other? It's no more confusing when you have this table in mind.
Citation counts shouldn't be the only measurement of the impact of academic papers. I applied Google's PageRank to evaluating academic papers's importance.