Hippocampus's Garden

Under the sea, in the hippocampus's garden...

Welcome to Hippocampus's Garden, Shion Honda's blog. I regularly write about machine learning, statistics, programming, and my hobbies.

Evolution of Preference Optimization Techniques

November 13, 2024  |  5 min read

RLHF is not the only method for AI alignment. This article introduces modern algorithms like DPO and KTO that offer simpler and more stable alternatives.

Kaggle Competition Report: Automated Essay Scoring 2.0

July 07, 2024  |  5 min read

This competition was all about distribution shift. Let's learn how the winners conquered the challenge.

Aligning LLMs without Reinforcement Learning

April 27, 2024  |  3 min read

DPO reduces the effort required to align LLMs. Here is how I created the Reviewer #2 Bot from TinyLlama using DPO.

Elo vs Bradley-Terry: Which is Better for Comparing the Performance of LLMs?

March 17, 2024  |  4 min read

Chatbot Arena updated its LLM ranking method from Elo to Bradley-Terry. What changed? Let's dig into the differences.

Unpacking the Tricky Behavior of React Navigation's navigate Function

March 09, 2024  |  2 min read

Does your React Native app go back to an unexpected screen? Here's how to deal with it.

Shion Honda

Hippocampus's Garden © 2024, Shion Honda. Built with Gatsby