![[object Object]](/static/2d0f4e01d6e61412b3e92139e5695299/e9fba/profile-pic.png)

Year in Review: Deep Learning Papers in 2024
December 29, 2024 | 13 min readReflecting on 2024's deep learning breakthroughs! Discover my top 10 favorite research papers that shaped the field this year.

Evolution of Preference Optimization Techniques
November 13, 2024 | 5 min readRLHF is not the only method for AI alignment. This article introduces modern algorithms like DPO and KTO that offer simpler and more stable alternatives.

Kaggle Competition Report: Automated Essay Scoring 2.0
July 07, 2024 | 5 min readThis competition was all about distribution shift. Let's learn how the winners conquered the challenge.

Aligning LLMs without Paired Preference Labels
May 08, 2024 | 3 min readIt's hard to collect paired preferences. Can we align LLMs without them? Yes, with KTO!

Aligning LLMs without Reinforcement Learning
April 27, 2024 | 3 min readDPO reduces the effort required to align LLMs. Here is how I created the Reviewer #2 Bot from TinyLlama using DPO.

Elo vs Bradley-Terry: Which is Better for Comparing the Performance of LLMs?
March 17, 2024 | 4 min readChatbot Arena updated its LLM ranking method from Elo to Bradley-Terry. What changed? Let's dig into the differences.