![profile picture](/static/2d0f4e01d6e61412b3e92139e5695299/e9fba/profile-pic.png)
![post](/static/43ee2bed7576375009d996eb654eb884/855e2/ogp.jpg)
Kaggle Competition Report: Automated Essay Scoring 2.0
July 07, 2024 | 5 min read
This competition was all about distribution shift. Let's learn how the winners conquered the challenge.
![post](/static/b99e305937672113ed4aced14b15b367/0b0b8/ogp.jpg)
Aligning LLMs without Paired Preference Labels
May 08, 2024 | 3 min read
It's hard to collect paired preferences. Can we align LLMs without them? Yes, with KTO!
![post](/static/2ae68f0677655e5a34f125ca7f49cf73/0fea4/ogp.jpg)
Aligning LLMs without Reinforcement Learning
April 27, 2024 | 3 min read
DPO reduces the effort required to align LLMs. Here is how I created the Reviewer #2 Bot from TinyLlama using DPO.
![post](/static/22b0089e087222479b1bf4275956b7ff/e6ac2/figure.png)
Elo vs Bradley-Terry: Which is Better for Comparing the Performance of LLMs?
March 17, 2024 | 4 min read
Chatbot Arena updated its LLM ranking method from Elo to Bradley-Terry. What changed? Let's dig into the differences.
![post](/static/a0af896c2b4d5c6a7376f87b85717ed6/139c3/ogp.jpg)
Unpacking the Tricky Behavior of React Navigation's navigate Function
March 09, 2024 | 2 min read
Does your React Native app go back to an unexpected screen? Here's how to deal with it.
![post](/static/96ca3887b1e402fd94695d4c069a6fc4/44e57/ogp.jpg)
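What My Spouse and I Talked About After Reading 『デュアルキャリア・カップル』 (Dual-Career Couples)
February 03, 2024 | 6 min read
After reading 『デュアルキャリア・カップル』 (Dual-Career Couples), I write about what my spouse and I discussed in order to get through the "first transition."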
![post](/static/f78ad4ef46029b1c91cbcb4d6faf02d2/9bfa7/ogp.jpg)
Year in Review: Deep Learning Papers in 2023
January 27, 2024 | 12 min read
Let's look back at the significant progress made in deep learning in 2023! Here are my 10 favorite papers.
![post](/static/30458ccbb771d00b239fb12042b0b363/88246/ogp.jpg)
Kaggle Competition Report: LLM Science Exam
November 19, 2023 | 4 min read
Can LLMs answer scientific questions? See how Kaggle winners used LLMs and RAG!