Evolution of Preference Optimization Techniques
November 13, 2024 | 5 min read

RLHF is not the only method for AI alignment. This article introduces modern algorithms like DPO and KTO that offer simpler and more stable alternatives.
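To see why DPO is simpler than RLHF, here is a minimal sketch of its loss in PyTorch. It assumes the per-sequence log-probabilities of the chosen and rejected responses have already been computed under both the policy and a frozen reference model; the function and variable names are illustrative, not from the article.

```python
import torch
import torch.nn.functional as F

def dpo_loss(
    policy_chosen_logps: torch.Tensor,
    policy_rejected_logps: torch.Tensor,
    ref_chosen_logps: torch.Tensor,
    ref_rejected_logps: torch.Tensor,
    beta: float = 0.1,
) -> torch.Tensor:
    """Sketch of the DPO objective (Rafailov et al., 2023).

    Each argument is a batch of per-sequence log-probabilities
    (summed over tokens) for the chosen / rejected responses.
    """
    # Implicit rewards: beta-scaled log-ratios between policy and reference.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the margin between chosen and rejected rewards.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```

Note that this is a plain supervised loss over preference pairs: no separate reward model and no RL loop, which is where the simplicity and stability claims come from.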
It's hard to collect paired preferences. Can we align LLMs without them? Yes, with KTO!
DPO reduces the effort required to align LLMs. Here is how I created the Reviewer #2 Bot from TinyLlama using DPO.
Let's look back at the significant progress made in deep learning in 2023! Here are my 10 favorite papers.
What I cannot create, I do not understand. Let's train our own LLM!
PyTorch 2.0 introduced torch.compile, a new feature for JIT compilation. How can it accelerate model training and inference?
A quick guide for RLHF using trlX, OPT-1.3B, and LoRA.
I summarized the "Deep Learning Tuning Playbook", a guide for choosing hyperparameters.
Uncover the top deep learning advancements of 2022. A year-in-review of key research papers and applications.
A collection of images I asked DALL·E 2 to generate.
Let's look back at the updates in deep learning in 2021! This post covers four application projects worth checking out.
Let's look back at the updates in deep learning in 2021! This post covers eight research papers worth checking out.
StyleGAN's ability to generate super-realistic images has inspired many application works. To give an organized view of them, this post covers important papers, with a focus on image manipulation.
The NeurIPS 2020 virtual conference was full of exciting presentations! Here I list some notable ones with brief introductions.
Let's look back on the machine learning papers published in 2020! This post covers 10 representative papers that I found interesting and worth reading.
The Transformer has been studied extensively, from new applications to model enhancements. This post provides an overview of these studies.
Double descent is one of the mysteries of modern machine learning. I reproduced the main results of the recent paper by Nakkiran et al. and posed some questions that occurred to me.
This post explains how MobileBERT succeeded in reducing both model size and inference time, and introduces its TensorFlow.js implementation that runs in web browsers.
"Representation" is a way AIs understand the world. This post is a short introduction to the representation learning in the "deep learning era."
Want to generate realistic images with a single GPU? This post demonstrates how to downsize StyleGAN2 with only slight performance degradation.