
Training A Not-So-Large Language Model for $50
September 08, 2023 | 3 min read
What I cannot create, I do not understand. Let's train your own LLM!
Under the sea, in the hippocampus's garden...
PyTorch 2.0 introduced a new feature for JIT-compiling. How can it accelerate model training and inference?
A quick guide for RLHF using trlX, OPT-1.5B, and LoRA.
A summary of the "Deep Learning Tuning Playbook", a guide to choosing hyperparameters.
Uncover the top deep learning advancements of 2022. A year-in-review of key research papers and applications.
A collection of images I asked DALL·E 2 to generate.
Let's look back at the updates in deep learning in 2021! This post covers four application projects worth checking out.
Let's look back at the updates in deep learning in 2021! This post covers eight research papers worth checking out.
The ability of StyleGAN to generate super-realistic images has been inspiring many application works. To have some sort of organized view on them, this post covers important papers with a focus on image manipulation.