From 1b3afc37aaffedf1d2606d625eff0bfc674f3215 Mon Sep 17 00:00:00 2001 From: Brad Hilton Date: Mon, 7 Oct 2024 14:45:25 -0600 Subject: [PATCH] Add blog post to README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index f906b0d..3f30693 100644 --- a/README.md +++ b/README.md @@ -20,6 +20,7 @@ And the repository will be continuously updated to track the frontier of LLM Rea - [Nathan Lambert] [Reverse engineering OpenAI’s o1](https://www.interconnects.ai/p/reverse-engineering-openai-o1) - [Andreas Stuhlmüller, jungofthewon] [Supervise Process, not Outcomes](https://www.alignmentforum.org/posts/pYcFPMBtQveAjcSfH/supervise-process-not-outcomes) - [Nouha Dziri] [Have o1 Models Cracked Human Reasoning?](https://substack.com/home/post/p-148782195) +- [Brad Hilton] [How I Think OpenAI’s o1 Model Works and How I Think it Was Trained](https://lapis-nova-b3f.notion.site/How-I-Think-OpenAI-s-o1-Model-Works-and-How-I-Think-it-Was-Trained-11362e1157a18094ab35dcb42f5fad41) ## Talks - [Noam Brown] [Parables on the Power of Planning in AI: From Poker to Diplomacy](https://www.youtube.com/watch?app=desktop&v=eaAonE58sLU)