Skip to content

geogod42/RedFruitCOT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RedFruitCOT

A speculative attempt at reverse-engineering OpenAI's "Strawberry" (now o1) before its public release. Built on September 11, 2024 — one day before o1-preview was announced.

The idea: if you make an LLM spend more compute at inference time by critiquing and refining its own outputs, you get better answers. This implements a generate → critique → grade → refine loop using Llama 3.1 70B on AWS Bedrock.

How It Works

  1. Generate 3 independent candidate answers to a prompt
  2. Critique each answer using the same LLM
  3. Grade each on a -100 to 100 scale based on the critique
  4. Select the best, refine it
  5. Repeat for N iterations

Usage

Requires boto3 and AWS CLI credentials configured.

python3 redfruit.py

Convert the output to readable HTML:

python3 htmlize.py

About

OpenAI "Strawberry" like chain of thought (COT) py script

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages