A speculative attempt at reverse-engineering OpenAI's "Strawberry" (now o1) before its public release. Built on September 11, 2024 — one day before o1-preview was announced.
The idea: if you make an LLM spend more compute at inference time by critiquing and refining its own outputs, you get better answers. This implements a generate → critique → grade → refine loop using Llama 3.1 70B on AWS Bedrock.
- Generate 3 independent candidate answers to a prompt
- Critique each answer using the same LLM
- Grade each on a -100 to 100 scale based on the critique
- Select the best, refine it
- Repeat for N iterations
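The steps above can be sketched as a single Python function. This is an illustrative sketch, not the script's actual interface: the function names, prompt wording, and the `llm` callable are stand-ins (the real script calls Llama 3.1 70B through Bedrock, and its prompts and score parsing may differ).

```python
import re
from typing import Callable


def best_of_n_refine(prompt: str, llm: Callable[[str], str],
                     n_candidates: int = 3, n_iters: int = 2) -> str:
    """Generate -> critique -> grade -> refine loop.

    `llm` is any text-in/text-out model call (e.g. a Bedrock invocation).
    """
    # Generate independent candidate answers.
    candidates = [llm(prompt) for _ in range(n_candidates)]
    best = candidates[0]
    for _ in range(n_iters):
        scored = []
        for cand in candidates:
            # Critique each answer using the same LLM.
            critique = llm(f"Critique this answer:\n{cand}")
            # Grade on a -100..100 scale based on the critique.
            reply = llm(f"Given this critique, grade the answer "
                        f"from -100 to 100:\n{critique}")
            m = re.search(r"-?\d+", reply)
            score = max(-100, min(100, int(m.group()))) if m else -100
            scored.append((score, cand))
        # Select the best candidate, then refine it for the next round.
        best = max(scored, key=lambda s: s[0])[1]
        candidates = [llm(f"Improve this answer:\n{best}")]
    return best
```

Any callable that takes a prompt string and returns a completion string can be plugged in as `llm`, so the loop itself stays independent of the Bedrock client code.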
Requires boto3 and AWS CLI credentials configured.
python3 redfruit.py

Convert the output to readable HTML:

python3 htmlize.py