## v2.4.0 — Context Compression (15-40% Token Savings)

### 7-Layer Context Compression
Integrated BlockRun's compression library directly into the agent loop. It saves 15-40% of tokens automatically:
| Layer | Method | Savings |
|---|---|---|
| 1 | Deduplication | 2-5% |
| 2 | Whitespace normalization | 3-8% |
| 3 | Dictionary encoding (41 codes) | 4-8% |
| 4 | Path shortening | 1-3% |
| 5 | JSON compaction | 2-4% |
| 6 | Observation compression | 15-97% |
| 7 | Dynamic codebook | 5-15% |
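To make layers 2 and 3 concrete, here is a minimal sketch of whitespace normalization and dictionary encoding. The codebook entries below are invented for illustration; the real library ships 41 codes, and its actual substitutions are not documented here.

```python
import re

# Hypothetical codebook (layer 3). The real library defines 41 codes;
# these two entries are made-up examples.
CODEBOOK = {
    "Traceback (most recent call last):": "§TB",
    "node_modules/": "§NM",
}

def normalize_whitespace(text: str) -> str:
    # Layer 2: collapse runs of spaces/tabs and strip trailing whitespace.
    lines = [re.sub(r"[ \t]+", " ", ln).rstrip() for ln in text.splitlines()]
    return "\n".join(lines)

def dictionary_encode(text: str) -> str:
    # Layer 3: replace frequent long substrings with short codes.
    for phrase, code in CODEBOOK.items():
        text = text.replace(phrase, code)
    return text

compressed = dictionary_encode(normalize_whitespace("foo    bar\t baz   "))
```

Both transforms are reversible in principle, which is what keeps them LLM-safe: the model sees a header mapping codes back to their expansions.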
How it works:
- Runs automatically when conversation has >10 messages
- Adds a header explaining codes to the model
- Layer 6 (observation) is the biggest win — summarizes large tool results to ~300 chars
- All layers are LLM-safe (model can still understand compressed text)
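Layer 6 can be approximated as follows. This is a simplified sketch: the real library summarizes tool results, while the version below just keeps the head and tail of an oversized result and notes what was dropped. The function name and the 300-character limit are taken from the description above; everything else is assumed.

```python
def compress_observation(result: str, limit: int = 300) -> str:
    # Layer 6 sketch: large tool results get reduced to roughly `limit` chars.
    # (Truncation shown here; the real implementation summarizes instead.)
    if len(result) <= limit:
        return result
    head = result[: limit // 2]
    tail = result[-(limit // 2 - 20):]
    omitted = len(result) - len(head) - len(tail)
    return f"{head}\n…[{omitted} chars omitted]…\n{tail}"
```

Because tool outputs (file dumps, test logs, API responses) dominate agent-loop token usage, even this crude form of the layer accounts for most of the savings range quoted in the table.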
This is the same compression used by BlockRun's backend, now running client-side for immediate token reduction.
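The "header explaining codes" step mentioned above might look like the sketch below. The header format and function name are assumptions for illustration, not the library's actual output.

```python
def build_code_header(codebook: dict[str, str]) -> str:
    # Hypothetical header prepended to the compressed context so the
    # model can decode the short codes back to their original phrases.
    rows = "\n".join(f"{code} = {phrase}" for phrase, code in codebook.items())
    return f"[compression codebook]\n{rows}\n[end codebook]"

# Invented example entry; the real library ships 41 codes.
CODEBOOK = {"Traceback (most recent call last):": "§TB"}
header = build_code_header(CODEBOOK)
```

Prepending the mapping once costs a few dozen tokens but lets every later occurrence of each phrase be replaced by a short code, which is where the net savings come from.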