Release v1.3.0 — /compact, Thinking Display, Auto-Retry, 20+ Context Windows · BlockRunAI/franklin

What's New

/compact Command

Manually compress conversation history to save tokens:

> /compact
Compacted: ~45,200 → ~12,800 tokens

Thinking Content Display

Ink UI now shows a preview of the model's thinking process instead of just a spinner.

Transient Error Auto-Retry

Network timeouts, 429 rate limits, and 500/502/503 server errors now automatically retry with exponential backoff (1s → 2s → 4s, up to 3 attempts). Sessions no longer die on momentary hiccups.

Expanded Context Window Registry

Added 20+ model context windows (xAI Grok, GLM-5, Minimax, O3/O4, GPT-4.1, Gemini 3.1, etc.) plus pattern-based inference for unknown models. Compaction thresholds are now accurate across all providers.

Other Improvements

System prompt: All 11 tools documented with constraints
16 model shortcuts synced to terminal picker
Token estimation: +16 token overhead for tool_use framing
Router: Code block detection + byte-length estimation
Glob: Only ** triggers recursion (was over-recursing)
WebSearch: Fallback parser when DuckDuckGo changes HTML
First-run tips for new users
GLM promo auto-expiry fallback to Gemini Flash

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.3.0 — /compact, Thinking Display, Auto-Retry, 20+ Context Windows

Choose a tag to compare

Sorry, something went wrong.