Skip to content

v1.3.0 — /compact, Thinking Display, Auto-Retry, 20+ Context Windows

Choose a tag to compare

@1bcMax 1bcMax released this 04 Apr 06:01
· 125 commits to main since this release
3bf3520

What's New

/compact Command

Manually compress conversation history to save tokens:

> /compact
Compacted: ~45,200 → ~12,800 tokens

Thinking Content Display

Ink UI now shows a preview of the model's thinking process instead of just a spinner.

Transient Error Auto-Retry

Network timeouts, 429 rate limits, and 500/502/503 server errors now automatically retry with exponential backoff (1s → 2s → 4s, up to 3 attempts). Sessions no longer die on momentary hiccups.

Expanded Context Window Registry

Added 20+ model context windows (xAI Grok, GLM-5, Minimax, O3/O4, GPT-4.1, Gemini 3.1, etc.) plus pattern-based inference for unknown models. Compaction thresholds are now accurate across all providers.

Other Improvements

  • System prompt: All 11 tools documented with constraints
  • 16 model shortcuts synced to terminal picker
  • Token estimation: +16 token overhead for tool_use framing
  • Router: Code block detection + byte-length estimation
  • Glob: Only ** triggers recursion (was over-recursing)
  • WebSearch: Fallback parser when DuckDuckGo changes HTML
  • First-run tips for new users
  • GLM promo auto-expiry fallback to Gemini Flash