Raynergy-svg
diff --git a/.claude/learnings.md b/.claude/learnings.md
Lines changed: 21 additions & 0 deletions
diff --git a/.claude/ralph/prd.json b/.claude/ralph/prd.json
Lines changed: 213 additions & 0 deletions
diff --git a/.claude/ralph/progress.txt b/.claude/ralph/progress.txt
Lines changed: 11 additions & 0 deletions
diff --git a/.claude/rules/improvement.md b/.claude/rules/improvement.md
Lines changed: 24 additions & 0 deletions
diff --git a/.claude/rules/trading.md b/.claude/rules/trading.md
Lines changed: 25 additions & 0 deletions
diff --git a/.claude/state.json b/.claude/state.json
Lines changed: 37 additions & 0 deletions
diff --git a/CLAUDE.md b/CLAUDE.md
Lines changed: 41 additions & 0 deletions
diff --git a/buddy_scanner.py b/buddy_scanner.py
Lines changed: 37 additions & 6 deletions
diff --git a/cli/argparser.py b/cli/argparser.py
Lines changed: 2 additions & 2 deletions
diff --git a/cli/buddy_scanning.py b/cli/buddy_scanning.py
Lines changed: 39 additions & 4 deletions
@@ -0,0 +1,21 @@
+# Buddy Trading Learnings
+
+Date-stamped insights extracted from trade outcomes, scan analysis, and system behavior. Patterns that repeat 3+ times get promoted to `.claude/rules/`.
+
+---
+
+## 2026-03-17 — Session 1 (Initial Deployment)
+
+- [2026-03-17] **keras_compat**: Keras 3.x rejects `seed` parameter in Dense/Conv/MHA layers. Fix: strip seed in keras_model_loader.py. Keras 2 models load cleanly after this.
+- [2026-03-17] **uncertainty_blocking**: Hard circuit-breaker (uncertainty agent blocks ALL trades when confidence <60%) killed every setup. Soft penalty (proportional confidence reduction) allows good trades through while still discounting uncertain ones.
+- [2026-03-17] **sl_tp_method**: Hardcoded 15/30 pip SL/TP was wrong for every pair. ATR-based dynamic sizing (SL=1.2x ATR, TP=2.0x ATR) adapts to actual volatility. EUR_USD ATR=13.6p, USD_JPY ATR=27.3p — one size never fits all.
+- [2026-03-17] **position_sizing**: 0.025 lots on $100K account = meaningless. Risk-per-trade 5% base with 2.5x medium-confidence multiplier produces 2.5 lot positions that actually move the needle.
+- [2026-03-17] **correlation_filter**: Without a correlation filter, the system would open EUR_USD LONG + GBP_USD LONG + AUD_USD SHORT together, stacking overlapping USD exposure instead of independent bets. Correlation groups prevent this.
+
+## 2026-03-17 — Session 1 (Trade Outcomes)
+
+- [2026-03-17] **pair_behavior/EUR_USD**: EUR_USD LONG signaled twice (67% conf), lost both times. Trade #905 closed at -0.5p; trade #919 hit full SL at -25.1p (-$627.50). Model direction was wrong: EUR_USD was actually bearish despite the LONG signal.
+- [2026-03-17] **pair_behavior/NZD_USD**: NZD_USD SHORT signaled twice, won both times. Trade #899 won +2.9p, trade #923 hit TP at +9.7p (+$242.50). Strong consistent signal.
+- [2026-03-17] **sl_tp/EUR_USD**: EUR_USD #919 hit exact SL price (1.1473) — 25.1 pips from entry. Move was decisive, no bounce. When a trade goes against you hard in the first hour, SL does its job. Guardian couldn't help because the move was continuous.
+- [2026-03-17] **agent_accuracy**: Trades with higher weighted_vote_score (0.68 for NZD_USD) performed better than lower (0.65 for EUR_USD). Higher consensus correlates with better outcomes.
+- [2026-03-17] **sizing**: Net session P/L with 2.5 lot trades: -$385 (NZD +$242.50, EUR -$627.50). One SL hit on 2.5 lots costs $627.50. Position sizing is correct, but directional accuracy must improve for the system to be profitable.
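The sizing and SL/TP figures in these learnings can be checked with a few lines of arithmetic. This is a minimal sketch, not code from the PR: the helper names `atr_sl_tp` and `pnl_usd` are hypothetical, and the $10 per pip per standard lot value is an assumption (it is the usual figure for USD-quoted pairs such as EUR_USD and NZD_USD, but the diff does not state it).

```python
# Hypothetical helpers; names and the pip value are assumptions, not code from this PR.
USD_PER_PIP_PER_LOT = 10.0  # assumed: one standard lot on a USD-quoted pair


def atr_sl_tp(atr_pips: float, sl_mult: float = 1.2, tp_mult: float = 2.0) -> tuple[float, float]:
    """Return (SL, TP) distances in pips, scaled from the ATR."""
    return atr_pips * sl_mult, atr_pips * tp_mult


def pnl_usd(pips: float, lots: float) -> float:
    """Profit or loss in USD for a move of `pips` on `lots` standard lots."""
    return pips * lots * USD_PER_PIP_PER_LOT


if __name__ == "__main__":
    # ATR values quoted in the sl_tp_method learning.
    for pair, atr in (("EUR_USD", 13.6), ("USD_JPY", 27.3)):
        sl, tp = atr_sl_tp(atr)
        print(f"{pair}: SL {sl:.1f}p, TP {tp:.1f}p, R:R {tp / sl:.2f}")

    # Trades #923 and #919 at 2.5 lots.
    net = pnl_usd(9.7, 2.5) + pnl_usd(-25.1, 2.5)
    print(f"net session P/L from these two trades: {net:.2f}")
```

Under that pip-value assumption, the two 2.5-lot trades come to +$242.50 and -$627.50, matching the net of -$385 recorded in the sizing entry. The default multipliers give an R:R of 2.0 / 1.2, roughly 1.67.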
@@ -0,0 +1,11 @@
+# Ralph Progress - Buddy Self-Improvement Loop
+
+## Stories: 0/12 completed
+
+## Context
+- ML Engine FX trading bot (Buddy)
+- Currently: scan → trade → RL feedback loop works
+- Missing: persistent learning, cross-session memory, adaptive config
+- Key files: src/scanner/automation/continuous.py, src/scanner/execution.py, src/scanner/agents.py
+- Trade journal: trained_data/trade_journal_rl.json (5 trades, 3 won / 2 lost)
+- Account: OANDA practice, NAV ~$102,580
@@ -0,0 +1,24 @@
+# Improvement Rules
+
+Meta-rules governing how Buddy learns and evolves.
+
+## Learning Triggers
+- Every closed trade triggers learning extraction (analyze outcome vs prediction)
+- Every losing trade > $100 triggers deep analysis (LLM-assisted if enabled)
+- Every 10 scan cycles triggers learnings audit (consolidation check)
+
+## Promotion Criteria
+- A pattern observed 3+ times in learnings.md gets promoted to rules/trading.md
+- Promoted rules include the date, source count, and specific actionable directive
+- Source learnings are marked [PROMOTED] after extraction
+
+## Consolidation
+- When learnings.md exceeds 30 entries: group by category, archive old entries
+- When rules/trading.md exceeds 50 lines: split by domain (entry rules vs risk rules)
+- When config_adjustments.json exceeds 100 entries: archive entries older than 30 days
+
+## Anti-Patterns
+- Never create new .claude/ files without justification — edit existing ones
+- Never let learnings accumulate without triage (apply / capture / dismiss)
+- Never evolve config silently — log every adjustment with reason
+- Never guess at stale state — read state.json, ask if unclear
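The promotion criterion (a pattern seen 3+ times gets promoted) could be checked mechanically. The sketch below is one possible reading, not the PR's implementation: it assumes a "pattern" is identified by the bold category tag of each learnings entry, which the rules file does not actually define. `promotion_candidates` is a hypothetical name.

```python
import re
from collections import Counter

# Matches entries such as "- [2026-03-17] **sl_tp/EUR_USD**: text".
ENTRY_RE = re.compile(r"^\s*-\s*\[\d{4}-\d{2}-\d{2}\]\s*\*\*([^*]+)\*\*:")


def promotion_candidates(lines, threshold=3):
    """Return {category: count} for categories seen at least `threshold` times.

    Assumption: the bold tag is the pattern key. Entries already marked
    [PROMOTED] are skipped so they are not promoted twice.
    """
    counts = Counter()
    for line in lines:
        if "[PROMOTED]" in line:
            continue
        match = ENTRY_RE.match(line)
        if match:
            counts[match.group(1).strip()] += 1
    return {category: n for category, n in counts.items() if n >= threshold}
```

Keying on the tag is deliberately coarse: entries such as `sl_tp/EUR_USD` and `sl_tp_method` would count as different patterns, so a real audit would likely need some normalization or manual triage.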
@@ -0,0 +1,25 @@
+# Trading Rules
+
+Imperative rules that actively gate Buddy's trading behavior. Promoted from repeated learnings.
+
+## Execution Gates
+- NEVER execute a trade with R:R ratio below 1.2:1 (TP_pips / SL_pips >= 1.2)
+- ALWAYS run correlation filter before execution to prevent double exposure
+- ALWAYS log every trade to trade_journal_rl.json with full gate/agent context
+- NEVER skip RL sync after a trade closes — outcomes must feed back to agent weights
+
+## Risk Management
+- Drawdown guardian runs every scan cycle — non-negotiable
+- Maximum portfolio risk: 15% of NAV across all open positions
+- Position sizing uses ATR-based SL (not hardcoded pips)
+- SL = ATR * atr_sl_multiplier, TP = ATR * atr_tp_multiplier
+
+## Agent Consensus
+- Higher weighted_vote_score (>0.65) correlates with better outcomes — prefer these
+- Uncertainty score > 0.45 is a warning signal — trade with caution
+- Model disagreement > 0.30 is a loss predictor — reduce confidence or skip
+
+## Session Discipline
+- Update .claude/state.json before session ends
+- Extract learnings from every trade outcome (win or loss)
+- Never re-enter a position at the same SL/TP if the entry price has changed — recalculate
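Two of these rules are simple numeric gates: the minimum R:R of 1.2 and the 15% portfolio risk cap. A minimal sketch of how they might be evaluated follows. `Setup` and `gate_failures` are hypothetical names, and measuring portfolio risk as the summed stop-loss risk in USD is an assumption; the rules file does not say how risk is measured.

```python
from dataclasses import dataclass


@dataclass
class Setup:
    """Hypothetical container for one candidate trade."""
    tp_pips: float
    sl_pips: float
    risk_usd: float  # assumed meaning: amount lost if the stop is hit


def gate_failures(setup, nav, open_risk_usd, min_rr=1.2, max_portfolio_risk=0.15):
    """Return the reasons a setup is blocked; an empty list means it may proceed."""
    reasons = []
    # R:R gate: TP_pips / SL_pips must be at least min_rr.
    if setup.sl_pips <= 0 or setup.tp_pips / setup.sl_pips < min_rr:
        reasons.append("rr_below_minimum")
    # Portfolio cap: existing risk plus this trade must stay within the NAV share.
    if open_risk_usd + setup.risk_usd > max_portfolio_risk * nav:
        reasons.append("portfolio_risk_exceeded")
    return reasons
```

Returning reasons rather than a boolean keeps the gate context available for the trade journal, which the execution rules ask for.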
@@ -0,0 +1,37 @@
+{
+  "goal": "Autonomous scan-trade-learn loop with meaningful position sizing and self-improvement",
+  "status": "in-progress",
+  "done": [
+    "Keras 3 model loader fix (seed param)",
+    "Soft uncertainty blocking (proportional penalty)",
+    "ATR-based dynamic SL/TP",
+    "Position sizing scaled to account ($100K → 2.5 lot trades)",
+    "Correlation filter for double-exposure prevention",
+    "RL feedback loop (agent weights from trade outcomes)",
+    "Drawdown guardian with trailing SL",
+    "Multi-timeframe confirmation agent (H1/H4/D1)",
+    "Pair performance agent (historical win-rate gating)",
+    "Agent weight decay (prevents RL overfitting)",
+    "Minimum R:R ratio gate (1.2:1)",
+    "Scan cycle JSONL logging",
+    "Per-pair performance tracking",
+    "Bootstrap .claude/ learning infrastructure"
+  ],
+  "next": "Run next scan cycle with improved position sizing and analyze results",
+  "open_questions": [
+    "EUR_USD model predicted LONG but pair was bearish — is the model stale or is this a regime issue?",
+    "Should we increase min_confidence threshold given EUR_USD losses at 67% confidence?",
+    "Walk-forward orchestrator (US-005 from old PRD) still deferred — needs train-joint run first"
+  ],
+  "last_updated": "2026-03-17T06:15:00Z",
+  "portfolio_snapshot": {
+    "nav": 102580.84,
+    "open_trades": 0,
+    "total_realized_pnl": -381.42,
+    "session_trades": 5,
+    "session_wins": 3,
+    "session_losses": 2,
+    "win_rate": 0.60
+  },
+  "improvement_focus": "Self-improvement loop infrastructure (learnings → rules → config adaptation)"
+}
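Because `state.json` is written by hand at the end of a session, a quick consistency check on `portfolio_snapshot` may catch stale numbers. This sketch is illustrative only: `snapshot_problems` is a hypothetical helper, and it assumes every session trade is counted as either a win or a loss (no break-even category).

```python
def snapshot_problems(state: dict) -> list[str]:
    """Return human-readable inconsistencies found in portfolio_snapshot."""
    snap = state.get("portfolio_snapshot", {})
    trades = snap.get("session_trades", 0)
    wins = snap.get("session_wins", 0)
    losses = snap.get("session_losses", 0)

    problems = []
    if wins + losses != trades:
        problems.append("session_wins + session_losses != session_trades")
    if trades and abs(snap.get("win_rate", 0.0) - wins / trades) > 1e-6:
        problems.append("win_rate does not equal session_wins / session_trades")
    return problems
```

The snapshot in this hunk passes both checks: 3 wins plus 2 losses equals 5 trades, and 3 / 5 equals the recorded `win_rate` of 0.60.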
@@ -0,0 +1,41 @@
+# ML Engine (Buddy) - FX Trading Bot
+
+Autonomous ML-powered forex trading system. Scans markets, evaluates setups through multi-agent consensus, executes on OANDA, and learns from outcomes.
+
+## Architecture
+```
+Scanner (engine.py) → Agents (agents.py) → Gates → Execution (execution.py) → OANDA
+     ↑                                                        ↓
+     └── Config Tuner ← Rules ← Learnings ← RL Feedback ←── Trade Outcomes
+```
+
+## Core Loop
+1. **Scan**: Multi-pair analysis with TCN/Ridge/RF ensemble models
+2. **Agents**: Trend, volatility, uncertainty, multi-timeframe, pair performance
+3. **Gates**: Confidence, momentum, risk — all must pass
+4. **Execute**: ATR-based SL/TP, regime-aware position sizing
+5. **Monitor**: Drawdown guardian, trailing SL, real-time P/L
+6. **Learn**: RL weight updates, trade journal, pattern extraction
+
+## Key Decisions
+- Soft uncertainty blocking (confidence penalty) over hard circuit breaker
+- ATR-based dynamic SL/TP over hardcoded pip values
+- Correlation filter prevents double exposure on correlated pairs
+- Minimum R:R ratio 1.2:1 gate before execution
+- Position sizing scales to account size (5% base risk on practice)
+
+## Self-Improvement
+- Learnings: `.claude/learnings.md` — date-stamped insights from trade outcomes
+- Rules: `.claude/rules/` — promoted patterns that actively gate behavior
+- State: `.claude/state.json` — session continuity across context windows
+- Config: `.claude/config_adjustments.json` — adaptive parameter tuning
+
+## Key Files
+- `buddy_scanner.py` — CLI entry point (scan/watch/trade/learn)
+- `src/scanner/engine.py` — Core scanner with model ensemble
+- `src/scanner/agents.py` — Sub-inference agent team
+- `src/scanner/execution.py` — OANDA trade execution + RL sync
+- `src/scanner/automation/continuous.py` — Watch mode loop
+- `src/risk/position_sizing.py` — Regime-aware position sizer
+- `trained_data/trade_journal_rl.json` — Trade outcomes for RL
+- `trained_data/models/agent_weights.json` — Learned agent weights
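The first key decision, a soft confidence penalty instead of a hard circuit breaker, is easiest to see side by side. The formula below is an assumption made for illustration: the diff only describes the penalty as a "proportional confidence reduction". The function names and the `strength` parameter are hypothetical; the 0.45 cap is borrowed from the warning level in `rules/trading.md`.

```python
def hard_block(confidence, uncertainty, max_uncertainty=0.45):
    """Circuit breaker: any setup over the uncertainty cap is zeroed out."""
    return 0.0 if uncertainty > max_uncertainty else confidence


def soft_penalty(confidence, uncertainty, max_uncertainty=0.45, strength=0.5):
    """Scale confidence down in proportion to how far uncertainty exceeds the cap.

    Assumed formula, not taken from the PR: the excess is normalized by the
    remaining headroom (1 - cap) and multiplied by `strength`.
    """
    excess = max(0.0, uncertainty - max_uncertainty)
    reduction = min(1.0, strength * excess / (1.0 - max_uncertainty))
    return confidence * (1.0 - reduction)


if __name__ == "__main__":
    for u in (0.40, 0.50, 0.80):
        print(u, hard_block(0.70, u), round(soft_penalty(0.70, u), 3))
```

With the hard version a setup slightly over the cap is discarded outright, whereas the soft version leaves it with reduced confidence for the downstream confidence gate to accept or reject.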
@@ -33,7 +33,9 @@ def __init__(
     @staticmethod
     def _approval_holdback_reason(analysis: PairAnalysis) -> Optional[str]:
         """Summarize why a directional setup did not clear full approval."""
-        if analysis.error or analysis.gates_passed:
+        if analysis.error:
+            return None
+        if analysis.is_tradeable:
             return None
 
         why_no_trade = list(getattr(analysis, "why_no_trade", []) or [])
@@ -84,7 +86,7 @@ def _render_clean_output(
             direction = (a.direction or "HOLD").upper()
             session_blocked = bool(a.error and str(a.error).lower().startswith("outside trading session"))
             market_closed = bool(a.error and str(a.error).lower().startswith("fx market closed"))
-            status = "TRADEABLE" if a.gates_passed else (
+            status = "TRADEABLE" if a.is_tradeable else (
                 "CLOSED" if market_closed else ("SESSION" if session_blocked else ("ERROR" if a.error else "WATCH"))
             )
             conf = int(round(float(a.confidence) * 100))
@@ -116,17 +118,29 @@ def _render_clean_output(
             else:
                 agent_text = "not-run"
 
+            # MTF confluence tag
+            mtf_reason = next((ar for ar in (a.agent_reasons or []) if ar.get("name") == "multi_timeframe"), None)
+            mtf_tag = ""
+            if mtf_reason:
+                mtf_count = mtf_reason.get("metadata", {}).get("confluence_count", 0)
+                mtf_tag = f", MTF {mtf_count}/3"
+
             master = a.master_pair.replace("_", "/") if a.master_pair else pair
             c.print(
                 f"   [{planner_slate}]why:[/{planner_slate}] core gates M{m_gate} A{a_gate} R{r_gate} ({a.gate_summary}), "
-                f"agent {agent_text}, master [{planner_sand}]{master}[/{planner_sand}]"
+                f"agent {agent_text}{mtf_tag}, master [{planner_sand}]{master}[/{planner_sand}]"
             )
 
-            if a.gates_passed:
+            if a.is_tradeable:
                 promoted = " [agent-promoted]" if getattr(a, "agent_promoted", False) else ""
+                soft_tag = ""
+                for reason in (a.agent_reasons or []):
+                    if reason.get("reason_code") == "uncertainty_soft_penalty":
+                        soft_tag = " [soft-penalized]"
+                        break
                 c.print(
                     f"   [{planner_slate}]plan:[/{planner_slate}] "
-                    f"[{planner_cyan}]SL {a.sl_pips:.0f} | TP {a.tp_pips:.0f}{promoted}[/{planner_cyan}]"
+                    f"[{planner_cyan}]SL {a.sl_pips:.0f} | TP {a.tp_pips:.0f}{promoted}{soft_tag}[/{planner_cyan}]"
                 )
             else:
                 holdback_reason = self._approval_holdback_reason(a)
@@ -136,7 +150,7 @@ def _render_clean_output(
                         f"[{planner_sand}]{holdback_reason}[/{planner_sand}]"
                     )
 
-        tradeable = [a.pair.replace("_", "/") for a in analyses if a.gates_passed]
+        tradeable = [a.pair.replace("_", "/") for a in analyses if a.is_tradeable]
         c.print()
         if tradeable:
             c.print(f"[{planner_cyan}]Tradeable:[/{planner_cyan}] [{planner_sand}]{', '.join(tradeable)}[/{planner_sand}]")
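These hunks replace `gates_passed` with `is_tradeable`, but the property's definition is not part of the visible diff. The sketch below assumes it wraps the predicate that the `cli/buddy_scanning.py` hunk removes (`direction in {"LONG", "SHORT"} and gates_passed and error is None`). `PairAnalysisSketch` is a hypothetical stand-in, and the real property may well include other paths such as agent promotion.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PairAnalysisSketch:
    """Hypothetical stand-in for PairAnalysis; only the fields used here."""
    pair: str
    direction: Optional[str] = None
    gates_passed: bool = False
    error: Optional[str] = None

    @property
    def is_tradeable(self) -> bool:
        # Assumed definition: the same three conditions the old list
        # comprehension in cli/buddy_scanning.py spelled out inline.
        return (
            self.error is None
            and self.gates_passed
            and self.direction in {"LONG", "SHORT"}
        )
```

Centralizing the predicate means the renderer, the holdback summary, and the execution path cannot drift apart on what counts as tradeable.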
@@ -178,9 +192,26 @@ def scan(
             "weighted_vote_threshold",
             "sub_inference_min_confidence",
             "sub_inference_vote_threshold",
+            "sub_inference_max_candidates",
             "agent_promotion_min_confidence",
             "max_uncertainty_score",
             "max_model_disagreement",
+            "soft_uncertainty_blocking",
+            "use_rl_sizer",
+            "use_rl_gates",
+            "use_rl_exits",
+            "enable_agent_trade_promotion",
+            "atr_sl_multiplier",
+            "atr_tp_multiplier",
+            "min_sl_pips",
+            "max_sl_pips",
+            "min_tp_pips",
+            "max_tp_pips",
+            "high_prob_threshold",
+            "high_prob_tp_bonus",
+            "enable_multi_timeframe_agent",
+            "enable_pair_performance_agent",
+            "min_risk_reward_ratio",
         )
         prev_profile_values = {
             name: getattr(self._scanner.config, name)
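The hunk snapshots the listed config attributes into `prev_profile_values`, presumably so they can be restored after a profile-tuned scan. The restore code lies outside the visible hunk, so that is an assumption. One way to make the save/restore pairing hard to get wrong is a context manager; `temporary_overrides` is a hypothetical name and `SimpleNamespace` stands in for the scanner config.

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def temporary_overrides(config, overrides):
    """Apply attribute overrides to `config`, restoring the old values on exit."""
    previous = {name: getattr(config, name) for name in overrides}
    try:
        for name, value in overrides.items():
            setattr(config, name, value)
        yield config
    finally:
        # Runs even if the scan raises, so a failed run cannot leak profile settings.
        for name, value in previous.items():
            setattr(config, name, value)


if __name__ == "__main__":
    cfg = SimpleNamespace(min_risk_reward_ratio=1.2, atr_sl_multiplier=1.2)
    with temporary_overrides(cfg, {"min_risk_reward_ratio": 1.5}):
        print("inside:", cfg.min_risk_reward_ratio)
    print("after:", cfg.min_risk_reward_ratio)
```

Deriving the snapshot keys from the overrides themselves also removes the need to keep a separate tuple of attribute names in sync whenever a profile gains a new parameter.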
 
@@ -405,9 +405,9 @@ def _add_scan_arguments(parser: argparse.ArgumentParser) -> None:
     parser.add_argument(
         "--profile",
         type=str,
-        choices=["conservative", "balanced", "aggressive"],
+        choices=["conservative", "balanced", "aggressive", "smart"],
         default="balanced",
-        help="For buddy/scan: gate profile tuning (conservative|balanced|aggressive)",
+        help="For buddy/scan: gate profile tuning (conservative|balanced|aggressive|smart)",
     )
     parser.add_argument(
         "--clean-output",
 
@@ -29,6 +29,41 @@
 from src.utils import load_config
 
 
+def _filter_correlated_exposure(candidates: list) -> list:
+    """Remove candidates that would double exposure on correlation groups already open."""
+    try:
+        from src.scanner.execution import ExecutionManager
+        from src.training.correlation_group_config import get_correlation_group
+
+        em = ExecutionManager()
+        open_trades = em.monitor_open_trades()
+        if not open_trades:
+            return candidates
+
+        open_groups: set = set()
+        open_pairs: set = set()
+        for t in open_trades:
+            pair = t.get("pair", "")
+            open_pairs.add(pair)
+            group = get_correlation_group(pair)
+            if group and group.master_pair:
+                open_groups.add(group.master_pair)
+
+        filtered = []
+        for a in candidates:
+            if a.pair in open_pairs:
+                console.print(f"[dim]  skip {a.pair}: already open[/dim]")
+                continue
+            group = get_correlation_group(a.pair)
+            if group and group.master_pair in open_groups:
+                console.print(f"[dim]  skip {a.pair}: correlated with open {group.master_pair} group[/dim]")
+                continue
+            filtered.append(a)
+        return filtered
+    except Exception:
+        return candidates
+
+
 def buddy_scan(
     config_path: str = DEFAULT_CONFIG_PATH,
     *,
@@ -128,16 +163,16 @@ def buddy_scan(
             try:
                 # Enable execution path for this scan call.
                 scanner._scanner.config.enable_execution = True
-                tradeable_candidates = [
-                    r for r in results
-                    if r.direction in {"LONG", "SHORT"} and r.gates_passed and r.error is None
-                ]
+                tradeable_candidates = [r for r in results if r.is_tradeable]
                 confirmed_candidates = [
                     r for r in tradeable_candidates
                     if r.agent_total > 0 and r.agent_passed
                 ]
                 execution_candidates = confirmed_candidates or tradeable_candidates
 
+                # Filter correlated exposure (don't double up on same group)
+                execution_candidates = _filter_correlated_exposure(execution_candidates)
+
                 if execution_candidates:
                     execution_results = scanner._scanner.execute_trades(
                         execution_candidates,
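As written in this diff, `_filter_correlated_exposure` compares candidates only against trades that are already open, so two candidates from the same correlation group in a single scan would both pass when nothing is open yet. The sketch below shows a variant that also reserves the group within the batch. It is a simplified illustration under assumptions, not the PR's code: it works on pair names rather than `PairAnalysis` objects, and `group_of` with its mapping is hypothetical rather than the real `get_correlation_group`.

```python
def filter_correlated(candidates, open_pairs, group_of):
    """Drop candidates whose pair or correlation group is already exposed.

    candidates: pair names in priority order
    open_pairs: pairs that currently have an open trade
    group_of:   callable mapping a pair to a group key, or None if ungrouped
    """
    taken_pairs = set(open_pairs)
    taken_groups = {g for g in map(group_of, open_pairs) if g is not None}

    kept = []
    for pair in candidates:
        group = group_of(pair)
        if pair in taken_pairs or (group is not None and group in taken_groups):
            continue
        kept.append(pair)
        # Difference from the diff: reserve this pair's group for the rest of the batch.
        taken_pairs.add(pair)
        if group is not None:
            taken_groups.add(group)
    return kept


if __name__ == "__main__":
    # Hypothetical grouping, keyed by master pair.
    groups = {"EUR_USD": "EUR_USD", "GBP_USD": "EUR_USD", "AUD_USD": "AUD_USD", "NZD_USD": "AUD_USD"}
    print(filter_correlated(["EUR_USD", "GBP_USD", "USD_JPY"], [], groups.get))
```

Because candidates are processed in order, whichever setup is listed first wins its group, so the caller would want to sort by confidence or vote score before filtering.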