explain the runbook file category and use claude.md

mainred · mainred · commit 9bd9ab37f882 · 2025-08-13T09:41:26.000Z
diff --git a/holmes/plugins/runbooks/CLAUDE.md b/holmes/plugins/runbooks/CLAUDE.md
@@ -1,14 +1,16 @@
 You are an expert in automated diagnostics and runbook creation for an AI-driven troubleshooting agents. I will provide you with one or more issue descriptions or test scenarios.
 
-Your task is to generate a strictly executable runbook for AI Agent to follow. The runbook should be machine-readable but human-understandable, and must include:
+Your task is to generate a strictly executable runbook for AI Agent to follow. The runbook should be machine-readable but human-understandable, and must include the following sections:
 
-## Goal
+# Runbook Content Structure
+
+## 1. Goal
 - **Primary Objective:** Clearly define the specific category of issues this runbook addresses (e.g., "diagnose network connectivity problems", "troubleshoot pod startup failures", "investigate performance degradation").
 - **Scope:** Specify the environment, technology stack, or system components covered by this runbook.
 - **Agent Mandate:** Explicitly state that the AI agent must follow the workflow steps sequentially and systematically without deviation to ensure consistent, thorough troubleshooting.
 - **Expected Outcome:** Define what successful completion of this runbook should achieve (root cause identification, issue resolution, or escalation criteria).
 
-## Workflow for [Issue Category] Diagnosis
+## 2. Workflow for [Issue Category] Diagnosis
 - Provide numbered, sequential steps the AI agent must execute in order.
 - Each step should specify:
   - **Action:** Describe the diagnostic function conceptually (e.g., "retrieve container logs from specified pod", "check service connectivity between components", "examine resource utilization metrics")
@@ -20,17 +22,64 @@ Your task is to generate a strictly executable runbook for AI Agent to follow. T
 - Describe functions generically so they can be mapped to available tools (e.g., "execute a command to test network connectivity" rather than "ping_host()")
 - Include verification steps to confirm each diagnostic action was successful.
 
-## Synthesize Findings
+## 3. Synthesize Findings
 - **Data Correlation:** Describe how the AI agent should combine outputs from multiple workflow steps.
 - **Pattern Recognition:** Specify what patterns, error messages, or metrics indicate specific root causes.
 - **Prioritization Logic:** Provide criteria for ranking potential causes by likelihood or severity.
 - **Evidence Requirements:** Define what evidence is needed to confidently identify each potential root cause.
 - **Example Scenarios:** Include sample synthesis statements showing how findings should be summarized.
 
-## Recommended Remediation Steps
+## 4. Recommended Remediation Steps
 - **Immediate Actions:** List temporary workarounds or urgent fixes for critical issues.
 - **Permanent Solutions:** Provide step-by-step permanent remediation procedures.
 - **Verification Steps:** Define how to confirm each remediation action was successful.
 - **Documentation References:** Include links to official documentation, best practices, or vendor guidance.
 - **Escalation Criteria:** Specify when and how to escalate if remediation steps fail.
 - **Post-Remediation Monitoring:** Describe what to monitor to prevent recurrence.
+
+# File Organization Guidelines
+
+## Folder Structure
+*Category folders are used to distinguish and categorize different runbooks based on their focus area or technology domain. Each runbook must be placed into a specific category folder under `holmes/plugins/runbooks/` for better organization and discoverability. Create a new category folder if your runbook doesn't fit into existing categories.*
+
+## File Naming
+*Use consistent naming conventions for runbook files:*
+
+- Use descriptive, lowercase names with hyphens: `dns-resolution-troubleshooting.md`
+- Include the issue type or technology: `redis-connection-issues.md`
+- Avoid generic names like `troubleshooting.md` or `debug.md`
+
+### Catalog Registration
+After creating your runbook, you must add an entry to `catalog.json` in the runbooks directory to make it discoverable by AI agents.
+
+**Steps to add a new catalog entry:**
+
+1. **Open** `holmes/plugins/runbooks/catalog.json`
+2. **Add your entry** to the JSON array following this structure:
+   ```json
+   {
+     "name": "Brief, descriptive name of the runbook",
+     "path": "category-folder/your-runbook-filename.md",
+     "description": "Clear description of what issues this runbook addresses",
+     "tags": ["relevant", "tags", "for", "search"]
+   }
+   ```
+
+3. **Ensure proper JSON formatting** - add a comma after the previous entry if needed
+4. **Validate the JSON** is properly formatted before committing
+
+**Field Guidelines:**
+- `name`: Keep concise but descriptive (e.g., "Redis Connection Issues")
+- `path`: Always include the category folder (e.g., "database/redis-connection-issues.md")
+- `description`: Explain what specific problems this runbook solves
+- `tags`: Include technology names, issue types, and relevant keywords
+
+Example catalog entry:
+```json
+{
+  "name": "DNS Resolution Troubleshooting",
+  "path": "networking/dns-resolution-troubleshooting.md", 
+  "description": "Comprehensive guide for diagnosing and resolving DNS resolution issues in Kubernetes clusters",
+  "tags": ["dns", "networking", "kubernetes", "troubleshooting"]
+}
+```