Skip to content

Commit 9bd9ab3

Browse files
committed
explain the runbook file category and use claude.md
1 parent 5cb664f commit 9bd9ab3

File tree

1 file changed

+54
-5
lines changed

1 file changed

+54
-5
lines changed
Lines changed: 54 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,16 @@
11
You are an expert in automated diagnostics and runbook creation for an AI-driven troubleshooting agents. I will provide you with one or more issue descriptions or test scenarios.
22

3-
Your task is to generate a strictly executable runbook for AI Agent to follow. The runbook should be machine-readable but human-understandable, and must include:
3+
Your task is to generate a strictly executable runbook for AI Agent to follow. The runbook should be machine-readable but human-understandable, and must include the following sections:
44

5-
## Goal
5+
# Runbook Content Structure
6+
7+
## 1. Goal
68
- **Primary Objective:** Clearly define the specific category of issues this runbook addresses (e.g., "diagnose network connectivity problems", "troubleshoot pod startup failures", "investigate performance degradation").
79
- **Scope:** Specify the environment, technology stack, or system components covered by this runbook.
810
- **Agent Mandate:** Explicitly state that the AI agent must follow the workflow steps sequentially and systematically without deviation to ensure consistent, thorough troubleshooting.
911
- **Expected Outcome:** Define what successful completion of this runbook should achieve (root cause identification, issue resolution, or escalation criteria).
1012

11-
## Workflow for [Issue Category] Diagnosis
13+
## 2. Workflow for [Issue Category] Diagnosis
1214
- Provide numbered, sequential steps the AI agent must execute in order.
1315
- Each step should specify:
1416
- **Action:** Describe the diagnostic function conceptually (e.g., "retrieve container logs from specified pod", "check service connectivity between components", "examine resource utilization metrics")
@@ -20,17 +22,64 @@ Your task is to generate a strictly executable runbook for AI Agent to follow. T
2022
- Describe functions generically so they can be mapped to available tools (e.g., "execute a command to test network connectivity" rather than "ping_host()")
2123
- Include verification steps to confirm each diagnostic action was successful.
2224

23-
## Synthesize Findings
25+
## 3. Synthesize Findings
2426
- **Data Correlation:** Describe how the AI agent should combine outputs from multiple workflow steps.
2527
- **Pattern Recognition:** Specify what patterns, error messages, or metrics indicate specific root causes.
2628
- **Prioritization Logic:** Provide criteria for ranking potential causes by likelihood or severity.
2729
- **Evidence Requirements:** Define what evidence is needed to confidently identify each potential root cause.
2830
- **Example Scenarios:** Include sample synthesis statements showing how findings should be summarized.
2931

30-
## Recommended Remediation Steps
32+
## 4. Recommended Remediation Steps
3133
- **Immediate Actions:** List temporary workarounds or urgent fixes for critical issues.
3234
- **Permanent Solutions:** Provide step-by-step permanent remediation procedures.
3335
- **Verification Steps:** Define how to confirm each remediation action was successful.
3436
- **Documentation References:** Include links to official documentation, best practices, or vendor guidance.
3537
- **Escalation Criteria:** Specify when and how to escalate if remediation steps fail.
3638
- **Post-Remediation Monitoring:** Describe what to monitor to prevent recurrence.
39+
40+
# File Organization Guidelines
41+
42+
## Folder Structure
43+
*Category folders are used to distinguish and categorize different runbooks based on their focus area or technology domain. Each runbook must be placed into a specific category folder under `holmes/plugins/runbooks/` for better organization and discoverability. Create a new category folder if your runbook doesn't fit into existing categories.*
44+
45+
## File Naming
46+
*Use consistent naming conventions for runbook files:*
47+
48+
- Use descriptive, lowercase names with hyphens: `dns-resolution-troubleshooting.md`
49+
- Include the issue type or technology: `redis-connection-issues.md`
50+
- Avoid generic names like `troubleshooting.md` or `debug.md`
51+
52+
### Catalog Registration
53+
After creating your runbook, you must add an entry to `catalog.json` in the runbooks directory to make it discoverable by AI agents.
54+
55+
**Steps to add a new catalog entry:**
56+
57+
1. **Open** `holmes/plugins/runbooks/catalog.json`
58+
2. **Add your entry** to the JSON array following this structure:
59+
```json
60+
{
61+
"name": "Brief, descriptive name of the runbook",
62+
"path": "category-folder/your-runbook-filename.md",
63+
"description": "Clear description of what issues this runbook addresses",
64+
"tags": ["relevant", "tags", "for", "search"]
65+
}
66+
```
67+
68+
3. **Ensure proper JSON formatting** - add a comma after the previous entry if needed
69+
4. **Validate the JSON** is properly formatted before committing
70+
71+
**Field Guidelines:**
72+
- `name`: Keep concise but descriptive (e.g., "Redis Connection Issues")
73+
- `path`: Always include the category folder (e.g., "database/redis-connection-issues.md")
74+
- `description`: Explain what specific problems this runbook solves
75+
- `tags`: Include technology names, issue types, and relevant keywords
76+
77+
Example catalog entry:
78+
```json
79+
{
80+
"name": "DNS Resolution Troubleshooting",
81+
"path": "networking/dns-resolution-troubleshooting.md",
82+
"description": "Comprehensive guide for diagnosing and resolving DNS resolution issues in Kubernetes clusters",
83+
"tags": ["dns", "networking", "kubernetes", "troubleshooting"]
84+
}
85+
```

0 commit comments

Comments
 (0)