You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+23-35Lines changed: 23 additions & 35 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,6 +2,10 @@
2
2
3
3
Language-agnostic AI agent skills that enforce fundamental programming principles. This repository provides specific, granular instructions that enable AI coding assistants to produce significantly higher-quality code that adheres to robust engineering standards.
4
4
5
+
| Dashboard Explorer | Code Comparison | Judge Reasoning |
Adopting these skills measurably changes the output of AI models, shifting them from generating merely functional code to producing architecturally sound solutions.
6
10
7
11
## Table of Contents
@@ -15,66 +19,50 @@ Adopting these skills measurably changes the output of AI models, shifting them
15
19
16
20
## Installation
17
21
18
-
Select your platform for specific setup instructions:
The core of this repository is the `skills/` directory. Each skill is encapsulated in its own subdirectory following the `ps-<name>` convention (e.g., `ps-composition-over-coordination`).
28
29
29
30
We use this granular structure because:
31
+
30
32
1.**Focus**: It allows the AI to load only the relevant context for a specific task, avoiding context window pollution.
31
33
2.**Modularity**: Skills can be improved, versioned, and tested independently.
32
34
3.**Composability**: Users can select the specific combination of principles they want to enforce for their project.
33
35
36
+
## Skill Integration
37
+
38
+
Skills should live under the `skills/` directory as `SKILL.md` files. For a full integration guide and documentation index:
39
+
40
+
```
41
+
https://agentskills.io/integrate-skills
42
+
https://agentskills.io/llms.txt
43
+
```
44
+
34
45
## Validation & Testing
35
46
36
47
Every skill is validated against a rigorous testing suite found in the `tests/` directory.
37
48
38
49
-**Automated Judging**: We use an LLM-as-a-Judge approach. The system compares the output of a "Baseline" model (without the skill) against a "Skill" model (with the skill loaded).
39
-
-**Semantics over Syntax**: The test does not just look for passing unit tests; it analyzes the *logic* and *structure* of the code.
50
+
-**Semantics over Syntax**: The test does not just look for passing unit tests; it analyzes the _logic_ and _structure_ of the code.
40
51
-**Evidence-Based**: The judge identifies the specific lines of code that demonstrate adherence to or violation of the principle.
41
52
42
53
[Read our Case Study on Judge Fairness](docs/judge-fairness-case-study.md) to see how the system fairly evaluates architectural quality, even when it means failing the Skill model.
43
54
44
55
## Evaluation Results
45
56
46
-
Processed 24 evaluation(s).
47
-
48
-
| Test Name | Model | Baseline | With Skill | Cases Pass | Winner |
|[results-ollama-devstral-small-2--24b-cloud-ps-explicit-boundaries-adapters](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329535854)| devstral-small-2:24b-cloud | good | outstanding | ✅ 2/2 | With Skill |
53
-
|[results-ollama-devstral-small-2--24b-cloud-ps-explicit-ownership-lifecycle](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329535894)| devstral-small-2:24b-cloud | good | good | ✅ 2/2 | With Skill |
54
-
|[results-ollama-devstral-small-2--24b-cloud-ps-explicit-state-invariants](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329537422)| devstral-small-2:24b-cloud | good | outstanding | ✅ 2/2 | With Skill |
55
-
|[results-ollama-devstral-small-2--24b-cloud-ps-functional-core-imperative-shell](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329537046)| devstral-small-2:24b-cloud | regular | good | ✅ 2/2 | With Skill |
56
-
|[results-ollama-devstral-small-2--24b-cloud-ps-illegal-states-unrepresentable](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329538523)| devstral-small-2:24b-cloud | good | outstanding | ✅ 2/2 | With Skill |
57
-
|[results-ollama-devstral-small-2--24b-cloud-ps-local-reasoning](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329538780)| devstral-small-2:24b-cloud | good | outstanding | ✅ 2/2 | With Skill |
58
-
|[results-ollama-devstral-small-2--24b-cloud-ps-minimize-mutation](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329540068)| devstral-small-2:24b-cloud | good | good | ✅ 2/2 | N/A |
59
-
|[results-ollama-devstral-small-2--24b-cloud-ps-naming-as-design](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329540040)| devstral-small-2:24b-cloud | regular | good | ✅ 2/2 | With Skill |
60
-
|[results-ollama-devstral-small-2--24b-cloud-ps-policy-mechanism-separation](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329541792)| devstral-small-2:24b-cloud | good | outstanding | ✅ 2/2 | With Skill |
61
-
|[results-ollama-devstral-small-2--24b-cloud-ps-single-direction-data-flow](https://github.com/Ariel-Rodriguez/programming-skills/actions/runs/21547621647/artifacts/5329541535)| devstral-small-2:24b-cloud | regular | good | ✅ 2/2 | With Skill |
0 commit comments