-
Notifications
You must be signed in to change notification settings - Fork 17
Description
π₯ Daily Health Check β 2026-03-07
Status: π΄ 1 critical Β· π‘ 11 warnings Β· π΅ 1 info
Since yesterday: π 13 new Β· β
0 resolved Β· π 0 unchanged
β οΈ This is the first health check run. All findings appear as new. Starting from the next run, only changes will be highlighted.
π§© Skill Inventory
Comprehensive health status of all skills derived from quality checks.
| Status | Component | Skill | Skilled | Vanilla | Ξ | Scenarios | Issues |
|---|---|---|---|---|---|---|---|
| π‘ Warning | dotnet | analyzing-dotnet-performance | 7.3 | 2 | 5.3 | 11 | timedOut |
| π‘ Warning | dotnet | android-tombstone-symbolication | 8.4 | 7.2 | 1.1 | 8 | timedOut |
| π‘ Warning | dotnet | clr-activation-debugging | 7.4 | 2.2 | 5.3 | 7 | timedOut |
| π΄ No Value | dotnet | csharp-scripts | 4.8 | 6 | -1.2 | 1 | overfitting, overfittingScore, timedOut |
| π‘ Warning | dotnet | dotnet-aot-compat | 3.6 | 2.4 | 1.2 | 1 | timedOut |
| π’ OK | dotnet | dotnet-pinvoke | 10 | 8 | 2 | 2 | - |
| π΄ No Value | dotnet | dotnet-trace-collect | 3.9 | 5.5 | -1.6 | 16 | timedOut |
| π‘ Warning | dotnet | dump-collect | 7.8 | 6.5 | 1.2 | 9 | timedOut |
| π‘ Warning | dotnet | microbenchmarking | 4 | 3.2 | 0.8 | 1 | timedOut |
| π΄ Critical | dotnet | migrate-nullable-references | 8.7 | 8.7 | 0 | 3 | notActivated, timedOut |
| π‘ Warning | dotnet | nuget-trusted-publishing | 4.7 | 4.5 | 0.1 | 3 | timedOut |
| π‘ Warning | dotnet | optimizing-ef-core-queries | 9.6 | 8.8 | 0.8 | 1 | overfitting, overfittingScore |
| π’ OK | dotnet | thread-abort-migration | 9.8 | 8.1 | 1.7 | 5 | - |
| π΄ No Value | dotnet-msbuild | binlog-failure-analysis | 9.2 | 9.6 | -0.4 | 1 | - |
| π‘ Warning | dotnet-msbuild | binlog-generation | 10 | 4.1 | 5.9 | 3 | timedOut |
| π΄ No Value | dotnet-msbuild | build-parallelism | 2 | 2 | 0 | 1 | timedOut |
| π΄ No Value | dotnet-msbuild | build-perf-baseline | 2.4 | 5.2 | -2.8 | 1 | overfitting, overfittingScore, timedOut |
| π΄ No Value | dotnet-msbuild | build-perf-diagnostics | 3.2 | 7.2 | -4 | 1 | overfitting, overfittingScore, timedOut |
| π΄ No Value | dotnet-msbuild | check-bin-obj-clash | 4.4 | 6.8 | -2.4 | 1 | timedOut |
| π’ OK | dotnet-msbuild | directory-build-organization | 10 | 6.8 | 3.2 | 1 | - |
| π’ OK | dotnet-msbuild | eval-performance | 9.6 | 8 | 1.6 | 1 | - |
| π‘ Warning | dotnet-msbuild | including-generated-files | 8.8 | 6 | 2.8 | 1 | timedOut |
| π‘ Warning | dotnet-msbuild | incremental-build | 7.2 | 6 | 1.2 | 1 | timedOut |
| π΄ No Value | dotnet-msbuild | msbuild-antipatterns | 9.2 | 10 | -0.8 | 1 | timedOut |
| π‘ Warning | dotnet-msbuild | msbuild-modernization | 9.2 | 8.8 | 0.4 | 1 | timedOut |
Legend: π’ OK Β· π‘ Warning Β· π΄ No Value / Critical
π New Findings (13)
These appeared since the last health check (first run).
π΄ Critical: Skill not activated β migrate-nullable-references
Category: Quality | First seen: 2026-03-07
The migrate-nullable-references skill was not activated by the agent during evaluation. All scenarios show the notActivated flag, indicating the skill is broken or its triggering conditions are never met.
Recommendation: Investigate skill metadata, description, and triggering conditions. Review agent logs to understand why the skill was never selected.
π [Dashboard]((dotnet.github.io/redacted)
π‘ Warning: Evaluation workflow cancelled on main
Category: Pipeline | First seen: 2026-03-07
The evaluation workflow run for dotnet-msbuild was cancelled during the "Run skill-validator" step. This was a scheduled run on the main branch.
π Run #22787808011
π‘ Warning: Skill shows no quality improvement β csharp-scripts
Category: Quality | First seen: 2026-03-07
Skilled quality (4.8) is lower than vanilla quality (6.0) in the scenario "Test a C# language feature with a script". The skill is not providing value in this scenario.
Additional flags: overfitting, overfittingScore, timedOut
Recommendation: Review the skill's approach and prompts for this scenario. Consider if the scenario tests edge cases the skill isn't designed for.
π‘ Warning: Skill shows no quality improvement β dotnet-trace-collect
Category: Quality | First seen: 2026-03-07
Average skilled quality (3.95) is lower than vanilla quality (5.5) across 16 scenarios. The skill is consistently underperforming compared to the baseline.
Additional flags: timedOut (most scenarios)
Recommendation: This skill requires urgent attention. The widespread timeout issues and poor quality suggest fundamental problems with the skill implementation or evaluation scenarios.
π‘ Warning: Multiple stale PRs (8 PRs open > 2 days)
Category: PR Review | First seen: 2026-03-07
8 pull requests have been open for 2-3 days with minimal review activity:
- #207 β Add convert-blazor-server-to-webapp skill (2d, inactive 1d)
- #201 β Add iOS crash symbolication skill (2d)
- #200 β Add migrating-newtonsoft-to-system-text-json skill (2d)
- #199 β Add dotnet-maui plugin with dotnet-maui-doctor skill (2d)
- #182 β Add migrate-dotnet8-to-dotnet9 skill (3d)
- #181 β Add migrate-dotnet10-to-dotnet11 skill (3d, inactive 1d)
- #180 β Add migrate-dotnet9-to-dotnet10 skill (3d)
- #177 β dotnet-ai-ml (3d)
π΅ Info: Evaluation uses verdict-warn-only mode
Category: Infrastructure | First seen: 2026-03-07
The evaluation workflow uses the --verdict-warn-only flag, which suppresses skill validation failures. This is intentional for the current repository state.
π Investigation Results
Deep investigations are dispatched for new critical/warning findings.
Results will be linked here by the grooming workflow ~3 hours after dispatch.
| Finding | Severity | Status | Result |
|---|---|---|---|
| Skill not activated: migrate-nullable-references | π΄ Critical | β Done | Weak prompt-to-skill matching |
| Evaluation workflow cancelled on main | π‘ Warning | β Done | Runner infrastructure issue |
| Skill shows no quality improvement: csharp-scripts | π‘ Warning | π Dispatching... | β |
| Skill shows no quality improvement: dotnet-trace-collect | π‘ Warning | π Dispatching... | β |
β Resolved Since Yesterday (0)
None β this is the first health check run.
π Existing Findings (0)
None β this is the first health check run.
π Trends (7-day)
Baseline established β trends will be available after 7 days of data collection.
| Metric | Today | 7d Avg | Ξ | Trend |
|---|---|---|---|---|
| Eval duration (main) | 50.4 min | β | β | β‘οΈ Baseline |
| Eval success rate (main) | 66.7% (2/3) | β | β | β‘οΈ Baseline |
| Eval success rate (all branches) | 55.6% (5/9) | β | β | β‘οΈ Baseline |
| PRs merged/day | β | β | β | β‘οΈ Baseline |
| Open PRs | 8 | β | β | β‘οΈ Baseline |
| Active skills | 25 | β | β | β‘οΈ Baseline |
| Skills with issues | 22 (88%) | β | β |
π€ Generated by DevOps Health Check agentic workflow Β· Run #22791267527 Β· 2026-03-07 03:41 UTC
Generated by DevOps Daily Health Check
Generated by DevOps Health β Groom Dashboard