intel: reduce false positives in owasp-top-10-web and secrets-management by kamalsrini · Pull Request #1 · UnitOneAI/SecuritySkills

kamalsrini · 2026-03-17T07:23:53Z

Summary

A/B benchmark results identified high false positive rates in two skills. This PR adds precision controls.

Benchmark Results (Before)

Skill	Detection	FP Rate	TP	FP
owasp-top-10-web	0.57	0.86	4	24
secrets-management	1.0	0.38	8	5
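
The FP Rate column is not defined in the PR. The values in the table are consistent with FP / (TP + FP), that is, the share of reported findings that were false positives. The sketch below reproduces the two figures under that assumption; the function name `fp_rate` is illustrative and not taken from the benchmark framework.

```python
def fp_rate(tp: int, fp: int) -> float:
    """Share of reported findings that were false positives: FP / (TP + FP).

    Assumption: this formula is inferred from the table, not stated in the PR.
    """
    reported = tp + fp
    if reported == 0:
        return 0.0  # no findings reported, so no false positives either
    return fp / reported


# (TP, FP) pairs copied from the "Benchmark Results (Before)" table.
before = {
    "owasp-top-10-web": (4, 24),
    "secrets-management": (8, 5),
}

for skill, (tp, fp) in before.items():
    print(f"{skill}: {fp_rate(tp, fp):.2f}")
# owasp-top-10-web: 0.86
# secrets-management: 0.38
```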

Changes

owasp-top-10-web v1.0.1: Added precision requirements (confirmed code path + file:line reference, exploitability verification) and 5-point pre-classification checklist
secrets-management v1.0.1: Added false positive filtering (entropy check, known secret prefix patterns like AKIA*/sk-/ghp_, placeholder detection)
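
As a rough illustration of the three secrets-management filters named above (placeholder detection, known prefix patterns, entropy check), here is a minimal sketch. The PR does not show the actual rules, so the order of the checks, the placeholder regex, the 3.5 bits-per-character threshold, and the function names are all assumptions; only the prefixes `AKIA`, `sk-`, and `ghp_` come from the PR text.

```python
import math
import re
from collections import Counter

# Prefixes named in the PR description.
KNOWN_PREFIXES = ("AKIA", "sk-", "ghp_")

# Hypothetical placeholder patterns; the real list is not given in the PR.
PLACEHOLDER_RE = re.compile(
    r"example|placeholder|changeme|your[_-]|x{4,}|<[^>]+>", re.IGNORECASE
)

# Assumed threshold in bits per character; tune against a labeled corpus.
ENTROPY_THRESHOLD = 3.5


def shannon_entropy(s: str) -> float:
    """Shannon entropy of the character distribution of s, in bits per character."""
    if not s:
        return 0.0
    n = len(s)
    return -sum((c / n) * math.log2(c / n) for c in Counter(s).values())


def is_likely_secret(candidate: str) -> bool:
    """Return False for probable false positives, True for probable secrets."""
    if PLACEHOLDER_RE.search(candidate):
        return False  # documentation or template value
    if candidate.startswith(KNOWN_PREFIXES):
        return True  # matches a known credential prefix
    return shannon_entropy(candidate) >= ENTROPY_THRESHOLD


print(is_likely_secret("your_api_key_here"))  # False: placeholder
print(is_likely_secret("password"))           # False: entropy 2.75 is below 3.5
```

Checking placeholders first means a value such as AWS's documented sample key, which ends in `EXAMPLE`, is filtered even though it starts with `AKIA`. Whether the skill orders its checks this way is not stated in the PR.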

Test Plan

Re-run A/B benchmark on both skills to verify FP reduction
Verify detection rate maintained or improved
Check SKILL.md format compliance (injection-hardened, <500 lines)
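
The first two test-plan items and the line-count part of the third can be expressed as simple checks. This is a sketch under assumptions: `BenchmarkResult`, `meets_test_plan`, and `within_line_limit` are hypothetical names, not part of the benchmark framework, and the "injection-hardened" requirement is not covered because the PR does not define it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkResult:
    """One row of benchmark output (hypothetical shape)."""
    detection: float
    fp_rate: float


def meets_test_plan(before: BenchmarkResult, after: BenchmarkResult) -> bool:
    """FP rate must drop and detection must be maintained or improved."""
    return after.fp_rate < before.fp_rate and after.detection >= before.detection


def within_line_limit(skill_md_text: str, limit: int = 500) -> bool:
    """SKILL.md must have fewer than `limit` lines."""
    return len(skill_md_text.splitlines()) < limit


# Baseline for owasp-top-10-web from the "Benchmark Results (Before)" table.
baseline = BenchmarkResult(detection=0.57, fp_rate=0.86)

# Comparing the baseline with itself fails: the FP rate has not dropped.
print(meets_test_plan(baseline, baseline))  # False
```

The post-change numbers would come from the re-run described in the test plan; none are reported in this PR.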

🤖 Generated with Claude Code

Source: A/B benchmark framework at ~/.openclaw/workspace/test-corpus/

A/B benchmark results showed:
- owasp-top-10-web: 0.86 FP rate (24 false positives on Juice Shop)
- secrets-management: 0.38 FP rate (5 false positives on leaked-secrets)

Changes:
- owasp-top-10-web v1.0.1: Add precision requirements and 5-point pre-classification verification checklist
- secrets-management v1.0.1: Add false positive filtering section (entropy check, known prefix patterns, placeholder detection)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…erns to gitignore
- injection-scan.yml: Add discord.com/api/webhooks and hooks.slack.com/services patterns
- .gitignore: Add *.secrets, .env*, *.credentials, *.pem, *.key

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…s-2026-03-17 reduce false positives in owasp-top-10-web and secrets-management

Ubuntu and others added 2 commits March 17, 2026 07:23

kamalsrini merged commit 6b9f677 into main Mar 21, 2026
3 checks passed

kamalsrini deleted the intel/benchmark-fp-fixes-2026-03-17 branch March 21, 2026 18:20

hermesbountyhunter mentioned this pull request Jun 3, 2026

[REVIEW] hipaa-review: add source-dated regulatory and threat-evidence gates #179

Open

4 tasks

jddark62 pushed a commit to jddark62/SecuritySkills that referenced this pull request Jun 5, 2026

Merge pull request UnitOneAI#1 from UnitOneAI/intel/benchmark-fp-fixe…

f921c9e

…s-2026-03-17 reduce false positives in owasp-top-10-web and secrets-management

sosal123tyu1 mentioned this pull request Jun 5, 2026

[REVIEW] rbac-design: add ReBAC relationship gates and ZTA 'continuous verification' lifecycle #1110

Open

4 tasks

Steven13799 mentioned this pull request Jun 5, 2026

[REVIEW] sast-config: CWE-matrix false gaps, no CodeQL build-completeness check, baseline-suppression blind spot #1145

Open

4 tasks

RanuK12 mentioned this pull request Jun 6, 2026

feat(secure-code-review): add HTTP parser boundary and request smuggling review gates #1289

Closed

5 tasks

kamalsrini merged 2 commits into main from intel/benchmark-fp-fixes-2026-03-17
