Add content based type detection and filename sanitization controls u… by mattijsmoens · Pull Request #147 · OWASP/AISVS

mattijsmoens · 2026-03-13T20:46:39Z

Related to #144

Adds two controls to C2.7 (Multi Modal Input Validation) addressing AI-specific file type risks:

C2.7.6: Content-based file type detection to prevent attackers from disguising malicious payloads as valid input modalities (e.g., a script with a .png extension bypassing image-only safety filters and reaching the model pipeline unvalidated).
C2.7.7: Filename sanitization against null byte injection, path traversal, and double extension attacks that could manipulate how AI pipelines route, store, or process uploaded files (e.g., "prompt.jpg.exe" or "../../model_config.json").

Both are deterministic, zero-cost checks that sit at the boundary between user input and model ingestion, before any modality-specific AI safety filters are applied.

Reference implementation: https://github.com/mattijsmoens/sovereign-shield (MultiModalFilter module)

…nder C2.7 Adds two controls that complement existing C2.7 multi-modal validation: - C2.7.6: Content-based file type detection using magic bytes/file headers to catch MIME type spoofing. Mismatches between declared and detected type are rejected. - C2.7.7: Filename sanitization against null byte injection, path traversal sequences, and double extension attacks (e.g., "image.jpg.exe"). Both are deterministic, zero cost checks that provide a hard security floor for any system accepting file uploads. Reference implementation: https://github.com/mattijsmoens/sovereign-shield (MultiModalFilter module)

ottosulin · 2026-03-13T21:18:35Z

1.0/en/0x10-C02-User-Input-Validation.md

 | **2.7.3** | **Verify that** image/audio inputs are checked for adversarial perturbations or known attack patterns, and detections trigger gating (block or degrade capabilities) before model use. | 2 | D/V |
 | **2.7.4** | **Verify that** multi-modal input validation failures trigger detailed logging including all input modalities, validation results, threat scores, and trace metadata (source, tool or MCP server, agent ID, session as applicable), and generate alerts for investigation. | 3 | D/V |
 | **2.7.5** | **Verify that** cross-modal attack detection identifies coordinated attacks spanning multiple input types (e.g., steganographic payloads in images combined with prompt injection in text) with correlation rules and alert generation, and that confirmed detections are blocked or require HITL (human-in-the-loop) approval. | 3 | D/V |
+| **2.7.6** | **Verify that** file type validation uses content based detection (magic bytes/file headers) in addition to declared MIME types and file extensions, and that mismatches between declared and detected type are rejected as potential type spoofing. | 1 | D/V |


While I agree this should be done, I have the concern as in #145 - we should try to keep to our scope of AI without starting to replicate e.g. ASVS contents when going to details.

@jmanico this PR is a good example of something we should discuss and probably clarify in Preface / Using AISVS: scope of the project and relation to other standards and frameworks.

We do currently include controls that are e.g. about runtime environment which are applicable also to non-AI applications. But replicating same control implementation details from e.g. ASVS may not be useful. On the other hand, I think we do kind of expect & want AISVS L2 systems to be roughly on same level of security as ASVS L2 - same controls are needed.

Good point on scope. These controls exist in ASVS for web apps, but in the AI context they sit at the boundary between user input and model ingestion the risk isn't just RCE from a malicious upload, it's that a spoofed file type could bypass modality specific safety filters (e.g., a script disguised as an image bypassing image only validation). I can adjust the wording to emphasize the AI specific angle if that helps.

That works for me. If you can frame this as a specific AISVS control that is not covered in ASVS yet, I'm all for it.

But Otto is right, we do want to avoid repeating the core of ASVS. I'm happy to make exceptions where appropriate.

Now checked against ASVSv5, I think that is covered already. I would here rather interpret ASVS broadly to cover similar use cases to prevent AISVS and ASVS unnecessary overlaps.

That being said I agree with Jim, if in the world of file handling there is an AI specific implementation that would not exist without AI very interested to discuss and include it.

You're right, V5.2.2 and V5.3.2 cover the core file validation mechanisms sufficiently. I am closing this one. Thanks for the thorough review.

Reframed controls with AI-specific language per reviewer feedback

ottosulin reviewed Mar 13, 2026

View reviewed changes

mattijsmoens added 2 commits March 16, 2026 10:01

Update 0x10-C02-User-Input-Validation.md

1a1ae2e

Reframed controls with AI-specific language per reviewer feedback

Update 0x10-C02-User-Input-Validation.md

972e4eb

mattijsmoens closed this Mar 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add content based type detection and filename sanitization controls u…#147

Add content based type detection and filename sanitization controls u…#147
mattijsmoens wants to merge 3 commits intoOWASP:mainfrom
mattijsmoens:patch-3

mattijsmoens commented Mar 13, 2026 •

edited

Loading

Uh oh!

ottosulin Mar 13, 2026

Uh oh!

mattijsmoens Mar 13, 2026

Uh oh!

jmanico Mar 15, 2026 •

edited

Loading

Uh oh!

ottosulin Mar 16, 2026

Uh oh!

mattijsmoens Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mattijsmoens commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ottosulin Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

mattijsmoens Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

jmanico Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ottosulin Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

mattijsmoens Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mattijsmoens commented Mar 13, 2026 •

edited

Loading

jmanico Mar 15, 2026 •

edited

Loading