feat(app): wire defensive middleware into chain + E2E test by hidetzu · Pull Request #12 · hidetzu/prism-api

hidetzu · 2026-04-11T22:15:17Z

Summary

PR 6/9 of the Phase 2 series. Wires `body_limit`, `rate_limit`, and `concurrency_limit` (merged individually in PRs 3-5) into the `app.New()` middleware chain in the canonical §6 order, and adds end-to-end integration tests that exercise the wired chain through the real `app.server.Handler`.

This PR is also the first Fly.io verification point after merge — health endpoints must remain operational through the new chain before we move on to PRs 7-9.

Wiring change

Before (Phase 1):
```
RequestID → Recover → Logger → Timeout → mux
```

After (this PR):
```
RequestID → Recover → Logger → BodyLimit → RateLimit → ConcurrencyLimit → Timeout → mux
```

Each new middleware reads config values that were already loaded at startup in Phase 1 (`MaxRequestBytes`, `RateLimitRPM`, `RateLimitBurst`, `MaxConcurrentRequests`). No new config knobs.
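As a rough illustration of why argument order in the chain matters, here is a minimal sketch. The `Middleware` type, the `Chain` signature, and the `tag` stand-ins are assumptions made for this example; the real `app.New()` code is not shown in this PR description and may differ.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// Middleware wraps a handler with extra behavior (assumed shape).
type Middleware func(http.Handler) http.Handler

// Chain applies middlewares so that the first argument is the outermost layer.
func Chain(h http.Handler, mws ...Middleware) http.Handler {
	for i := len(mws) - 1; i >= 0; i-- {
		h = mws[i](h)
	}
	return h
}

// tag returns a stand-in middleware that records its name when a request enters it.
func tag(name string, order *[]string) Middleware {
	return func(next http.Handler) http.Handler {
		return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
			*order = append(*order, name)
			next.ServeHTTP(w, r)
		})
	}
}

// visitOrder sends one request through a chain of stand-ins and returns
// the order in which the layers (and finally the mux) were entered.
func visitOrder(names ...string) []string {
	var order []string
	mws := make([]Middleware, 0, len(names))
	for _, n := range names {
		mws = append(mws, tag(n, &order))
	}
	mux := http.NewServeMux()
	mux.HandleFunc("/healthz", func(w http.ResponseWriter, r *http.Request) {
		order = append(order, "mux")
	})
	h := Chain(mux, mws...)
	h.ServeHTTP(httptest.NewRecorder(), httptest.NewRequest(http.MethodGet, "/healthz", nil))
	return order
}

func main() {
	fmt.Println(visitOrder("RequestID", "Recover", "Logger",
		"BodyLimit", "RateLimit", "ConcurrencyLimit", "Timeout"))
}
```

With this composition, a request enters the layers in the same left-to-right order as the diagram above, so whatever RequestID puts in the context is visible to every later layer.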

Why this needs an E2E test

The Phase 1 QA review flagged "full middleware chain never tested end-to-end" as a real gap. Unit tests for each middleware cover their individual behavior, but they can't catch:

- Wrong ordering: e.g., BodyLimit placed before RequestID would silently drop `request_id` from the 413 error body. Nothing in the unit tests would notice.
- Missing wiring: a typo that omits a middleware would pass every unit test.
- Config value propagation: a wrong field name in `app.New()` wouldn't be caught by middleware unit tests.

This PR adds `internal/app/app_chain_test.go` to close that gap by driving real requests through `a.server.Handler`.

Test plan

All tests drive `a.server.Handler.ServeHTTP(...)` — the exact same handler chain that production serves.

`TestChain_HealthEndpointsPassThroughAllLayers`

Positive path. GET `/healthz`, `/readyz`, `/version` all return 200 with `X-Request-Id` header and `Content-Type: application/json`. Captures slog output and asserts the Logger middleware emitted "request completed" entries with the `request_id` field populated — proves RequestID → Logger ordering.
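The log-capture part of this test could look roughly like the sketch below. It assumes the logger is injectable and uses stand-in `requestID` and `logger` middlewares with a fixed ID; how the real test captures slog output (for example via `slog.SetDefault`) is not stated here.

```go
package main

import (
	"bytes"
	"context"
	"encoding/json"
	"fmt"
	"log/slog"
	"net/http"
	"net/http/httptest"
)

type ctxKey struct{}

// requestID is a stand-in that stores a fixed ID in the request context.
func requestID(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		ctx := context.WithValue(r.Context(), ctxKey{}, "req-123")
		next.ServeHTTP(w, r.WithContext(ctx))
	})
}

// logger is a stand-in that logs one entry per request after the inner handler returns.
func logger(l *slog.Logger, next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		next.ServeHTTP(w, r)
		id, _ := r.Context().Value(ctxKey{}).(string)
		l.Info("request completed", "request_id", id, "path", r.URL.Path)
	})
}

// loggedIDs sends one GET per path and returns the request_id of every
// "request completed" entry found in the captured JSON log output.
func loggedIDs(paths ...string) []string {
	var buf bytes.Buffer
	l := slog.New(slog.NewJSONHandler(&buf, nil))
	noop := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {})
	h := requestID(logger(l, noop))
	for _, p := range paths {
		h.ServeHTTP(httptest.NewRecorder(), httptest.NewRequest(http.MethodGet, p, nil))
	}
	var ids []string
	dec := json.NewDecoder(&buf)
	for {
		var entry struct {
			Msg       string `json:"msg"`
			RequestID string `json:"request_id"`
		}
		if err := dec.Decode(&entry); err != nil {
			break // io.EOF once every log line has been read
		}
		if entry.Msg == "request completed" {
			ids = append(ids, entry.RequestID)
		}
	}
	return ids
}

func main() {
	fmt.Println(loggedIDs("/healthz", "/readyz", "/version"))
}
```

If the two stand-ins were swapped so that `logger` wrapped `requestID`, the logged `request_id` would be empty, which is the ordering mistake this assertion is meant to expose.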

`TestChain_BodyLimitRejectsOversizedWithRequestID`

Sets `MaxRequestBytes=64` and POSTs a 1024-byte body. Asserts 413, verifies the error body has `code=payload_too_large`, and, crucially, that `error.request_id` equals the `X-Request-Id` response header. This is the single most load-bearing assertion in the entire chain: if BodyLimit were placed before RequestID, the error body would carry an empty `request_id` and this test would catch it.

`TestChain_RateLimitRejectsExcessFromSameIP`

Sets `RateLimitRPM=60`, `RateLimitBurst=2`, same `RemoteAddr` across 3 requests. First 2 return 200, 3rd returns 429.

`TestChain_RateLimitKeysByXForwardedFor`

Sets `RateLimitRPM=60`, `RateLimitBurst=1`, and sends requests with the same `RemoteAddr` but different `X-Forwarded-For` values. Verifies that two logical clients arriving through a shared Fly edge proxy are keyed independently — client A can be rate-limited without affecting client B. This is the scenario that motivated the `clientIP` refactor in PR 4.

Why concurrency_limit isn't E2E-tested here

Exercising `concurrency_limit` against the fast-returning health endpoints would require orchestrating blocking handlers (goroutines + signal channels) from inside an E2E test. That's doable but makes the test much more complex with little added value: the middleware's own unit tests already cover capacity, rejection, recovery, and multiple-slot scenarios with identical orchestration patterns, and its presence in the chain is observable by inspection of `app.go`. This is a conscious coverage tradeoff, called out in the commit message.
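For a sense of the orchestration being avoided, here is a sketch of what a blocking-handler test tends to look like. The semaphore-based `concurrencyLimit` stand-in and the 503 rejection status are assumptions; the real middleware's behavior is covered by its own unit tests.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// concurrencyLimit is a stand-in: a buffered channel acts as a semaphore with limit slots.
func concurrencyLimit(limit int, next http.Handler) http.Handler {
	slots := make(chan struct{}, limit)
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		select {
		case slots <- struct{}{}:
			defer func() { <-slots }()
			next.ServeHTTP(w, r)
		default:
			w.WriteHeader(http.StatusServiceUnavailable) // rejection status is an assumption
		}
	})
}

// serve sends one GET and returns the response status.
func serve(h http.Handler) int {
	rec := httptest.NewRecorder()
	h.ServeHTTP(rec, httptest.NewRequest(http.MethodGet, "/healthz", nil))
	return rec.Code
}

// run occupies the single slot with a blocked request, probes while the slot
// is held, releases the blocked request, and probes again.
func run() (during, after int) {
	entered := make(chan struct{}, 2) // buffered so later requests never block on the signal
	release := make(chan struct{})
	blocking := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		entered <- struct{}{}
		<-release // returns immediately once release is closed
	})
	h := concurrencyLimit(1, blocking)

	done := make(chan struct{})
	go func() {
		defer close(done)
		serve(h)
	}()
	<-entered // the first request now holds the only slot
	during = serve(h)
	close(release)
	<-done // the first request has returned and freed its slot
	after = serve(h)
	return during, after
}

func main() {
	during, after := run()
	fmt.Println(during, after) // 503 200
}
```

Even this reduced version needs two signal channels and a goroutine, and it only works because the handler is a test double; the real health handlers return immediately and offer no point at which to hold a slot.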

If a subtle wiring bug slipped in (e.g. `concurrency_limit` accidentally removed from the `Chain` call), the middleware's own unit tests wouldn't catch it, but the omission would be visible in `app.go` and should fail code review. Acceptable.

Local checks

- `go vet ./...`: clean
- `go test ./... -race -count=1`: all pass (4 new + 2 existing lifecycle + 39 others)
- `golangci-lint run`: 0 issues
- `go mod tidy`: clean

Fly.io verification plan (after merge)

Per the Phase 2 plan, this PR is the 1st of 3 Fly verifications. The target is to confirm that health endpoints still work after the new chain is active:

1. `fly deploy` from `main` post-merge
2. `curl` `/healthz`, `/readyz`, `/version` → expect 200
3. `flyctl logs --no-tail` → expect `"request completed"` entries with `status=200`, `request_id`, `duration_ms`, `remote_ip`
4. (Optional) send an oversized POST to `/healthz` and confirm 413 with `request_id` populated

Phase 2 context

| # | PR | Status |
|---|----|--------|
| 1 | response codes | merged |
| 2 | validation helpers | merged |
| 3 | body_limit middleware | merged |
| 4 | rate_limit middleware | merged |
| 5 | concurrency_limit middleware | merged |
| 6 | wire defensive middleware + E2E test (this PR) | open |
| 7 | analyze usecase adapter for pkg/prism | blocked on 1 |
| 8 | /v1/analyze handler and routing | blocked on 1, 2, 7 |
| 9 | /v1/prompt endpoint | blocked on 1, 2, 7 |

Intended to be squash-merged.

Insert the three defensive middleware from the Phase 2 series into app.New()'s Chain call in the canonical §6 order:

1. RequestID
2. Recover
3. Logger
4. BodyLimit (new)
5. RateLimit (new)
6. ConcurrencyLimit (new)
7. Timeout
8. router/handler

Each middleware consumes config values that were already loaded at startup in Phase 1 (MaxRequestBytes, RateLimitRPM, RateLimitBurst, MaxConcurrentRequests); no new config knobs required.

Add internal/app/app_chain_test.go for end-to-end coverage of the wired chain (Phase 1 QA gap: full middleware chain was never exercised against the real app.New() output). The tests drive requests through a.server.Handler so they see exactly what a production request would see.

Cases:

- Positive path: GET /healthz /readyz /version all return 200 with X-Request-Id header and Content-Type: application/json; the Logger middleware emits "request completed" log lines with the request_id field populated, proving RequestID -> Logger ordering.
- body_limit with a tight MaxRequestBytes rejects an oversized POST with 413, and the error body carries the same request_id as the X-Request-Id response header; this proves RequestID runs upstream of BodyLimit, which is the single most load-bearing ordering assertion in the chain.
- rate_limit rejects the third request from a single client IP when burst=2, and the standard 429 body is returned.
- rate_limit keys by X-Forwarded-For, not RemoteAddr: two clients sharing a Fly edge proxy (same RemoteAddr) but distinct XFF values are limited independently.

concurrency_limit is not E2E-tested here because exercising it against the fast-returning health endpoints would require orchestrating blocking handlers; the middleware is thoroughly unit-tested in its own file and the wiring correctness is observable by inspection.

hidetzu merged commit 03c689b into main Apr 11, 2026
2 checks passed

hidetzu merged 1 commit into main from feat/app-wire-defensive-middleware
