test(e2e): full LLM round-trip with FakeAssistantProvider by yuga-hashimoto · Pull Request #463 · yuga-hashimoto/open-dash

yuga-hashimoto · 2026-04-19T23:44:45Z

Priority 5: Refactor / Quality (testing the priority-2 contract)

Builds on #460. Drives the real `VoicePipeline` through a non-fast-path utterance so it falls into the LLM agent loop, with a scripted `FakeAssistantProvider` standing in for the embedded LLM.

No DI module changes needed: `ConversationRouter` already accepts dynamically registered providers via `registerProvider`, and Manual policy lets a test pin its own fake as the resolved provider.

What changed

Tests (`app/src/androidTest/...`)

`e2e/fakes/FakeAssistantProvider` — AssistantProvider impl that returns scripted responses from a queue, records every `send()` call into `sentMessages` for assertions. `capabilities.isLocal=true` so the Auto policy never gates this provider on the emulator's network state
`e2e/AssistantProviderE2ETest` —
- `llm_response_is_spoken_via_tts` — register the fake, queue an Assistant message, drive `processUserInput("explain quantum computing")` (an ambiguous-info utterance that fast-path ignores), assert (a) the user message was sent to the provider, (b) the canned reply was spoken via `FakeTextToSpeech`
- `fake_provider_is_resolved_under_manual_policy` — guards against silent re-routing if the production model-download path registers a real provider mid-suite

Why this matters

This is the first L3 test that exercises the priority-2 contract (local-LLM agentic path) end-to-end through the production graph. The L1 unit tests can't catch a router-vs-pipeline integration regression here because they don't bind the actual `VoicePipeline` singleton.

Test plan

`./gradlew assembleStandardDebugAndroidTest` — green
`./gradlew assembleStandardDebug testStandardDebugUnitTest` — green
Once ci: add instrumented test workflow on macOS AVD #461 (CI emulator workflow) merges, `connectedStandardDebugAndroidTest` will run this in CI

## Priority 5: Refactor / Quality Builds on #460. Drives the **real** VoicePipeline through a non-fast- path utterance so it falls into the LLM agent loop, with a scripted FakeAssistantProvider standing in for the embedded LLM. No DI module changes needed: ConversationRouter already accepts dynamically registered providers via registerProvider, and Manual policy lets a test pin its own fake as the resolved provider. Test additions: - FakeAssistantProvider: AssistantProvider impl that returns scripted responses from a queue, records every send() call into sentMessages for assertions. capabilities.isLocal=true so the Auto policy never gates this provider on the emulator's network state. - AssistantProviderE2ETest: * llm_response_is_spoken_via_tts — register the fake, queue an Assistant message, drive `processUserInput("explain quantum computing")` (an ambiguous-info utterance that fast-path ignores), assert (a) the user message was sent to the provider, (b) the canned reply was spoken via FakeTextToSpeech. * fake_provider_is_resolved_under_manual_policy — guards against silent re-routing if the production model-download path registers a real provider mid-suite. This is the first L3 test that exercises the priority-2 contract (local-LLM agentic path) end-to-end through the production graph. The L1 unit tests can't catch a router-vs-pipeline integration regression here because they don't bind the actual VoicePipeline singleton. Verification: - ./gradlew assembleStandardDebugAndroidTest — green - ./gradlew assembleStandardDebug testStandardDebugUnitTest — green

yuga-hashimoto merged commit 4c50ea6 into main Apr 19, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(e2e): full LLM round-trip with FakeAssistantProvider#463

test(e2e): full LLM round-trip with FakeAssistantProvider#463
yuga-hashimoto merged 1 commit into
mainfrom
feat/e2e-assistant-swap

yuga-hashimoto commented Apr 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant