[copilot-harness] attempt 1: process exit event exitCode=1
[copilot-harness] attempt 1: process closed exitCode=1 duration=0s stdout=0B stderr=746B hasOutput=true
[copilot-harness] attempt 1 failed: exitCode=1 ... isAuthError=true isAuthenticationFailedError=false permissionDeniedCount=0 ... retriesRemaining=3
[copilot-harness] attempt 1: no authentication information found — not retrying (COPILOT_GITHUB_TOKEN, GH_TOKEN, and GITHUB_TOKEN are all absent or invalid)
[copilot-harness] done: exitCode=1 totalDuration=1s
Problem statement
copilot-engine agentic workflows failed in the last 6h window, both in theagentjob, with an identical, newly-classified signature: the GitHub Copilot CLI process exits in ~0–1s withexitCode=1, recording 0 turns / 0 tokens and an emptyagent_output.json({"items":[]}).isAuthError=trueand refuses to retry, emitting:agentic-workflowsissue. It is filed as a sub-issue of the most-recent open failure report [aw-failures] Daily Issues Report Generator — 100% failure (5+ days): copilot CLI exits in ~30s with zero token usage, unclassif [Content truncated due to length] #36325 (which tracks a related but distinct copilot symptom: a ~30s exit on a single workflow, root cause unclassified). This issue classifies the early-copilot-exit symptom for the first time as auth-token absence and supplies fresh same-day run IDs.Affected workflows and run IDs
agentjob, 4.1m, 0 turnsagentjob, 3.6m, 0 turnsNote on coverage: the deterministic pre-fetch payload reported 0 failures for this window, and the
logsMCP tool timed out (120s);gh run listpagination beyond page 1 was firewall-refused (dial tcp ...: connection refused). The two runs above were found by sampling page 1 only — the true count of affected scheduledcopilotruns in the 6h window is likely higher and could not be enumerated.Probable root cause
awf-agentcontainer, none ofCOPILOT_GITHUB_TOKEN,GH_TOKEN, orGITHUB_TOKENis present/valid in the process environment, so the CLI aborts immediately before any model turn.copilotruns in the same window succeeded (e.g.Copilot§26890979977,Agentic Maintenance§26890508514), so this is not a total auth outage.Proposed remediation
agentjob, verify that the resolved Copilot/GitHub token is non-empty immediately before the CLI launches (mask-safe presence check, not value logging). Confirm whether the token is being passed into theawf-agentcontainer env for scheduled events.isAuthError=truewith all three token vars absent, surface a distinct, classified error in the step summary (today it lands as a genericexitCode=1and the pre-fetch counts it as zero failures). This would also let the deterministic pre-fetch detect it.isAuthErrorcause; if so, merge tracking.Success criteria / verification
copilotrun whose token is momentarily unavailable either (a) recovers via bounded retry, or (b) fails with an explicit, classified auth error visible in the step summary and counted by the pre-fetch.copilotagentic run exits with 0 turns / 0 tokens without a classified auth diagnostic.isAuthError-class 0-turn failures.Evidence
agent-stdio.log tail — PR Triage Agent §26889734909 (identical for §26890506259)
audit-diff — failed vs successful PR Triage Agent baseline (same workflow)
agentjob failure, 0 turns, token_usage=None, errors=0, emptyagent_output.json,missing_tools=None,mcp_failures=None.turns 1→1,posture read_only).Parent / correlation
[aw-failures]report; related early-copilot-exit symptom, previously unclassified).node:lts-alpinepull from Docker Hub times out — mirror base image to ghcr.io #36595), squid-startup ([aw-failures] Squid firewall container (awf-squid) unhealthy → claude engine fails to start (0-turn run failures) #35780), and safe-output contract ([aw-failures] Contribution Checksafe_outputsjob fails — agent emitsadd_commentwithtarget: "*"and noissue_number#35984) classes — none of those match thisisAuthError/ 0s signature.References: §26889734909 · §26890506259 · §26871722704
Related to #36325