feat(daily-fact): add reasoning_depth A/B experiment + fix compiler expression transform for inline mode by Copilot · Pull Request #31330 · github/gh-aw

Copilot · 2026-05-10T12:06:04Z

Implements the reasoning_depth experiment on daily-fact to test whether a multi-candidate deliberation step produces more novel/engaging verses than the current single-pass approach. Also fixes a latent compiler bug exposed by this workflow's inlined-imports: true setting.

Changes

`daily-fact.md` — experiment wiring

Added experiments.reasoning_depth frontmatter block (single_pass vs multi_candidate, 50/50, starts 2026-05-11, issue [ab-advisor] Experiment campaign for daily-fact: A/B test reasoning_depth #31324)
Replaced the single Favor recent updates guideline bullet with a variant-conditional block:

{{#if experiments.reasoning_depth == "multi_candidate"}}
- **Multi-candidate deliberation**: Before writing, identify exactly **3 distinct candidate facts** ...
  score each 1–5 on novelty and poetic potential, write verse for the highest-scoring one only.
{{else}}
- **Favor recent updates** but include variety - pick something interesting, not just the most recent
{{/if}}

`daily-fact.lock.yml` — regenerated via `gh aw compile daily-fact`

Compiler bug fix: `expression_extraction.go`

transformExperimentsExpression only handled bare ${{ experiments.NAME }}. In workflows with inlined-imports: true the compiler pre-processes all ${{ … }} expressions before runtime; the comparison form ${{ experiments.NAME == "value" }} was left untransformed, producing an invalid GitHub Actions expression where experiments is not a real context — always evaluating to false, silently killing the treatment arm.

Fix: added experimentComparisonRegex to rewrite all four comparison operators (==, ===, !=, !==):

experiments.NAME == "value"  →  steps.pick-experiment.outputs.NAME == "value"

The generated env var now produces a valid GA expression:

# Before (always false — experiments context doesn't exist in GA)
GH_AW_EXPR_70B2292B: ${{ experiments.reasoning_depth == "multi_candidate" }}

# After (evaluates correctly from pick-experiment step output)
GH_AW_EXPR_DD64DF46: ${{ steps.pick-experiment.outputs.reasoning_depth == "multi_candidate" }}

Four new test cases cover ==, ===, !=, !== in expression_extraction_test.go. This bug would have affected any workflow combining inlined-imports: true with value-comparison experiment conditionals.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

… mode (#31324) Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

…= test cases Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

Adds an A/B experiment (reasoning_depth) to the daily-fact workflow and fixes inline-mode experiment expression rewriting so variant conditionals evaluate correctly under inlined-imports: true.

Changes:

Introduce experiments.reasoning_depth (50/50 single_pass vs multi_candidate) and gate a new “multi-candidate deliberation” guideline via a variant-conditional block in daily-fact.md.
Update the compiler’s transformExperimentsExpression to also rewrite experiments.<name> {==,===,!=,!==} "value" comparisons to steps.pick-experiment.outputs.<name> ....
Regenerate daily-fact.lock.yml and add tests covering the comparison-operator rewrite cases.

Show a summary per file

File	Description
pkg/workflow/expression_extraction.go	Extend experiment expression rewriting to handle string comparisons in inline mode.
pkg/workflow/expression_extraction_test.go	Add unit tests for `==`, `===`, `!=`, `!==` experiment comparison transforms.
.github/workflows/daily-fact.md	Wire `reasoning_depth` experiment and add variant-conditional prompt guidance.
.github/workflows/daily-fact.lock.yml	Regenerated compiled workflow reflecting experiment wiring and fixed expression transform.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 4/4 changed files
Comments generated: 1

 // transformExperimentsExpression detects expressions of the form "experiments.<name>"
-// and rewrites them to "steps.pick-experiment.outputs.<name>" so that the placeholder
-// substitution step reads the value from the pick_experiment step output.
-// This is used for ${{ experiments.name }} expressions that appear directly in the prompt body
-// (mostly relevant in inline mode; in runtime-import mode the template conditional
-// {{#if experiments.name}} path is handled separately via ExperimentExpressionMappings).
+// (and the comparison form "experiments.<name> == "value"") and rewrites them so that the
+// placeholder substitution step reads the value from the pick_experiment step output.
+//
+// Simple form:     experiments.name          → steps.pick-experiment.outputs.name
+// Comparison form: experiments.name == "v"  → steps.pick-experiment.outputs.name == "v"


pelikhan · 2026-05-10T15:50:39Z

@copilot review all comments

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2026-05-10T15:55:52Z

@copilot review all comments

Fixed in the latest commit — switched the nested double-quoted example on line 220 to use a backtick so the rendered comment is unambiguous: `experiments.<name> == "value"`.

Initial plan

d135667

Copilot AI assigned Copilot and pelikhan May 10, 2026

Copilot started work on behalf of pelikhan May 10, 2026 12:06 View session

Copilot AI linked an issue May 10, 2026 that may be closed by this pull request

[ab-advisor] Experiment campaign for daily-fact: A/B test reasoning_depth #31324

Closed

8 tasks

Copilot AI and others added 3 commits May 10, 2026 12:12

feat(daily-fact): add A/B experiment for reasoning_depth (#31324)

36c0998

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

fix(compiler): transform experiments comparison expressions in inline…

0888aad

… mode (#31324) Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

fix(compiler): tighten experimentComparisonRegex and add == != === !=…

a5063b7

…= test cases Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add A/B test for reasoning depth in daily-fact campaign~~ feat(daily-fact): add reasoning_depth A/B experiment + fix compiler expression transform for inline mode May 10, 2026

Copilot AI requested a review from pelikhan May 10, 2026 12:29

Copilot finished work on behalf of pelikhan May 10, 2026 12:29

pelikhan marked this pull request as ready for review May 10, 2026 12:33

Copilot AI review requested due to automatic review settings May 10, 2026 12:33

Copilot started reviewing on behalf of pelikhan May 10, 2026 12:33 View session

Copilot AI reviewed May 10, 2026

View reviewed changes

Copilot started work on behalf of pelikhan May 10, 2026 15:50 View session

fix: use backtick in doc comment to avoid nested double quotes

3016e9c

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan May 10, 2026 15:56

pelikhan merged commit e8528ba into main May 10, 2026

pelikhan deleted the copilot/ab-advisor-experiment-daily-fact branch May 10, 2026 15:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(daily-fact): add reasoning_depth A/B experiment + fix compiler expression transform for inline mode#31330

feat(daily-fact): add reasoning_depth A/B experiment + fix compiler expression transform for inline mode#31330
pelikhan merged 5 commits into
mainfrom
copilot/ab-advisor-experiment-daily-fact

Copilot AI commented May 10, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

pelikhan commented May 10, 2026

Uh oh!

Copilot AI commented May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Copilot AI commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

daily-fact.md — experiment wiring

daily-fact.lock.yml — regenerated via gh aw compile daily-fact

Compiler bug fix: expression_extraction.go

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

pelikhan commented May 10, 2026

Uh oh!

Copilot AI commented May 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented May 10, 2026 •

edited

Loading

`daily-fact.md` — experiment wiring

`daily-fact.lock.yml` — regenerated via `gh aw compile daily-fact`

Compiler bug fix: `expression_extraction.go`