microsoft · DavidKoleczek · Jan 20, 2026 · Jan 20, 2026
diff --git a/AGENTS.md b/AGENTS.md
@@ -64,6 +64,7 @@ Push for extreme simplicity in these areas:
 - Include `# Copyright (c) Microsoft. All rights reserved` at the top of each Python file.
 - When writing documentation, you write as if you were a professional and experienced developer making their code available publicly on GitHub.
 - Never add back in code or comments that the user has removed or changed.
+- llms.txt is auto generated by `.github/workflows/generate-llms-txt.yaml`. Do not edit it directly.
 
 ## Python Development Rules
 - This project uses Python >=3.11, uv as the package and project manager, and Ruff as a linter and code formatter.

diff --git a/README.md b/README.md
@@ -22,10 +22,10 @@ which are a mix of code and LLM calls to achieve a desired tradeoff between flex
 ```bash
 # Benchmarking requires certain prerequisites, see the full documentation for more details.
 # With uv (add to project dependencies, pinned to a release tag)
-uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.29"
+uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.0.30"
 
 # With pip
-pip install "git+https://github.com/microsoft/eval-recipes@v0.29"
+pip install "git+https://github.com/microsoft/eval-recipes@v0.0.30"
 ```
 
 > [!WARNING]

diff --git a/docs/BENCHMARKING.md b/docs/BENCHMARKING.md
@@ -8,10 +8,10 @@ This module provides a benchmarking harness for evaluating AI agents within isol
 ```bash
 # Install prerequisites below first.
 # With uv (add to project dependencies, pinned to a release tag)
-uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.29"
+uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.0.30"
 
 # With pip
-pip install "git+https://github.com/microsoft/eval-recipes@v0.29"
+pip install "git+https://github.com/microsoft/eval-recipes@v0.0.30"
 ```
 
 

diff --git a/eval_recipes/benchmarking/run_trial.py b/eval_recipes/benchmarking/run_trial.py
@@ -16,7 +16,7 @@
 from eval_recipes.benchmarking.docker_manager import DockerManager
 from eval_recipes.benchmarking.schemas import AgentConfig, TaskConfig, TrialResult
 
-DEFAULT_EVAL_RECIPES_VERSION = "0.29"
+DEFAULT_EVAL_RECIPES_VERSION = "0.0.30"
 
 
 @dataclass

diff --git a/pyproject.toml b/pyproject.toml
@@ -1,6 +1,6 @@
 [project]
 name = "eval_recipes"
-version = "0.0.29"
+version = "0.0.30"
 description = "Eval Recipes"
 authors = [{ name = "Semantic Workbench Team" }]
 readme = "README.md"