Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions AGENTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -64,6 +64,7 @@ Push for extreme simplicity in these areas:
- Include `# Copyright (c) Microsoft. All rights reserved` at the top of each Python file.
- When writing documentation, you write as if you were a professional and experienced developer making their code available publicly on GitHub.
- Never add back in code or comments that the user has removed or changed.
- llms.txt is auto generated by `.github/workflows/generate-llms-txt.yaml`. Do not edit it directly.

## Python Development Rules
- This project uses Python >=3.11, uv as the package and project manager, and Ruff as a linter and code formatter.
Expand Down
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,10 +22,10 @@ which are a mix of code and LLM calls to achieve a desired tradeoff between flex
```bash
# Benchmarking requires certain prerequisites, see the full documentation for more details.
# With uv (add to project dependencies, pinned to a release tag)
uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.29"
uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.0.30"

# With pip
pip install "git+https://github.com/microsoft/eval-recipes@v0.29"
pip install "git+https://github.com/microsoft/eval-recipes@v0.0.30"
```

> [!WARNING]
Expand Down
4 changes: 2 additions & 2 deletions docs/BENCHMARKING.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,10 +8,10 @@ This module provides a benchmarking harness for evaluating AI agents within isol
```bash
# Install prerequisites below first.
# With uv (add to project dependencies, pinned to a release tag)
uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.29"
uv add "eval-recipes @ git+https://github.com/microsoft/eval-recipes@v0.0.30"

# With pip
pip install "git+https://github.com/microsoft/eval-recipes@v0.29"
pip install "git+https://github.com/microsoft/eval-recipes@v0.0.30"
```


Expand Down
2 changes: 1 addition & 1 deletion eval_recipes/benchmarking/run_trial.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@
from eval_recipes.benchmarking.docker_manager import DockerManager
from eval_recipes.benchmarking.schemas import AgentConfig, TaskConfig, TrialResult

DEFAULT_EVAL_RECIPES_VERSION = "0.29"
DEFAULT_EVAL_RECIPES_VERSION = "0.0.30"


@dataclass
Expand Down
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[project]
name = "eval_recipes"
version = "0.0.29"
version = "0.0.30"
description = "Eval Recipes"
authors = [{ name = "Semantic Workbench Team" }]
readme = "README.md"
Expand Down
Loading