refactor(search): remove composable search taskset (migrated to research-environments v1)#1854
Open
hallerite wants to merge 2 commits into
Open
refactor(search): remove composable search taskset (migrated to research-environments v1)#1854hallerite wants to merge 2 commits into
hallerite wants to merge 2 commits into
Conversation
There was a problem hiding this comment.
Cursor Bugbot has reviewed your changes and found 2 potential issues.
❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.
Reviewed by Cursor Bugbot for commit 80c3286. Configure here.
ApprovabilityVerdict: Needs human review Diff is too large for automated approval analysis. A human reviewer should evaluate this PR. No code changes detected at You can customize Macroscope's approvability policy. Learn more. |
Ports the composable (v0) search taskset family to one harness-agnostic `vf.Taskset` (`search-v1`) with a `backend` config selecting QUEST / OpenSeeker / REDSearcher. QUEST's obj_task_eval evaluator (16 files) and open_ended.py are vendored byte-identical to v0; OpenSeeker/REDSearcher judge prompts, parse, exact-match and normalization match v0 exactly. The agent writes /task/answer.txt; scoring reads it from the live runtime. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…rch-environments v1) The QUEST/OpenSeeker/REDSearcher search tasksets move to the new v1 taskset/harness in research-environments (search-v1 + rlm-search-v1). Removes the v0 composable search taskset family from verifiers; the v1 port lives entirely in research-environments. Note: this removes the v0 `make_search_taskset` family; downstream v0 `rlm_search` usage should migrate to `rlm-search-v1`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
6939616 to
c30669f
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

What
Removes the v0 composable search taskset family (QUEST / OpenSeeker / REDSearcher) from verifiers. The search environments are migrated to the new v1 taskset/harness and now live entirely in research-environments (PrimeIntellect-ai/research-environments#530): a harness-agnostic
search-v1taskset + therlm-search-v1agent env.Deletes
verifiers/envs/experimental/composable/tasksets/search/(and itsmake_search_taskset/make_quest_taskset/make_openseeker_taskset/make_redsearcher_tasksetfactories).Why
The v1 port reproduces v0 behavior at parity (scoring is byte-identical; verified by a live old-vs-new comparison — see #530), so the v0 composable copy is redundant. Keeping search in one place (research-environments, on v1) avoids drift.
Impact / migration
Downstream v0
rlm_searchusage (the composable env that imported…composable.tasksets.search) should migrate torlm-search-v1. No other code in verifiers imports the composable search taskset.🤖 Generated with Claude Code
Note
Medium Risk
Large removal of public experimental APIs; breakage only for code still importing composable search from verifiers, but the surface area and vendored QUEST runtime make this a significant delete rather than a trivial cleanup.
Overview
Removes the experimental composable search taskset from verifiers now that equivalent v1 search support lives in research-environments (
search-v1+rlm-search-v1).The deleted tree under
verifiers/envs/experimental/composable/tasksets/search/included themake_search_tasksetdispatcher and three backends—QUEST (objective eval scripts + vendoredobj_task_eval, open-ended rubric judging), OpenSeeker (binary LLM semantic judge), and REDSearcher (exact-match shortcut + BROWSECOMP-style judge)—plus their READMEs and public exports fromsearch/__init__.py.Callers that used v0 composable search or
rlm_searchwired to these factories should switch to the v1 stack in research-environments instead of importing from verifiers.Reviewed by Cursor Bugbot for commit c30669f. Bugbot is set up for automated code reviews on this repo. Configure here.
Note
Remove composable search taskset migrated to research-environments v1
Deletes the README.md for the composable search taskset, which has been migrated to research-environments v1.
Macroscope summarized c30669f.