WIP: Support per-agent rewards in multi-agent setups #2575

Draft
nph4rd wants to merge 16 commits into PrimeIntellect-ai:main from nph4rd:multiagent-heterogeneous-rewards

Conversation

nph4rd (Contributor) commented May 20, 2026

Adds support for per-agent rewards and advantages in multi-agent environments. This is a companion change to PrimeIntellect-ai/verifiers#965, which adds abstractions for multi-agent setups and heterogeneous reward functions.
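
For readers unfamiliar with the idea, here is a minimal sketch of what "per-agent rewards and advantages" could look like. It is illustrative only and is not taken from this PR or from verifiers#965. It assumes a group-mean baseline computed separately for each agent, and `per_agent_advantages`, the rollout layout, and the agent names are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def per_agent_advantages(rollouts):
    """Compute a mean-baselined advantage separately for each agent.

    `rollouts` is a list of dicts mapping an agent id to that agent's
    scalar reward in one rollout (hypothetical layout, not the PR's).
    """
    # Collect each agent's rewards across all rollouts in the group.
    rewards_by_agent = defaultdict(list)
    for rewards in rollouts:
        for agent, reward in rewards.items():
            rewards_by_agent[agent].append(reward)

    # Assumption: the baseline is the per-agent mean reward over the group.
    baselines = {agent: mean(rs) for agent, rs in rewards_by_agent.items()}

    # Advantage = reward minus that agent's own baseline.
    return [
        {agent: reward - baselines[agent] for agent, reward in rewards.items()}
        for rewards in rollouts
    ]


# Two rollouts of a hypothetical two-agent environment.
rollouts = [
    {"proposer": 1.0, "critic": 0.0},
    {"proposer": 0.0, "critic": 0.5},
]
advantages = per_agent_advantages(rollouts)
```

Because each agent is baselined against its own rewards, agents whose reward functions live on different scales do not distort each other's advantages, which is the motivation a heterogeneous-reward setup would have for keeping them separate.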

nph4rd force-pushed the multiagent-heterogeneous-rewards branch from ef8659c to 4937e10 on May 20, 2026 20:16
nph4rd force-pushed the multiagent-heterogeneous-rewards branch from 1a981df to 42796a7 on May 20, 2026 22:18