Pull requests: opendilab/LightZero
Pull requests list
WIP: feature(pu): add init version of alphazero batch
#444
opened Nov 25, 2025 by
puyuan1996
feature(tj): add the loss landscape module
enhancement
New feature or request
research
Research work in progress
#443
opened Nov 24, 2025 by
tAnGjIa520
feature(mycve): add chinese chess env and related demo
enhancement
New feature or request
environment
New or improved environment
#442
opened Nov 23, 2025 by
mycve
feature(xjy): add the rnd-related features
research
Research work in progress
#438
opened Nov 7, 2025 by
xiongjyu
feature(tj): add evaluation utils from open-reasoner-zero to priorzero
research
Research work in progress
#437
opened Nov 7, 2025 by
tAnGjIa520
WIP: feature(pu): add priorzero init version
research
Research work in progress
#433
opened Oct 23, 2025 by
puyuan1996
feature(xjy): Unizero changes MCTs to PPO for strategy optimization in the Jericho environment
research
Research work in progress
#425
opened Oct 8, 2025 by
xiongjyu
feature(tj): add monitoring for the gradient conflict metric of MoE in ScaleZero
#421
opened Sep 27, 2025 by
tAnGjIa520
feature(xjy): add RND configuration in unizero environment
enhancement
New feature or request
research
Research work in progress
#420
opened Sep 26, 2025 by
xiongjyu
feature(tj): add monitoring for the gradient conflict metric of MoE in ScaleZero
config
New or improved configuration
research
Research work in progress
#418
opened Sep 19, 2025 by
tAnGjIa520
feature(pu): add atari/dmc multitask and balance pipeline in ScaleZero paper
config
New or improved configuration
enhancement
New feature or request
research
Research work in progress
#417
opened Sep 18, 2025 by
puyuan1996
fix(xjy): adding the messenger environment
environment
New or improved environment
research
Research work in progress
#405
opened Aug 18, 2025 by
xiongjyu
fix(tj): add moe grad analysis toy example
config
New or improved configuration
#401
opened Aug 12, 2025 by
tAnGjIa520
fix(pu): fix longrun performance of muzero in mspacman and qbert
bug
Something isn't working
config
New or improved configuration
#400
opened Aug 12, 2025 by
puyuan1996
fix(tj): finetune spaceinvaders from atari26 pretrained ckpt in ScaleZero
enhancement
New feature or request
research
Research work in progress
#399
opened Aug 12, 2025 by
tAnGjIa520
WIP: polish(pu): add a polished version of qwen prior policy
#397
opened Aug 11, 2025 by
puyuan1996
WIP: feature(nyz/pu): add init version of async demo using task pipeline
#396
opened Aug 5, 2025 by
puyuan1996
WIP: feature(pu): add init version of async unizero using multi-threading
#395
opened Aug 1, 2025 by
puyuan1996
feature(xjy): add multi-task learning pipeline in jericho environment
config
New or improved configuration
enhancement
New feature or request
#365
opened May 27, 2025 by
xiongjyu
fix(pu): fix chess reset bug when use alphazero ctree
#364
opened May 23, 2025 by
puyuan1996
How to fix the bug of loading trained model for evaluation
#340
opened Apr 2, 2025 by
xiongjyu
feature(xjy): add mamba2 as a unizero backbone option
algorithm
New algorithm
#338
opened Mar 31, 2025 by
xiongjyu
WIP: feature(pu): add muzero with history encoder
algorithm
New algorithm
enhancement
New feature or request
#334
opened Mar 21, 2025 by
puyuan1996