opendilab / LightZero Public

Notifications You must be signed in to change notification settings
Fork 182
Star 1.5k

Code
Issues 6
Pull requests 39
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: opendilab/LightZero

Labels 21 Milestones 0

New pull request New

39 Open 250 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

WIP: feature(pu): add init version of alphazero batch

#444 opened Nov 25, 2025 by puyuan1996

Loading…

feature(tj): add the loss landscape module enhancement

New feature or request

research

Research work in progress

#443 opened Nov 24, 2025 by tAnGjIa520

Loading…

feature(mycve): add chinese chess env and related demo enhancement

New feature or request

environment

New or improved environment

#442 opened Nov 23, 2025 by mycve

Loading…

feature(xjy): Fixed the accumulate_steps, game_segment/weighted_total_loss bugs and refine prompts, compute_llm_prior, and SFT loss, and added cprofile functionality.

#441 opened Nov 20, 2025 by xiongjyu

Loading…

feature(xjy): add the rnd-related features research

Research work in progress

#438 opened Nov 7, 2025 by xiongjyu

Loading…

feature(tj): add evaluation utils from open-reasoner-zero to priorzero research

Research work in progress

#437 opened Nov 7, 2025 by tAnGjIa520

Loading…

WIP: feature(pu): add priorzero init version research

Research work in progress

#433 opened Oct 23, 2025 by puyuan1996

Loading…

TEP: only for comparing

#426 opened Oct 9, 2025 by puyuan1996

Loading…

feature(xjy): Unizero changes MCTs to PPO for strategy optimization in the Jericho environment research

Research work in progress

#425 opened Oct 8, 2025 by xiongjyu

Loading…

feature(tj): add monitoring for the gradient conflict metric of MoE in ScaleZero

#421 opened Sep 27, 2025 by tAnGjIa520

Loading…

feature(xjy): add RND configuration in unizero environment enhancement

New feature or request

research

Research work in progress

#420 opened Sep 26, 2025 by xiongjyu

Loading…

feature(tj): add monitoring for the gradient conflict metric of MoE in ScaleZero config

New or improved configuration

research

Research work in progress

#418 opened Sep 19, 2025 by tAnGjIa520

Loading…

feature(pu): add atari/dmc multitask and balance pipeline in ScaleZero paper config

New or improved configuration

enhancement

New feature or request

research

Research work in progress

#417 opened Sep 18, 2025 by puyuan1996

Loading…

fix(xjy): adding the messenger environment environment

New or improved environment

research

Research work in progress

#405 opened Aug 18, 2025 by xiongjyu

Loading…

fix(tj): add moe grad analysis toy example config

New or improved configuration

#401 opened Aug 12, 2025 by tAnGjIa520

Loading…

fix(pu): fix longrun performance of muzero in mspacman and qbert bug

Something isn't working

config

New or improved configuration

#400 opened Aug 12, 2025 by puyuan1996

Loading…

fix(tj): finetune spaceinvaders from atari26 pretrained ckpt in ScaleZero enhancement

New feature or request

research

Research work in progress

#399 opened Aug 12, 2025 by tAnGjIa520

Loading…

WIP: polish(pu): add a polished version of qwen prior policy

#397 opened Aug 11, 2025 by puyuan1996

Loading…

WIP: feature(nyz/pu): add init version of async demo using task pipeline

#396 opened Aug 5, 2025 by puyuan1996

Loading…

WIP: feature(pu): add init version of async unizero using multi-threading

#395 opened Aug 1, 2025 by puyuan1996

Loading…

feature(xjy): add multi-task learning pipeline in jericho environment config

New or improved configuration

enhancement

New feature or request

#365 opened May 27, 2025 by xiongjyu

Loading…

fix(pu): fix chess reset bug when use alphazero ctree

#364 opened May 23, 2025 by puyuan1996

Loading…

How to fix the bug of loading trained model for evaluation

#340 opened Apr 2, 2025 by xiongjyu

Loading…

feature(xjy): add mamba2 as a unizero backbone option algorithm

New algorithm

#338 opened Mar 31, 2025 by xiongjyu

Loading…

WIP: feature(pu): add muzero with history encoder algorithm

New algorithm

enhancement

New feature or request

#334 opened Mar 21, 2025 by puyuan1996

Loading…

Previous 1 2 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!