[tinker] Support PPO loss with Tinker and add critic model in SkyRLTrainBackend #1389

Open
tamoghnokandar wants to merge 3 commits into NovaSky-AI:main from tamoghnokandar:add_logprobs

Commits

Commits on Mar 25, 2026

Commits on Mar 26, 2026

  • Tamoghno Kandar authored and committed