Skip to content

SkyRL H1 2026 Roadmap #1391

@SumanthRH

Description

@SumanthRH

Note to Community

This roadmap is a living document, and we welcome community input and feedback. We will update this roadmap to link to specific Issues and PRs for each sub-task.

Overview

SkyRL's focus in H1 2026 will be pushing performance for large scale MoE async RL and improving usability via the tinker API.

Tinker-ification

Overview: Improving SkyRL's Tinker API server and expanding the supported set of configurations and modalities.

The first step here has been the unification of the skyrl-train and skyrl-tx packages into skyrl #1145

For a detailed list of issues, please see #1380

Large Scale MoE RL

Overview: Integrate the latest algorithmic improvements and improve the scalability of SkyRL for RL training on large MoE models with Megatron + vLLM.

For a detailed list of megatron backend specific tasks, see #1392

Step Wise + Async RL

Simplify the Inference Stack

Overview: Simplify the inference engine stack in SkyRL to standardize around HTTP, allowing seamless integrations with high-performance routing and scaling layers for large scale RL. Introduce native APIs into vLLM for RL to improve ease-of-use for RL frameworks like SkyRL.

Agent framework Integrations

Overview: Improve integrations with agent frameworks like Harbor

VLM Support

Overview: Support VLM training in SkyRL.

For more details, please see #1200

Improve entrypoint experience in SkyRL

Overview: Improve usability of SkyRL by providing a more pythonic instantation experience and improving the CLI

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions