[train] Support LoRA-only weight syncing for Megatron backend #1336

@SumanthRH

Description

Summary

Currently, for weight syncing with LoRA, SkyRL's Megatron backend fuses the LoRA weights into the base model and syncs the merged weights with the vLLM replicas. SkyRL should support syncing just the LoRA weights and enabling LoRA serving on the vLLM side.
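For context, a minimal sketch of the current "fuse then sync" path: the LoRA update `B @ A` (scaled by `alpha / r`) is merged into the base weight before the merged matrix is shipped to the inference replicas. Plain-Python matrices keep the sketch dependency-free; the shapes and scaling follow the standard LoRA formulation, not SkyRL's actual implementation.

```python
def matmul(X, Y):
    """Naive matrix multiply over nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def merge_lora(W, A, B, alpha, r):
    """Return W + (alpha / r) * (B @ A), i.e. the fused weight that
    would be synced today instead of the adapter itself."""
    delta = matmul(B, A)
    scale = alpha / r
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

W = [[1.0, 0.0], [0.0, 1.0]]   # 2x2 base weight (frozen)
A = [[1.0, 0.0]]               # rank r=1 adapter: A is (r x in)
B = [[1.0], [0.0]]             # B is (out x r)

merged = merge_lora(W, A, B, alpha=2.0, r=1)
# merged == [[3.0, 0.0], [0.0, 1.0]]
```

Syncing `merged` means transferring the full weight matrix on every update; syncing only `A` and `B` transfers `r * (in + out)` values instead.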

This was previously attempted in #885 but it also requires vLLM changes.

  1. Change Megatron's weight extractor to transfer only the trainable LoRA weights (this should be doable with export_adapter_weights (link) in Megatron-Bridge)
  2. Fix vLLM's weight loading in LRUCacheWorkerLoRAManager upstream to allow LoRA weight loading with in-memory LoRA weights
  3. Upgrade SkyRL to the vLLM release that includes the above changes, once 2. lands
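Step 1 above could be sketched as filtering the full state dict down to the trainable adapter tensors before syncing. The `lora_A`/`lora_B` name pattern follows common PEFT-style conventions and is an assumption here, not SkyRL's or Megatron-Bridge's actual naming; the real implementation would use export_adapter_weights.

```python
def extract_lora_weights(state_dict):
    """Return only the LoRA adapter entries from a full state dict.
    The name filter is a placeholder for the adapter-naming convention
    the backend actually uses."""
    return {
        name: tensor
        for name, tensor in state_dict.items()
        if "lora_A" in name or "lora_B" in name
    }

# Hypothetical state dict: one frozen base weight plus two adapters.
full_state = {
    "layers.0.attn.qkv.weight": "base",          # frozen, not synced
    "layers.0.attn.qkv.lora_A.weight": "a",      # trainable adapter
    "layers.0.attn.qkv.lora_B.weight": "b",      # trainable adapter
}

lora_only = extract_lora_weights(full_state)
# lora_only has only the two adapter entries; the base weight stays put.
```

Only `lora_only` would be transferred to the vLLM replicas, which then serve it as an in-memory LoRA adapter (step 2).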

Reference PRs:
verl: verl-project/verl#4632
Nemo-RL: NVIDIA-NeMo/RL#1797 (three PRs linked there, along with relevant benchmarking of the LoRA-only weight-update code path)
