Skip to content
@NVIDIA-NeMo

NVIDIA-NeMo

NVIDIA NeMo Framework Overview

NeMo Framework is NVIDIA's GPU accelerated, end-to-end training framework for large language models (LLMs), multi-modal models and speech models. It enables seamless scaling of training (both pretraining and post-training) workloads from single GPU to thousand-node clusters for both 🤗Hugging Face/PyTorch and Megatron models. This GitHub organization includes a suite of libraries and recipe collections to help users train models from end to end.

NeMo Framework is also a part of the NVIDIA NeMo software suite for managing the AI agent lifecycle.

Latest 📣 announcements and 🗣️ discussions

🐳 NeMo AutoModel

🔬 NeMo RL

💬 NeMo Speech

More to come and stay tuned!

Getting Started

Installation Checkpoint Conversion HF<>Megatron LLM example recipes and scripts VLM example recipes and scripts
Under 1,000 GPUs NeMo Automodel, NeMo RL No Need Pre-training, SFT, LoRA, DPO, GRPO SFT, LoRA, GRPO
Over 1,000 GPUs NeMo Megatron-Bridge, NeMo-RL Conversion Pretrain, SFT, and LoRA, DPO with megatron_cfg, GRPO with megatron_cfg SFT, LoRA, GRPO megatron config

Repo organization under NeMo Framework

Summary of key functionalities and container strategy of each repo

Visit the individual repos to find out more 🔍, raise 🐛, contribute ✍️ and participate in discussion forums 🗣️!

Repo Key Functionality & Documentation Link Training Loop Training Backends Infernece Backends Model Coverage Container
NeMo Megatron-Bridge Pretraining, LoRA, SFT PyT native loop Megatron-core NA LLM & VLM NeMo Framework Container
NeMo AutoModel Pretraining, LoRA, SFT PyT native loop PyTorch NA LLM, VLM, Omni, VFM NeMo AutoModel Container
Previous NeMo ->will repurpose to focus on Speech Pretraining,SFT PyTorch Lightning Loop Megatron-core & PyTorch RIVA Speech NA
NeMo RL SFT, RL PyT native loop Megatron-core & PyTorch vLLM LLM, VLM NeMo RL container
NeMo Gym RL Environment, integrate with RL Framework NA NA NA NA NeMo RL Container (WIP)
NeMo Aligner (deprecated) SFT, RL PyT Lightning Loop Megatron-core TRTLLM LLM NA
NeMo Curator Data curation NA NA NA Agnostic NeMo Curator Container
NeMo Evaluator Model evaluation NA NA Agnostic NeMo Framework Container
NeMo Export-Deploy Export to Production NA NA vLLM, TRT, TRTLLM, ONNX Agnostic NeMo Framework Container
NeMo Run Experiment launcher NA NA NA Agnostic NeMo Framework Container
NeMo Guardrails Guardrail model response NA NA NA NA
NeMo Skills Reference pipeline for SDG & Eval NA NA NA Agnostic NA
NeMo Emerging Optimizers Collection of Optimizers NA Agnostic NA NA NA
NeMo DFM (WIP) Diffusion foundation model training PyT native loop Megatron-core and PyTorch PyTorch VFM, Diffusion TBD
NeMotron Developer asset hub for nemotron models NA NA NA Nemotron models NA
NeMo Data-designer Synthetic data generation toolkit NA NA NA NA NA
Table 1. NeMo Framework Repos

Diagram Ilustration of Repos under NeMo Framework (WIP)

image

Figure 1. NeMo Framework Repo Overview

Some background motivations and historical contexts

The NeMo GitHub Org and its repo collections are created to address the following problems

  • Need for composability: The Previous NeMo is monolithic and encompasses too many things, making it hard for users to find what they need. Container size is also an issue. Breaking down the Monolithic repo into a series of functional-focused repos to facilitate code discovery.
  • Need for customizability: The Previous NeMo uses PyTorch Lighting as the default trainer loop, which provides some out of the box functionality but making it hard to customize. NeMo Megatron-Bridge, NeMo AutoModel, and NeMo RL have adopted pytorch native custom loop to improve flexibility and ease of use for developers.

License

Apache 2.0 licensed with third-party attributions documented in each repository.

Pinned Loading

  1. Curator Curator Public

    Scalable data pre processing and curation toolkit for LLMs

    Python 1.2k 191

  2. RL RL Public

    Scalable toolkit for efficient model reinforcement

    Python 1k 173

  3. Automodel Automodel Public

    Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

    Python 190 25

  4. Megatron-Bridge Megatron-Bridge Public

    HuggingFace conversion and training library for Megatron-based models

    Python 226 70

  5. Guardrails Guardrails Public

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

    Python 5.3k 563

  6. Gym Gym Public

    Build RL environments for LLM training

    Python 66 3

Repositories

Showing 10 of 17 repositories
  • Megatron-Bridge Public

    HuggingFace conversion and training library for Megatron-based models

    NVIDIA-NeMo/Megatron-Bridge’s past year of commit activity
    Python 226 Apache-2.0 70 153 (1 issue needs help) 97 Updated Nov 28, 2025
  • Guardrails Public

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

    NVIDIA-NeMo/Guardrails’s past year of commit activity
    Python 5,337 563 128 (5 issues need help) 37 Updated Nov 28, 2025
  • RL Public

    Scalable toolkit for efficient model reinforcement

    NVIDIA-NeMo/RL’s past year of commit activity
    Python 1,048 Apache-2.0 173 292 (2 issues need help) 74 Updated Nov 28, 2025
  • Automodel Public

    Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

    NVIDIA-NeMo/Automodel’s past year of commit activity
    Python 190 Apache-2.0 25 64 (2 issues need help) 47 Updated Nov 28, 2025
  • NVIDIA-NeMo/ProRL-Agent-Server’s past year of commit activity
    Python 15 2 0 1 Updated Nov 28, 2025
  • Run Public

    A tool to configure, launch and manage your machine learning experiments.

    NVIDIA-NeMo/Run’s past year of commit activity
    Python 208 Apache-2.0 85 15 13 Updated Nov 28, 2025
  • Export-Deploy Public

    A library for exporting models including NeMo and Hugging Face to optimized inference backends, and deploying them for efficient querying

    NVIDIA-NeMo/Export-Deploy’s past year of commit activity
    Python 21 Apache-2.0 2 25 29 Updated Nov 27, 2025
  • Evaluator Public

    Open-source library for scalable, reproducible evaluation of AI models and benchmarks.

    NVIDIA-NeMo/Evaluator’s past year of commit activity
    Python 104 Apache-2.0 11 10 9 Updated Nov 28, 2025
  • Skills Public

    A project to improve skills of large language models

    NVIDIA-NeMo/Skills’s past year of commit activity
    Python 626 Apache-2.0 115 63 17 Updated Nov 27, 2025
  • FW-CI-templates Public

    CI/CD templates for NeMo-FW libraries

    NVIDIA-NeMo/FW-CI-templates’s past year of commit activity
    Python 5 Apache-2.0 4 0 0 Updated Nov 27, 2025