rl
Here are 1,133 public repositories matching this topic...
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
- 
            Updated
            
Nov 4, 2024  - Jupyter Notebook
 
An elegant PyTorch deep reinforcement learning library.
- 
            Updated
            
Oct 29, 2025  - Python
 
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
- 
            Updated
            
Oct 24, 2025  - Python
 
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
- 
            Updated
            
Apr 24, 2024  - Python
 
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
- 
            Updated
            
Jun 21, 2019  - C++
 
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
- 
            Updated
            
Nov 3, 2025  - Python
 
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
- 
            Updated
            
Nov 4, 2025  - Python
 
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
- 
            Updated
            
Oct 15, 2025  - Python
 
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
- 
            Updated
            
Dec 11, 2022  - Python
 
A Survey of Reinforcement Learning for Large Reasoning Models
- 
            Updated
            
Oct 29, 2025  
Implementation of papers in 100 lines of code.
- 
            Updated
            
Nov 3, 2025  - Python
 
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
- 
            Updated
            
Dec 7, 2022  - Python
 
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
- 
            Updated
            
Oct 17, 2022  - Python
 
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
- 
            Updated
            
Nov 4, 2025  - Python
 
Improve this page
Add a description, image, and links to the rl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rl topic, visit your repo's landing page and select "manage topics."