This project uses Proximal Policy Optimization (PPO) to automatically tune the hyperparameters of Simulated Annealing (SA).
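As a rough illustration of the idea (a minimal sketch under assumed details, not the repo's actual API; all names below are hypothetical), each training episode runs SA on a benchmark such as the Rastrigin function with the hyperparameters chosen by the agent, and the negated best objective value serves as the reward:

```python
import numpy as np

def rastrigin(x: np.ndarray) -> float:
    """Rastrigin benchmark: global minimum 0 at x = 0."""
    return 10 * x.size + float(np.sum(x**2 - 10 * np.cos(2 * np.pi * x)))

def simulated_annealing(t0: float, cooling: float, n_steps: int = 2000,
                        dim: int = 5, seed: int = 0) -> float:
    """Run SA on Rastrigin with the given hyperparameters; return the best value found."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(-5.12, 5.12, dim)
    best = current = rastrigin(x)
    temp = t0
    for _ in range(n_steps):
        candidate = x + rng.normal(0.0, 0.5, dim)
        cand_val = rastrigin(candidate)
        # Metropolis rule: always accept improvements, sometimes accept worse moves
        if cand_val < current or rng.random() < np.exp((current - cand_val) / max(temp, 1e-12)):
            x, current = candidate, cand_val
            best = min(best, current)
        temp *= cooling  # geometric cooling schedule
    return best

def sa_tuning_reward(action: np.ndarray) -> float:
    """One environment step as a PPO agent might see it: action -> reward (hypothetical interface)."""
    t0 = float(np.clip(action[0], 0.1, 100.0))         # initial temperature
    cooling = float(np.clip(action[1], 0.80, 0.9999))  # cooling rate
    return -simulated_annealing(t0, cooling)           # higher reward = better SA run
```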
## Roadmap

More updates coming soon:

- PPO to tune PPO?
- Other benchmarks beyond the Rastrigin function
- Can we shape the reward signal so the tuned SA not only performs well but also runs more efficiently?
## Requirements

Python 3.12+
## Installation

```bash
# Install uv
pip install uv

# Install dependencies
uv sync

# Build the Rust extension (optional, for better performance)
uv run maturin develop --release
```
## Usage

```bash
# Run PPO training to tune SA hyperparameters
python run_experiment.py

# Run grid search over SA hyperparameters
python run_grid_search.py
```

## Project Structure

```
meta-learning/
├── run_experiment.py # PPO training runner
├── run_grid_search.py # Grid search runner
├── core/ # Core implementation modules
│ ├── sa_algorithms/ # SA algorithm implementations
│ │ ├── python_serial.py # Python serial SA
│ │ └── rust_parallel.py # Rust parallel SA (fast)
│ ├── sa_config.py # SA algorithm configuration
│ ├── tuning_env.py # PPO training environment
│ └── ppo_agent.py # PPO agent implementation
├── outputs/ # Generated plots and results
├── src/ # Rust source code
│ └── lib.rs # Rust SA implementation
└── README.md # This file
```
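For reference, a grid search over the same two hyperparameters could look like the sketch below (again hypothetical; it reuses the `rastrigin` and `simulated_annealing` helpers from the sketch above rather than the repo's actual modules, and `run_grid_search.py` may differ):

```python
import itertools
import numpy as np

# Candidate values for initial temperature and cooling rate (illustrative grid)
grid_t0 = [1.0, 10.0, 100.0]
grid_cooling = [0.90, 0.99, 0.999]

results = {}
for t0, cooling in itertools.product(grid_t0, grid_cooling):
    # Average over a few seeds so one lucky run does not dominate the comparison
    scores = [simulated_annealing(t0, cooling, seed=s) for s in range(3)]
    results[(t0, cooling)] = float(np.mean(scores))

best_cfg = min(results, key=results.get)
print(f"best (t0, cooling) = {best_cfg}; mean best objective = {results[best_cfg]:.3f}")
```

Unlike PPO, the grid search evaluates every combination exhaustively, which is simple and reproducible but scales poorly as more hyperparameters are added.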