AI-powered biomolecular structure prediction and binding affinity analysis — Interactive Jupyter notebooks for protein structure prediction, protein-ligand binding, and multi-entity complex modeling using the Boltz2 diffusion model. No local GPU installation required.
Launch in Google Colab (Free GPU):
Note: Google Colab provides free GPU access (T4 GPU). For best performance, select GPU runtime:
Runtime → Change runtime type → GPU
- Protein Structure Prediction — Diffusion-based AI modeling for single proteins and multi-chain complexes
- Protein-Ligand Binding — Predict and score molecular interactions with affinity estimation
- Multi-Entity Support — Handle proteins, DNA/RNA, ligands, and custom modifications simultaneously
- Advanced Constraints — Covalent bonds, binding pocket conditioning, contact constraints, template guidance
- Confidence Metrics — Per-residue confidence (pLDDT), Predicted Aligned Error (PAE), affinity predictions
- Interactive Visualization — 3D structure viewer with confidence overlays and binding analysis dashboard
- GPU Acceleration — CUDA-enabled with free T4 GPU access in Google Colab
- Zero Installation — Runs entirely in Google Colab (no local GPU setup required)
| Feature | V1.0.0 | V2.0.0 | Batch |
|---|---|---|---|
| Single protein prediction | ✅ | ✅ | ✅ |
| Protein-ligand binding | ✅ | ✅ | ✅ |
| Multi-chain complexes | ✅ | ✅ | ❌ |
| DNA/RNA support | ❌ | ✅ | ❌ |
| Template guidance | ✅ | ✅ | ✅ |
| Custom MSA upload | ❌ | ✅ | ❌ |
| Post-translational modifications | ❌ | ✅ | ❌ |
| Custom constraints | ❌ | ✅ | ❌ |
| Covalent bonds | ❌ | ✅ | ❌ |
| Cyclic peptides | ❌ | ✅ | ❌ |
| Affinity prediction | ✅ | ✅ | ✅ |
| Batch processing | ❌ | ❌ | ✅ |
| CSV/FASTA input | ❌ | ❌ | ✅ |
| 3D visualization | ✅ | ✅ | ✅ |
- New users or standard predictions? → Use V1.0.0 (stable, battle-tested)
- Advanced modeling needs? → Use V2.0.0 (latest features, DNA/RNA, constraints)
- Large-scale screening? → Use Batch v1.0 (automated high-throughput)
| Notebook | Status | Best For |
|---|---|---|
| V2.0.0 | Advanced modeling, multi-entity complexes, structure-guided design | |
| V1.0.0 | Production predictions, protein-ligand binding, standard analysis | |
| Batch v1.0 | High-throughput screening, batch processing, automation | |
| Parameter Generator | Utility | Custom configuration and input building |
┌─────────────────────────────────────────────────────────────┐
│ STEP 1: Setup Environment & Dependencies │
│ - Install Boltz2, PyTorch, CUDA support │
│ - Initialize workspace directories │
│ - Google Drive integration (optional) │
└────────────────┬────────────────────────────────────────────┘
↓
┌─────────────────────────────────────────────────────────────┐
│ STEP 2: Input Builder (param_gen.py) │
│ - Define protein/DNA/RNA sequences │
│ - Add ligands (SMILES or CCD code) │
│ - Upload templates & custom MSA files │
│ - Define constraints (bonds, pockets, contacts) │
│ - Generate params.yaml & run_params.txt │
└────────────────┬────────────────────────────────────────────┘
↓
┌─────────────────────────────────────────────────────────────┐
│ STEP 3: Boltz2 Execution (Boltz_Run.py) │
│ - MSA generation (online or pre-computed) │
│ - Diffusion-based structure prediction │
│ - Recycling steps for refinement │
│ - Generate PDB/CIF models │
│ - Compute confidence scores │
└────────────────┬────────────────────────────────────────────┘
↓
┌─────────────────────────────────────────────────────────────┐
│ STEP 4: Analysis & Visualization (analysis.py) │
│ - Extract pLDDT (per-residue confidence) │
│ - Extract PAE (predicted aligned error) │
│ - Compute affinity predictions │
│ - Generate confidence plots │
│ - Create interactive 3D viewer │
│ - Display hit discovery dashboard │
└─────────────────────────────────────────────────────────────┘
Typical Time: 2-10 minutes on T4 GPU (depending on complexity)
- Parameter Loading - Reads
run_params.txtwith job settings - Directory Setup - Creates output folder structure
- MSA Generation - Fetches homologous sequences (unless pre-provided)
- Boltz2 Command - Constructs and runs prediction:
boltz predict params.yaml \ --out_dir job_name \ --recycling_steps 3 \ --sampling_steps 200 \ --diffusion_samples 1 \ --step_scale 1.638 \ --max_msa_seqs 8192 \ --msa_pairing_strategy unpaired_paired \ --use_msa_server
- Output Generation - Creates PDB/CIF files with predictions
- Visualization - Extracts data for 3D rendering
| Parameter | Default | Range | Meaning |
|---|---|---|---|
recycling_steps |
3 | 1-10 | Model refinement iterations |
sampling_steps |
200 | 50-500 | Diffusion sampling iterations |
diffusion_samples |
1 | 1-10 | Number of structure samples per job |
step_scale |
1.638 | 0.5-2.0 | Scaling for diffusion steps |
max_msa_seqs |
8192 | 256-8192 | Max homolog sequences |
msa_pairing_strategy |
unpaired_paired | paired / unpaired / greedy | MSA alignment strategy |
use_potentials |
False | True/False | Physics-based refinement |
override |
False | True/False | Re-run even if output exists |
Input: Single protein sequence
Output: PDB file + confidence metrics
Time: ~2-5 min
Input: Protein sequence + ligand SMILES
Output: Complex structure + binding affinity + hit discovery dashboard
Time: ~3-7 min
Input: 2-3 protein sequences
Output: Full complex structure + interface metrics (PAE, pLDDT)
Time: ~5-10 min
Input: Protein variant + reference PDB template
Output: Faster convergence + higher confidence in conserved regions
Time: ~2-5 min
| Issue | Cause | Solution |
|---|---|---|
| "CUDA out of memory" | Sequence too long | Reduce max_msa_seqs to 256, use fast profile |
| "MSA generation failed" | Server/connectivity issue | Use custom MSA file instead of server |
| "Invalid SMILES" | Malformed ligand notation | Validate SMILES at chemspider.com |
| "No model file found" | Boltz2 crashed | Check output logs, verify params.yaml |
| "pLDDT all low (<50)" | Novel/ambiguous fold | Try template guidance if available |
| Low affinity confidence | Complex interface | Increase sampling_steps to 500 |
- Version Table — Feature comparison across releases
- Changelog — Complete release history
- Batch README — High-throughput batch processing guide
If you use Boltz2-Notebook, please cite:
@article{Passaro2025,
title={Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction},
author={Passaro, S. and Corso, G. and Wohlwend, J. and others},
journal={bioRxiv},
year={2025}
}
@article{Wohlwend2024,
title={Boltz-1: Democratizing Biomolecular Interaction Modeling},
author={Wohlwend, J. and Corso, G. and Passaro, S. and others},
journal={bioRxiv},
year={2024}
}- Notebook: Atharva Tilewale & Dr. Dhaval Patel (Gujarat Biotechnology University)
- Boltz2 Model: Original Repository
- License: MIT License — See LICENSE for details.
- Issues & Bugs: GitHub Issues
- Feature Requests: GitHub Discussions
- Questions: See the Q&A Documentation or open a discussion
Last Updated: April 2026 | Version: 2.0.0 (Latest) | Status: ✅ Production Ready
