Skip to content

Latest commit

 

History

History
331 lines (220 loc) · 13.3 KB

README.md

File metadata and controls

331 lines (220 loc) · 13.3 KB

Oligo Designer Toolsuite - Lightweight Development of Custom Oligo Design Pipelines

test PyPI codecov stars License: MIT DOI

📃 Documentation

Oligonucleotides (abbrev. oligos) are short, synthetic strands of DNA or RNA that are designed with respect to a specific target region and have many application areas, ranging from research to disease diagnosis or therapeutics. Oligos can be used as primers during DNA amplification, as probes for in situ hybridization or as guide RNAs for CRISPR-based gene editing. Based on the intended application and experimental design, researchers have to customize the length, sequence composition, and thermodynamic properties of the designed oligos.

Various tools exist that provide custom design of oligo sequences depending on the area of application. Even though most tools apply the same basic processing steps, ranging from the generation of custom-length oligo sequences, the filtering of oligo sequences based on thermodynamic properties as well as the selection of an optimal set of oligos, each newly developed tool uses its own implementation and different package dependencies. Consequently, not only the development of new tools is slowed down, but also the maintenance and modification of existing tools is hampered, because developers do not have a common resource for those functionalities to use. We tackle this issue with our open-source Oligo Designer Toolsuite.

🚀 Oligo Designer Toolsuite is a collection of modules that provides all basic functionalities for custom oligo design pipelines as well as advanced experiment-specific functionalities like machine learning models for oligo specificity prediction within a flexible Python framework.

To allow the flexible usage of different modules, depending on the required processing steps, we developed a common underlying data structure that ensures the cross-compatibility of all modules within the framework. This data structure is runtime and memory optimized to enable the processing of large sequence dataset in a reasonable time frame. With our Oligo Designer Toolsuite we aim to set new standards in the development of oligo design pipelines, helping to accelerate the development of new tools and facilitate the upgrade of existing tools with the latest developments in the field. We also provide ready-to-use oligo design pipelines for specific experimental setups, e.g. MERFISH or SeqFISH+ probe design for Spatial Transcriptomics.

Installation

Requirements

This packages was tested for Python 3.9 - 3.12 on Linux (x64) and MacOS (osx64 and arm64).

For stable installation, we recommend to first setup a conda environment.

Note: if your institution does not support anaconda, you can use miniforge instead to run the conda installations.

🖥️ Linux (x64) Requirements

First create a conda environment:

conda create -n odt-x64 python=3.11
conda activate odt-x64

To install the additional required tools via conda, please activate the bioconda and conda-forge channels in your conda environment and update conda and all packages in your environment:

conda config --add channels bioconda
conda config --add channels conda-forge
conda update --all

The following additional tools Blast, BedTools, Bowtie and Bowtie2 need to be installed independently:

conda install "blast>=2.15.0" 
conda install "bedtools>=2.30"
conda install "bowtie>=1.3.1"
conda install "bowtie2>=2.5"

All other required packages are automatically installed if installation is done via pip.

🖥️ MacOS (osx64) Requirements

First create a conda environment:

conda create -n odt-x64 python=3.11
conda activate odt-x64

To install the additional required tools via conda, please activate the bioconda and conda-forge channels in your conda environment and update conda and all packages in your environment:

conda config --add channels bioconda
conda config --add channels conda-forge
conda update --all

The following additional tools Blast, BedTools, Bowtie and Bowtie2 need to be installed independently:

conda install "blast>=2.15.0" 
conda install "bedtools>=2.30"
conda install "bowtie>=1.3.1"
conda install "bowtie2>=2.5"

Since torch > 2.2.2 installation is not provided anymore for osx64 processor, we need to make sure to have a numpy version which is numpy < 2.0 to avoid conflicts which torch <= 2.2.2:

pip install "numpy<2.0"

All other required packages are automatically installed if installation is done via pip.

🖥️ MacOS (arm64) Requirements

For the Apple M chips, there is currently no Blast installation available via conda. Hence, we need a workaround for the Blast installation. Therefore, we have two options:

Option 1: Blast installation via conda installation through osx64 environment emulation

Pro: Works with conda and requires no extra dependencies
Con: Runs via Rosetta osx64 emulation, which does not support installation of torch > 2.2.2

First we need to create an conda environment that emulates the osx64 processor:

CONDA_SUBDIR=osx-64 conda create -n odt-osx64 python=3.11
conda activate odt-osx64
conda config --env --set subdir osx-64

To install the additional required tools via conda, please activate the bioconda and conda-forge channels in your conda environment and update conda and all packages in your environment:

conda config --add channels bioconda
conda config --add channels conda-forge
conda update --all

The following additional tools Blast, BedTools, Bowtie and Bowtie2 need to be installed independently:

conda install "blast>=2.15.0" 
conda install "bedtools>=2.30"
conda install "bowtie>=1.3.1"
conda install "bowtie2>=2.5"

Since torch > 2.2.2 installation is not provided anymore for osx64 processor, we need to make sure to have a numpy version which is numpy < 2.0 to avoid conflicts which torch <= 2.2.2:

pip install "numpy<2.0"

All other required packages are automatically installed if installation is done via pip.

Option 2: Blast installation via Homebrew

Pro: Allows to install a native ARM version of Blast and supports installation of torch > 2.2.2
Con: Requires Homebrew

First create a conda environment:

conda create -n odt-arm64 python=3.11
conda activate odt-arm64

To install the additional required tools via conda, please activate the bioconda and conda-forge channels in your conda environment and update conda and all packages in your environment:

conda config --add channels bioconda
conda config --add channels conda-forge
conda update --all

The following additional tools Blast, BedTools, Bowtie and Bowtie2 need to be installed independently:

conda install "bedtools>=2.30"
conda install "bowtie>=1.3.1"
conda install "bowtie2>=2.5"

To install the arm64 Blast version you need Homebrew, which can be installed as described here. Blast can then be installed via Homebrew:

brew install blast

All other required packages are automatically installed if installation is done via pip.

🖥️ Windows Requirements

TBD

The following additional tools Blast, BedTools, Bowtie and Bowtie2 need to be installed independently:

Install Options

The installation of the package is done via pip. Note: if you are using conda, first install pip with: conda install pip.

PyPI install:

pip install oligo-designer-toolsuite

Installation from source:

git clone https://github.com/HelmholtzAI-Consultants-Munich/oligo-designer-toolsuite.git
cd oligo-designer-toolsuite
  • Installation as python package (run inside directory):

      pip install .
    
  • Development installation as python package (run inside directory):

      pip install -e .[dev]
    

Implemented Oligo Design Pipelines

The following pipelines are pre-implemented and ready-to-use:

🧬 Genomic Region Generator

🧪 Scrinshot Probe Designer

🧪 SeqFISH+ Probe Designer

🧪 MERFISH Probe Designer

🧫 Oligo-Seq Probe Designer

If you would like to modify an existing pipeline for adjusted experimental settings or to design a new pipeline for a different experimental setup, feel free to reach out to us Lisa Barros de Andrade e Sousa or Isra Mekki.

Contributing

Contributions are more than welcome! Everything from code to notebooks to examples and documentation are all equally valuable so please don't feel you can't contribute. To contribute please fork the project make your changes and submit a pull request. We will do our best to work through any issues with you and get your code merged into the main branch.

For any further inquiries please send an email to Lisa Barros de Andrade e Sousa or Isra Mekki.

How to cite

If the Oligo Designer Toolsuite is useful for your research, consider citing the package:

@software{campi_2023_7823048,
	author   = {Isra Mekki,
		     Francesco Campi,  
		     Louis Kümmerle,
		     Chelsea Bright,
		     Malte Lücken
		     Fabian Theis,
		     Marie Piraud,
		     Lisa Barros de Andrade e Sousa},
    title        = {{Oligo Designer Toolsuite}},
    year         = 2023,
    publisher    = {Zenodo},
    version      = {v0.1.3},
    doi          = {10.5281/zenodo.7823048},
    url          = {https://doi.org/10.5281/zenodo.7823048}
}

If you are using the SCRINSHOT, MERFISH or SeqFISH+ pipeline provided along the Oligo Designer Toolsuite, consider citing in addition the paper:

@article {kuemmerle2024probe,
    author 	 = { Louis B. Kuemmerle,
		     Malte D. Luecken,
		     Alexandra B. Firsova
		     Lisa Barros de Andrade e Sousa
		     Lena Strasser
                     Ilhem Isra Mekki
                     Francesco Campi
		     Lukas Heumos
		     Maiia Shulman
                     Valentina Beliaeva
                     Soroor Hediyeh-Zadeh
                     Anna C. Schaar
		     Krishnaa T. Mahbubani
		     Alexandros Sountoulidis
		     Tamas Balassa
		     Ferenc Kovacs
		     Peter Horvath
		     Marie Piraud
		     Ali Ertürk
		     Christos Samakovlis
		     Fabian J. Theis},
    title 	 = {{Probe set selection for targeted spatial transcriptomics}},
    year 	 = {2024},
    publisher 	 = {Nature Publishing Group US New York},
    journal 	 = {Nature methods},
    doi 	 = {10.1038/s41592-024-02496-z},
    URL 	 = {https://doi.org/10.1038/s41592-024-02496-z}
}

License

oligo-designer-toolsuite is released under the MIT license. See LICENSE for additional details about it.