An ontology of cell type markers.
This ontology integrates cell type markers for cells in the Cell Ontology from various sources along with details of marker context (anatomical context, assay), confidence (where available) and provenance.
This ontology is not currently registered with or approved by the OBO foundry. It is intended, primarily, as a source for the content for the CL knowledge graph.
The ontology captures the following key concepts:
To add new marker files, follow these steps:
- Create a New Branch: Create a new branch in your repository for your changes.
- Prepare Input Data: Add your marker data file(s) to the
src/markers/input
directory. Ensure the file includes the required columns:clusterName
,f_score
,NSForest_markers
, andcxg_dataset_title
. - Add Metadata: Update the
src/markers/input/metadata.csv
file with a new row describing your input file. Include fields likefile_name
,Organ_region
,Species
, and others as specified in the detailed guide. - Create a Pull Request: Open a pull request to merge your branch into the main repository. This will trigger GitHub Actions to validate your input files and metadata.
For more details, refer to the Quick Start Guide or the Detailed Data Upload Guide.
Admins responsible for running the pipeline should follow the instructions provided in the Pipeline Guide.
General workflow can be found at Workflow Documentation.
- Prepare Input Data: Ensure that the input data and metadata in the
src/markers/input
directory are validated and meet the required format. - Prepare Gene Databases: Generate gene databases from the provided anndata files using the
Makefile
insrc/markers
. - Generate Source Files: Use the
dosdp_template_generator.py
script to create or update source files for input data. - Prepare QuickGO Templates: Run the
go_term_template_generator.py
script to fetch and prepare Gene Ontology templates. - Prepare CellxGene Marker Templates: Run the
cellxgene_marker_template_generator.py
script to download marker data, resolve UBERON/NCBITaxon/NCBIgene URIs, filter and cap markers, and prepare a ROBOT TSV (cellxgene_marker_template.tsv
) insrc/templates/cl_kg/
. - Run the ODK Pipeline: Execute the pipeline using the
make prepare_release
command as described in the guide.
Refer to the Pipeline Guide for detailed instructions and additional context.
Please use this GitHub repository's Issue tracker to request new content or report errors or specific concerns related to the ontology.