Skip to content

CLIs that take in input Atlas Pipeline datasets and push them into Nexus along with a Resource properties payload

License

Notifications You must be signed in to change notification settings

BlueBrain/bbp-data-push

Repository files navigation

Description

This module contains command-line executables aka CLIs that take in input atlas pipeline datasets and push them into Nexus along with a resource properties payload, including:

  • push_volumetric to create a VolumetricDataLayer resource payload and push it along with the corresponding volumetric input dataset files into Nexus.

  • push_meshes to create a Mesh resource payload and push it along with the corresponding brain .OBJ mesh folder input dataset files into Nexus.

  • push_cellrecords to create a CellRecordSerie resource payload and push it along with the corresponding Sonata hdf5 file input dataset files into Nexus.

  • push_regionsummary to create a RegionSummary resource payload and push it along with the corresponding brain region metadata json input dataset files into Nexus.

If the Resource already exists in Nexus then it will be updated instead. Eventually push/update their linked atlasRelease and ontology resources. Tag all these resources with the input tag or, if not provided, with a timestamp. Each CLI can process multiple files/directories at once.

The input datasets must be one of the datasets listed in the input configuration file given as the argument of the --config option. These datasets has to correspond to the ones generated by the atlas pipeline (these datasets are referenced on the Pipeline Products page.) The configuration yaml file content structure should look like this configuration example :

  HierarchyJson:
      hierarchy: hierarchy.json
      hierarchy_l23split: hierarchy_l23split.json
  GeneratedDatasetPath:
      VolumetricFile:
          annotation_hybrid: annotation_v2v3_hybrid.nrrd
          annotation_l23split: annotation_l23split.nrrd
          cell_densities: cell_densities
          neuron_densities: neuron_densities
      MeshFile:
          brain_region_meshes_hybrid: brain_region_meshes_v2v3_hybrid
          brain_region_meshes_l23split: brain_region_meshes_l23split
      CellRecordsFile:
          cell_records_sonata: cell_records_sonata.h5
  MetadataFile:
	      metadata_parcellations_ccfv3_l23split: metadata_parcellations_ccfv3_l23split.json

The Resource property payload includes, in addition to various information on the dataset pushed and its content, some provenance informations:

  • The contributor: @id of the user and organisation associated with the input Nexus token.
  • From which datasets it derivates.
  • Informations on the run stocked in the linked Activity resource: the run duration, the software used ...

These four CLIs are hierarchically grouped in the cli initialize-pusher-cli. This CLI allows the Initialisation of the Forge python framework to communicate with Nexus. The Forge will enable to build and push into Nexus the metadata payload along with the input dataset. This means that before calling one of the three CLIs, initialize-pusher-cli must first be called.

Note: the --verbosity argument allows you to print in the console the last resource payload from the list of resource payloads that has been constructed from input datasets that will be pushed into Nexus. If only one dataset has been given as input then its corresponding resource payload will be printed.

Install

pip install "bba-data-push"

Examples

Arguments for initialize-pusher-cli

Run the dataset pusher CLI starting by the Initialisation of the Forge python framework to communicate with Nexus

Inputs

--verbose, -v : Verbosity option. If equal True, the last resource payload from the list of resource payloads that has been constructed from input datasets will be printed. (Optional : boolean).
--forge-config-file : Path to the configuration file used to instantiate the Forge. (Optional, default = "https://raw.githubusercontent.com/BlueBrain/nexus-forge/master/examples/notebooks/use-cases/prod-forge-nexus.yml").
--nexus-env : Nexus environment to use, can be 'dev', staging', 'prod' or the URL of a custom environment. (Optional, default="prod").
--nexus-org : The Nexus organisation to push into. (Optional, default='bbp').
--nexus-org : The Nexus project to push into. (Optional, default='atlas').
--nexus-token-file : Path to the text file containing the Nexus token.

Arguments for push_volumetric

Create a 'VolumetricDataLayer', an 'atlasRelease' and an 'ontology' resource payload to push into Nexus. If the resources already exist in Nexus, they will be fetched and updated instead. This script has been designed to function with volumetric files generated by the Atlas pipeline.

Inputs

--dataset-path : [multiple paths] The files or directories of file to push on Nexus.
--config : Path to the generated dataset configuration file. This is a yaml file containing the paths to the Atlas pipeline generated dataset.
--hierarchy-path : [multiple paths] The path to the json hierarchy file containing an AIBS hierarchy structure.
--hierarchy-jsonld-path : [path] Path to the AIBS hierarchy structure as a JSON-LD file. It is mandatory in case of the creation of a new Ontology resource as it will be attached to it when integrated in the knowledge graph. New Ontology resource is created at the same time a new atlasRelease resource need to be created.
--atlasrelease-config-path : [path] Json file containing the atlasRelease @id as well as its ontology and parcellation volume @id. It needs to contains at least these informations for the atlasRelease Allen Mouse CCFV2 and CCFV3 stocked in the Nexus project bbp/atlas.
--provenance-metadata-path : [path] Path to the Json file containing metadata for the derivation properties as well as the Activity and SoftwareAgent resources.
--resource-tag : [string] Optional tag value with which to tag the resources. (Optional). --link-regions-path : [path] Optional json file containing link between regions and resources (@ ids of mask and mesh for each brain region) to be extracted by the CLI push-regionsummary. If the file already exists it will be annotated else it will be created. (Optional).

Arguments for push_meshes

Create a 'Mesh' , an 'atlasRelease' and an 'ontology' resource payload to push into Nexus. If the resources already exist in Nexus, they will be fetched and updated instead. This script has been designed to function with brain region meshes generated by the Atlas pipeline.

Inputs

--dataset-path : [multiple paths] The files or directories of file to push on Nexus.
--config : [path] Path to the generated dataset configuration file. This is a yaml file containing the paths to the Atlas pipeline generated dataset.
--hierarchy-path : [multiple paths] The path to the json hierarchy file containing an AIBS hierarchy structure.
--hierarchy-jsonld-path : [path] Path to the AIBS hierarchy structure as a JSON-LD file. It is mandatory in case of the creation of a new Ontology resource as it will be attached to it when integrated in the knowledge graph. New Ontology resource is created at the same time a new atlasRelease resource need to be created.
--atlasrelease-config-path : [path] Json file containing the atlasRelease @id as well as its ontology and parcellation volume @id. It needs to contains at least these informations for the atlasRelease Allen Mouse CCFV2 and CCFV3 stocked in the Nexus project bbp/atlas.
--provenance-metadata-path : [path] Path to the Json file containing metadata for the derivation properties as well as the Activity and SoftwareAgent resources.
--resource-tag : [string] Optional tag value with which to tag the resources. (Optional).
--link-regions-path : [path] Optional json file containing link between regions and resources (@ ids of mask and mesh for each brain region) to be extracted by the CLI push-regionsummary. If the file already exists it will be annotated else it will be created. (Optional).

Arguments for push_regionsummary

Create a 'RegionSummary', an 'atlasRelease' and an 'ontology' resource payload to push into Nexus. If the resources already exist in Nexus, they will be fetched and updated instead. This script has been designed to function with metadata json files generated by the Atlas pipeline.

Inputs

--dataset-path : [multiple paths] The files or directories of file to push on Nexus.
--config : [path] Path to the generated dataset configuration file. This is a yaml file containing the paths to the Atlas pipeline generated dataset.
--hierarchy-path : [multiple paths] The path to the json hierarchy file containing an AIBS hierarchy structure.
--hierarchy-jsonld-path : [path] Path to the AIBS hierarchy structure as a JSON-LD file. It is mandatory in case of the creation of a new Ontology resource as it will be attached to it when integrated in the knowledge graph. New Ontology resource is created at the same time a new atlasRelease resource need to be created.
--atlasrelease-config-path : [path] Json file containing the atlasRelease @id as well as its ontology and parcellation volume @id. It needs to contains at least these informations for the atlasRelease Allen Mouse CCFV2 and CCFV3 stocked in the Nexus project bbp/atlas.
--provenance-metadata-path : [path] Path to the Json file containing metadata for the derivation properties as well as the Activity and SoftwareAgent resources.
--resource-tag : [string] Optional tag value with which to tag the resources. (Optional). --link-regions-path : [path] Optional json file containing link between regions and resources (@ ids of mask and mesh for each brain region) to be extracted by the CLI push-regionsummary. If the file already exists it will be annotated else it will be created. (Optional).

Arguments for push_cellrecords

Create a 'CellRecordSeries', an 'atlasRelease' and an 'ontology' resource payload to push into Nexus. If the resources already exist in Nexus, they will be fetched and updated instead. This script has been designed to function with sonata h5 files storing 3D brain cell positions and orientations and generated by the Atlas pipeline.

Inputs

--dataset-path : [multiple paths] The files or directories of file to push on Nexus.
--config : Path to the generated dataset configuration file. This is a yaml file containing the paths to the Atlas pipeline generated dataset.
--hierarchy-path : [multiple paths] The path to the json hierarchy file containing an AIBS hierarchy structure.
--hierarchy-jsonld-path : [path] Path to the AIBS hierarchy structure as a JSON-LD file. It is mandatory in case of the creation of a new Ontology resource as it will be attached to it when integrated in the knowledge graph. New Ontology resource is created at the same time a new atlasRelease resource need to be created.
--atlasrelease-config-path : [path] Json file containing the atlasRelease @id as well as its ontology and parcellation volume @id. It needs to contains at least these informations for the atlasRelease Allen Mouse CCFV2 and CCFV3 stocked in the Nexus project bbp/atlas.
--provenance-metadata-path : [path] Path to the Json file containing metadata for the derivation properties as well as the Activity and SoftwareAgent resources.
--resource-tag : [string] Optional tag value with which to tag the resources. (Optional).

Acknowledgements

The development of this software was supported by funding to the Blue Brain Project, a research center of the École polytechnique fédérale de Lausanne (EPFL), from the Swiss government’s ETH Board of the Swiss Federal Institutes of Technology.

For license and authors, see LICENSE.txt and AUTHORS.txt respectively.

Copyright (c) 2020-2024 Blue Brain Project/EPFL

About

CLIs that take in input Atlas Pipeline datasets and push them into Nexus along with a Resource properties payload

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages