Develop an indexer for fast feedback
Status: We're very optimistic that it does what it's supposed to. David Waterman has integrated this indexer into DIALS. Thomas White has integrated this indexer into CrystFEL.
Luis Barbas PyTorch implementation has been benchmarked extensively here at PSI and shows indexing quality on par with other known indexers and superior speed. This CUDA version has been tested internally by Duan Jiaxin, shows indexing quality on par with XGandalf and superior speed.
Restrictions: Initial cell geometry is required. In other words - this indexer cannot be used for finding a cell from scratch.
Issues:
- The refinement algorithms misbehave with electron diffraction data
- Implemented in CUDA, so only Nvidia GPUs are supported currently.
Luis Barba from the Swiss Data Science Center has done an implementation with PyTorch: https://renkulab.io/projects/lfbarba/toro-indexer-for-serial-crystallography
The CUDA implementation exploits parallelism within the algorithm to a larger extent than the PyTorch version. That's why the PyTorch version relies on batching for speed, where the CUDA version can be fast taking frame by frame separately. The Python code of the PyTorch version is easier to understand and is thus recommended for experimentation with the algorithm.
When referring to the algorithm implemented here, please use:
- DOI: https://doi.org/10.1107/S1600576724003182
- TODO: Add some future paper with quality tests for the CUDA version
A copy of the BibTeX file can be found in this repo with the name BIBTeX.bib.
This is the result of the REDML Project, and other more or less informal collaborations between
- REDML: Filip Leonarski, Duan Jiaxin - PSI MX Group (https://www.psi.ch/en/macromolecular-crystallography)
- REDML: Luis Barba, Benjamin Béjar - Swiss Data Science Center (https://datascience.ch)
- REDML: Piero Gasparotto (formerly), Hans-Christian Stadler, Greta Assmann, Elsa Germann - PSI AWI (https://www.psi.ch/en/awi)
- REDML: Henrique Mendonça - CSCS (https://www.cscs.ch)
- Graeme Winter, Nick Devenish, Richard Gildea - DIALS group at Diamond Light Source (https://dials.github.io/about.html)
- David Waterman - CCP4 (https://www.ccp4.ac.uk)
- Thomas White - DESY (https://www.desy.de/~twhite/crystfel)
- C++17 compatible compiler
- cmake > 3.21 (not so sure if it works with earlier versions as well) or meson >= 1.1
- Eigen3 header only library
- BUILD_FAST_INDEXER needs a compiler compatible CUDA toolkit
- PYTHON_MODULE needs Python3 and NumPy (also see https://cmake.org/cmake/help/latest/module/FindPython3.html)
Cmake should complain and tell you what to add to the cmake commandline if internal dependencies are not met. In general tests require what is tested, executables need the simple data reader and indexer libraries, and the python module needs the indexer library.
Meson should also complain if requested features cannot be built and tell you what to add to the commandline.
If the default value of CMAKE_CUDA_ARCHITECTURES="75;80" is inappropriate for your GPU, set it correctly, or go to the bottom of the README file for the indexer to get more info.
Get hold of the Eigen3 library, either by installing it via your distro (e.g. on Ubuntu: sudo apt install libeigen3-dev
) or download it using git submodule update
. Install python3 and numpy for the python module.
$ FFBIDX_INSTALL_DIR=${HOME}/ffbidx
$ mkdir build
$ cd build
$ # For PSI merlin cluster add
$ # export Eigen3_DIR=<path to eigen installation>
$ cmake -DCMAKE_INSTALL_PREFIX=${FFBIDX_INSTALL_DIR} -DCMAKE_BUILD_TYPE=Release \
-DINSTALL_SIMPLE_DATA_FILES=ON -DTEST_ALL=ON -DTESTS_RPATH=ON \
-DBUILD_SIMPLE_DATA_READER=ON -DPYTHON_MODULE=ON \
-DPYTHON_MODULE_RPATH=ON ..
$ make
$ ctest
or with meson from the repository top directory after adapting these lines (especially the cpu-arch and gpu-arch) to your own needs:
$ FFBIDX_INSTALL_DIR=${HOME}/ffbidx
$ CXX=g++-13 meson setup --reconfigure --buildtype=release --prefix=${FFBIDX_INSTALL_DIR} \
--libdir=lib -D install-simple-data-reader=enabled -D install-simple-data-files=enabled \
-D build-tests=enabled -D build-simple-indexers=enabled -D include-python-api=enabled \
-D python-installation=python3 -D default_library=both \
-D cpu-arch=native -D gpu-arch='arch=compute_89,code=sm_89' meson
$ cd meson
$ meson compile -v
$ meson test
Take note of the CMAKE_INSTALL_PREFIX value above.
$ make install
$ # The following only works for sh compatible shells.
$ # Adapt the script if you're using something else.
$ . ${FFBIDX_INSTALL_DIR}/share/ffbidx/setup-env.sh
for Cmake and with meson, use
$ meson install
$ . ${FFBIDX_INSTALL_DIR}/share/ffbidx/setup-env.sh
If you included the simple indexers and python interface in the build, a quick installation test would be
$ refined_simple_data_indexer \
${FFBIDX_INSTALL_DIR}/share/ffbidx/data/files/image0_local.txt \
300 1 1 $((32*1024)) false ifssr .8 .02 6 15
$ python -c "import ffbidx; print('OK')"
Due to Elsa Germann, we are able to provide spack integration.
Install the official Spack instance
git clone https://github.com/spack/spack.git
source spack/share/spack/setup-env.sh
Tell Spack to find your C++17 compatible compiler
spack compiler find # HPC users should use 'module load gcc/<version>' beforehand
Add the unofficial ffbidx spack recipe
spack repo add fast-feedback-indexer/spack
Install ffbidx and its dependencies
spack install ffbidx cuda_arch=60 +python +simple_data_indexer +simple_data_files +test_all
# Not all options are mandatory
# Change your cuda_arch according to the system you are running on.
Before using the lib in C++ load its run env using:
spack load ffbidx
See: https://git.psi.ch/germann_e/spack-psi
$ micromamba install conda-build
$ conda-build -c conda-forge conda/meta.yaml
$ micromamba install -c local ffbidx
$ python -c "import ffbidx; print('OK')"
The conda-forge package for the fast feedback indexer is called ffbidx, as is the Python module. It can be installed with something like
micromamba install -c conda-forge ffbidx
Make sure your Nvidia driver is compatible with the CUDA version the package was built with.