Skip to content

Code for "Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history"

Notifications You must be signed in to change notification settings

PessoaP/FC-inference

Repository files navigation

Code for "Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history"

This repository contains the code associated with the manuscript titled: "Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history"

In this work, we introduce a neural network–based inference framework that estimates likelihoods from simulation data, enabling inference in biological models that are non-Markovian or otherwise have intractable likelihoods.

You can find the preprint on arXiv: https://arxiv.org/abs/2506.09374


Directory Structure

  • model1_DCD/ – Deterministic Cell Division (Model 1)
  • model2_SCD/ – Stochastic Cell Division (Model 2)
  • model3_FCYeast/ – Fluorescence-based flow cytometry inference (Model 3)
  • real/ – Inference from experimental S. cerevisiae data

How to Run

Models 1 and 2

To reproduce the results for Model 1 (Figure 3 in the manuscript), run:

cd model1_DCD
python 1DCD_training.py

Then run the Jupyter notebook 2DCD_figure.ipynb which generates Fig3 of the manuscript.

Analogously for Model 2 run the rescpective files in the model2_SCD1 directory

Real Data Preprocessing

To run inference on experimental flow cytometry data, place the data file found here into the folder clean_data/ then run the script:

cd real
python clean_data.py

This will clean and preprocess the raw flow cytometry data for use in the inference in the real model and the autofluorescence calibration (in both Model 3 and real data)

Models 3 and Real

In both the model3_FCYeast/ and real/ directories, a bash.sh script is provided to automate the workflow.
This script prepares the data, trains the necessary normalizing flows, and runs MCMC inference.

To execute, navigate into the respective folder and run:

cd model3_FCYeast
bash bash.sh

cd real
bash bash.sh

These will generate the results and trained models used to produce Figures 5 and 6 of the manuscript. The corresponding Jupyter notebooks in each directory can then be used to visualize and export the figures.

Requirements

This codebase builds on the normflows library.
Please make sure it is installed before running the scripts.


Any questions?

If you encounter any issues or have questions about the code or manuscript, feel free to open an issue or reach out to the authors via the contact information provided in the paper.


Citation

If you find this work useful, we appreciate the citation. Here's the BibTeX:

@misc{pessoa2025simulation,
      title={Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history}, 
      author={Pedro Pessoa and Juan Andres Martinez and Vincent Vandenbroucke and Frank Delvigne and Steve Pressé},
      year={2025},
      eprint={2506.09374},
      archivePrefix={arXiv},
      url={https://arxiv.org/abs/2506.09374}, 
}

About

Code for "Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published