
Align Mixscape with Seurat’s implementation #710

Merged
13 commits merged into main on Feb 21, 2025

Conversation

Lilly-May
Collaborator

PR Checklist

  • Referenced issue is linked
  • If you've fixed a bug or added code that should be tested, add tests!
  • Documentation in docs is updated

Description of changes

I made the following updates to pt.tl.Mixscape():

  • Added a de_layer parameter to mixscape(), since DEG computation should be based on adata.X, while the rest of the method operates on adata.layers[X_pert], i.e., the perturbation signature. Seurat’s implementation also includes this parameter (see here).
  • Added a test_method parameter to mixscape() and lda() to specify the test used for DEG computation. Seurat uses Wilcoxon by default (see here), while pertpy previously always used a t-test. Hence, users can now choose their preferred method.
  • Added a scale parameter to mixscape(). Seurat scales DEG expression within the respective group by default (see here), so I introduced the same option in pertpy, enabled by default. A hedged usage sketch of these new parameters follows this list.
  • Fixed an issue in the loop that assigns cells to NP and KO. Previously, the loop always used the original labels at the beginning of each iteration instead of updating them based on the previous iteration’s results. Now, it correctly updates the labels until convergence. This was also mentioned in issue Mixscape classification #688.
  • Implemented a CustomGaussianMixture model. mixscape() fits a Gaussian Mixture Model to perturbed and non-perturbed cells, which is then used to assign cells to NP or KO. However, Seurat’s model fixes the mean and standard deviation of the NT distribution (see here), which Scikit-learn’s GaussianMixture does not support. As a result, our implementation previously fit two free distributions instead of only one as in Seurat. To address this, I created a CustomGaussianMixture class that inherits from GaussianMixture and overrides the M-step of the EM algorithm, allowing selected mean and/or covariance values to be kept fixed (a minimal sketch of the idea is included after this list).
  • Updated the Gaussian Mixture Model initialization to align with Seurat’s approach. Seurat’s model allows specifying initial standard deviation values, while Scikit-learn’s implementation expects precisions (the inverse of the variance). I adjusted our initialization so that it now matches Seurat’s behavior.
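
To make the new options concrete, here is a hedged usage sketch assuming the parameter names introduced above (`de_layer`, `test_method`, `scale`); the remaining argument names follow pertpy's existing `mixscape()` API, and the exact signature and defaults in the merged code may differ.

```python
import pertpy as pt

ms = pt.tl.Mixscape()

# `adata` is assumed to be an AnnData object whose perturbation signature has
# already been computed into adata.layers["X_pert"].
ms.mixscape(
    adata=adata,
    labels="gene_target",    # obs column holding the targeted gene per cell
    control="NT",            # label of the non-targeting control cells
    layer="X_pert",          # perturbation signature used for NP/KO classification
    de_layer=None,           # assumed to fall back to adata.X, which DEGs should be based on
    test_method="wilcoxon",  # Seurat's default test; a t-test can still be requested
    scale=True,              # scale DEG expression within each group, as in Seurat
)
```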

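For context on the CustomGaussianMixture bullet above, here is a minimal sketch of the M-step override technique with scikit-learn. The `fixed_means`/`fixed_covariances` arguments are hypothetical names used only for illustration, and the sketch relies on the private helper `_compute_precision_cholesky` from `sklearn.mixture`; it is not the exact class merged in this PR.

```python
from sklearn.mixture import GaussianMixture
from sklearn.mixture._gaussian_mixture import _compute_precision_cholesky


class CustomGaussianMixture(GaussianMixture):
    """GaussianMixture whose M-step keeps selected means/covariances fixed."""

    def __init__(self, fixed_means=None, fixed_covariances=None, **kwargs):
        super().__init__(**kwargs)
        # Per-component values to clamp; entries set to None are re-estimated.
        self.fixed_means = fixed_means
        self.fixed_covariances = fixed_covariances

    def _m_step(self, X, log_resp):
        # Let scikit-learn update weights, means, and covariances as usual ...
        super()._m_step(X, log_resp)
        # ... then reset the components that must stay fixed (e.g. the NT
        # distribution in Mixscape) to their prescribed values.
        if self.fixed_means is not None:
            for k, mean in enumerate(self.fixed_means):
                if mean is not None:
                    self.means_[k] = mean
        if self.fixed_covariances is not None:
            for k, cov in enumerate(self.fixed_covariances):
                if cov is not None:
                    self.covariances_[k] = cov
        # Keep the cached precision Cholesky factors consistent with the
        # (possibly overwritten) covariances.
        self.precisions_cholesky_ = _compute_precision_cholesky(
            self.covariances_, self.covariance_type
        )
```

With two components and a one-dimensional perturbation score, passing e.g. `fixed_means=[nt_mean, None]` and `fixed_covariances=[nt_var, None]` would clamp only the NT component while the perturbed component is re-estimated. Regarding the initialization bullet: since scikit-learn's `precisions_init` expects precisions rather than standard deviations, an initial standard deviation `sd` from Seurat corresponds to a precision of `1 / sd**2`.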
github-actions bot added the bug (Something isn't working) label on Feb 14, 2025
@Zethson (Member) left a comment

Many great improvements! Thank you so much

Lilly-May marked this pull request as ready for review on February 21, 2025, 08:50
Lilly-May merged commit baf9bb2 into main on Feb 21, 2025 (3 of 5 checks passed)
Labels
bug Something isn't working

2 participants