Switch to formulaic-contrasts #682

grst · 2024-12-01T20:50:25Z

PR Checklist

Referenced issue is linked
If you've fixed a bug or added code that should be tested, add tests!
Documentation in docs is updated

Description of changes

Technical details

Additional context

Close #610

Zethson · 2024-12-02T08:45:17Z

@grst I'll fix the unrelated tests soon. Sorry about them + the currently uber slow CI.

Co-authored-by: Emma Dann <[email protected]>

grst · 2024-12-26T13:02:05Z

What's left is to update the differential expression tutorial to use a numeric contrast vector instead of a tuple of strings, i.e.

-res_df = pds2.test_contrasts(["Treatment", "Chemo", "Anti-PD-L1+Chemo"])
+res_df = pds2.test_contrasts(pds2.contrast(column="Treatment", baseline="Chemo", group_to_compare="Anti-PD-L1+Chemo"))

I tried a bit, but rerunning the tutorial with edgeR and rpy2 is a huge rabbit hole.

emdann

LGTM. I've pushed some edits to pass the rpy2/edgeR tests #692

The main thing to work on seems to be the documentation, e.g. is this still accurate?

pertpy/pertpy/tools/_differential_gene_expression/_base.py

Lines 982 to 992 in 298db0f

    
               def test_contrasts(self, contrasts, **kwargs): 
        
                   """ 
        
                   Perform a comparison as specified in a contrast vector. 
        
                   Args: 
        
                       contrasts: Either a numeric contrast vector, or a dictionary of numeric contrast vectors. 
        
                       **kwargs: passed to the respective implementation. 
        
                   Returns: 
        
                       A dataframe with the results. 
        
                   """

Also typing in base functions like test_contrasts and compare_groups would be helpful.

emdann · 2025-01-03T23:37:01Z

I'm gonna have a go at editing the tutorial.

emdann · 2025-01-04T01:15:20Z

One small note: I noticed that the model doesn't complain if you try to specify a complex interaction contrast on a model that wasn't fit with the interaction in the design, but just throws nonsense results.

Following example from the tutorial:

# Exclude patient with progressive disease, or not full rank for interaction
pdata2 = pdata[pdata.obs['Efficacy'] != 'PD'].copy()

# Bad design definition without interaction
pds2 = pt.tl.PyDESeq2(adata=pdata2, design="~ Efficacy + Treatment")
pds2.fit()

interaction_contrast = (
    pds2.cond(Treatment="Chemo", Efficacy="PR") - pds2.cond(Treatment="Chemo", Efficacy="SD")
) - (
    pds2.cond(Treatment="Anti-PD-L1+Chemo", Efficacy="PR") - pds2.cond(Treatment="Anti-PD-L1+Chemo", Efficacy="SD")
)
res_df = pds2.test_contrasts(contrasts=interaction_contrast)

No complaint, but the results are broken:

Log2 fold change & Wald test p-value, contrast vector: [0. 0. 0.]
           baseMean  log2FoldChange  lfcSE  stat  pvalue  padj
A1BG      16.408605             0.0    0.0   NaN     NaN   NaN
A1BG-AS1   1.958737             0.0    0.0   NaN     NaN   NaN
A1CF       0.002053             0.0    0.0   NaN     NaN   NaN
A2M       30.296881             0.0    0.0   NaN     NaN   NaN
A2M-AS1    0.557092             0.0    0.0   NaN     NaN   NaN
...             ...             ...    ...   ...     ...   ...
ZXDC       6.114098             0.0    0.0   NaN     NaN   NaN
ZYG11A     0.093600             0.0    0.0   NaN     NaN   NaN
ZYG11B     3.404941             0.0    0.0   NaN     NaN   NaN
ZYX       77.175203             0.0    0.0   NaN     NaN   NaN
ZZEF1      9.752162             0.0    0.0   NaN     NaN   NaN

While results make sense if I specify the design properly pds2 = pt.tl.PyDESeq2(adata=pdata2, design="~ Efficacy + Treatment + Efficacy*Treatment").

Do we want this to happen? Can we add an informative error if the contrast vector is all zeros?

* fix broken rpy2 edger tests * updated edger tests

Signed-off-by: zethson <[email protected]>

Zethson

<3 looks great!

pertpy/tools/_differential_gene_expression/_edger.py

Signed-off-by: zethson <[email protected]>

Zethson · 2025-01-04T13:04:10Z

@emdann I added type hints as you requested above and updated the submodule to have your tutorial changes as well. With passing CI (I work on it), I am happy to merge this now.

codecov-commenter · 2025-01-04T13:41:09Z

Codecov Report

Attention: Patch coverage is 86.84211% with 5 lines in your changes missing coverage. Please review.

Project coverage is 64.85%. Comparing base (9bba130) to head (a312bd0).
Report is 5 commits behind head on main.

Files with missing lines	Patch %	Lines
...ertpy/tools/_differential_gene_expression/_base.py	80.00%	3 Missing ⚠️
...y/tools/_differential_gene_expression/_pydeseq2.py	75.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #682      +/-   ##
==========================================
- Coverage   65.56%   64.85%   -0.72%     
==========================================
  Files          47       46       -1     
  Lines        6105     5992     -113     
==========================================
- Hits         4003     3886     -117     
- Misses       2102     2106       +4

Files with missing lines	Coverage Δ
...py/tools/_differential_gene_expression/__init__.py	`100.00% <100.00%> (ø)`
...rtpy/tools/_differential_gene_expression/_edger.py	`84.84% <100.00%> (-2.25%)`	⬇️
...ools/_differential_gene_expression/_statsmodels.py	`100.00% <ø> (ø)`
pertpy/tools/_distances/_distances.py	`89.96% <100.00%> (-0.07%)`	⬇️
pertpy/tools/_milo.py	`61.75% <100.00%> (ø)`
pertpy/tools/_mixscape.py	`79.12% <100.00%> (ø)`
...y/tools/_differential_gene_expression/_pydeseq2.py	`92.30% <75.00%> (-1.03%)`	⬇️
...ertpy/tools/_differential_gene_expression/_base.py	`24.84% <80.00%> (-7.49%)`	⬇️

Zethson · 2025-01-04T15:12:01Z

Merging now. Making a release as well. @emdann I'll create an issue for your concern above.

grst added 2 commits December 1, 2024 21:49

Switch to formulaic-contrasts

4e6cffe

Cleanup

60c90c7

grst and others added 4 commits December 26, 2024 09:05

Merge branch 'main' into formulaic-contrasts

967f39e

removing design matrix workaround (#691)

5f9a99c

Co-authored-by: Emma Dann <[email protected]>

Fix PyDESeq2

5c40f06

Update tests

19e753a

grst marked this pull request as ready for review December 26, 2024 12:23

fix typo in gitignore

dafcd2e

grst requested review from Zethson and emdann December 26, 2024 12:25

Remove contrast dataclass, which isnt used anywhere

298db0f

emdann reviewed Jan 3, 2025

View reviewed changes

emdann mentioned this pull request Jan 4, 2025

Updated DE tutorial with examples for formulaic contrasts scverse/pertpy-tutorials#49

Merged

emdann and others added 4 commits January 4, 2025 09:54

Fix edgeR rpy2 tests (#692)

59d815a

* fix broken rpy2 edger tests * updated edger tests

Fix tests (scipy)

898db80

Signed-off-by: zethson <[email protected]>

submodule

71f477c

Signed-off-by: zethson <[email protected]>

Remove unused code

466fe0c

Signed-off-by: zethson <[email protected]>

Zethson approved these changes Jan 4, 2025

View reviewed changes

pertpy/tools/_differential_gene_expression/_edger.py Outdated Show resolved Hide resolved

type hints

a312bd0

Signed-off-by: zethson <[email protected]>

github-actions bot added the chore label Jan 4, 2025

Zethson merged commit e43d8ff into main Jan 4, 2025
5 checks passed

Zethson mentioned this pull request Jan 4, 2025

Model doesn't complain if a complex interaction contrast is specified that wasn't fit with the interaction in the design #693

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch to formulaic-contrasts #682

Switch to formulaic-contrasts #682

grst commented Dec 1, 2024 •

edited

Loading

Zethson commented Dec 2, 2024

grst commented Dec 26, 2024

emdann left a comment

emdann commented Jan 3, 2025

emdann commented Jan 4, 2025

Zethson left a comment

Zethson commented Jan 4, 2025 •

edited

Loading

codecov-commenter commented Jan 4, 2025 •

edited

Loading

Zethson commented Jan 4, 2025

	def test_contrasts(self, contrasts, **kwargs):
	"""
	Perform a comparison as specified in a contrast vector.

	Args:
	contrasts: Either a numeric contrast vector, or a dictionary of numeric contrast vectors.
	**kwargs: passed to the respective implementation.

	Returns:
	A dataframe with the results.
	"""

Switch to formulaic-contrasts #682

Switch to formulaic-contrasts #682

Conversation

grst commented Dec 1, 2024 • edited Loading

Zethson commented Dec 2, 2024

grst commented Dec 26, 2024

emdann left a comment

Choose a reason for hiding this comment

emdann commented Jan 3, 2025

emdann commented Jan 4, 2025

Zethson left a comment

Choose a reason for hiding this comment

Zethson commented Jan 4, 2025 • edited Loading

codecov-commenter commented Jan 4, 2025 • edited Loading

Codecov Report

Zethson commented Jan 4, 2025

grst commented Dec 1, 2024 •

edited

Loading

Zethson commented Jan 4, 2025 •

edited

Loading

codecov-commenter commented Jan 4, 2025 •

edited

Loading