
Improve image combination performance #741

Merged
merged 17 commits into astropy:main on May 24, 2021

Conversation

mwcraig
Member

@mwcraig mwcraig commented Jul 24, 2020

This pull request attempts to improve the performance of ccdproc by using numpy's nan* functions instead of numpy MaskedArray, and by using bottleneck. It is not intended (yet) to change the API for Combiner.

So far I have only switched average_combine to do this. If I get a couple of 👍 on this, I'll do the same for median_combine and the clipping routines, and also combine the implementations of sum_combine and average_combine to the extent possible.

To do:

  • Document weighting during image combination
  • Add link to preliminary benchmarking
  • Unify sum and average combination
  • Use nan* or bottleneck for clipping
  • Changelog entry
  • make bottleneck an optional dependency
  • Add prominent suggestion to use bottleneck
  • Consider making dtypes settable (might belong in CCDData) -- this is a separate issue, really, and CCDData lives in astropy core, not here.

Edit: Fixes #719
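The nan*-based approach described above can be sketched roughly as follows. This is a hypothetical simplification for illustration, not ccdproc's actual Combiner code: masked pixels are replaced with NaN and the combination uses np.nanmean/np.nansum instead of building a MaskedArray.

```python
import numpy as np

def nan_average_combine(arrays, masks=None, weights=None):
    """Average-combine a stack of 2-D images, treating masked
    pixels as NaN instead of using numpy MaskedArray."""
    data = np.array(arrays, dtype=np.float64)
    if masks is not None:
        data[np.array(masks, dtype=bool)] = np.nan
    if weights is None:
        return np.nanmean(data, axis=0)
    # Weighted mean: NaN terms drop out of the numerator, and the
    # denominator only counts weights of unmasked pixels.
    w = np.asarray(weights, dtype=np.float64)[:, None, None]
    valid = ~np.isnan(data)
    num = np.nansum(data * w, axis=0)
    den = np.sum(w * valid, axis=0)
    return num / den

imgs = [np.full((2, 2), v) for v in (1.0, 2.0, 3.0)]
masks = [np.zeros((2, 2), bool) for _ in imgs]
masks[2][0, 0] = True  # mask one pixel in the third image
result = nan_average_combine(imgs, masks)
```

Here `result[0, 0]` averages only the two unmasked values (1.5), while unmasked pixels average all three images (2.0).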

@mwcraig mwcraig marked this pull request as draft July 24, 2020 14:14
@mwcraig mwcraig added this to the 2.2 milestone Jul 24, 2020
@codecov

codecov bot commented Jul 24, 2020

Codecov Report

Merging #741 (3ef82d9) into main (bb9c667) will decrease coverage by 1.08%.
The diff coverage is 99.25%.

❗ Current head 3ef82d9 differs from pull request most recent head 50abe7c. Consider uploading reports for the commit 50abe7c to get more accurate results
Impacted file tree graph

@@            Coverage Diff             @@
##             main     #741      +/-   ##
==========================================
- Coverage   96.01%   94.93%   -1.09%     
==========================================
  Files          30       30              
  Lines        3942     4049     +107     
==========================================
+ Hits         3785     3844      +59     
- Misses        157      205      +48     
Impacted Files Coverage Δ
ccdproc/combiner.py 94.50% <98.33%> (+0.47%) ⬆️
ccdproc/core.py 97.24% <100.00%> (ø)
ccdproc/tests/pytest_fixtures.py 86.66% <100.00%> (ø)
ccdproc/tests/test_combiner.py 100.00% <100.00%> (ø)
ccdproc/tests/test_memory_use.py 89.65% <100.00%> (+0.36%) ⬆️
ccdproc/tests/run_with_file_number_limit.py 24.21% <0.00%> (-49.48%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update bb9c667...50abe7c. Read the comment docs.

@mwcraig
Member Author

mwcraig commented Jul 24, 2020

Performance improvements (no clipping done)

Some benchmark plots are below. The left-most commit is master and the right-most uses bottleneck; the second-to-last uses nan* instead of masked arrays.

average_combine with weighting: Speedup is ~ x2

(plot: average-combine-weighted)

Unweighted average_combine, some entries masked. Speedup is ~ x3

(plot: average-no-weights-some-masked)

Unweighted average_combine, no entries masked. Speedup is ~ x2.8

(plot: average-no-weights-no-mask)

@mwcraig
Member Author

mwcraig commented Jul 24, 2020

Ping @crawfordsm @MSeifert04 @saimn @cmccully @ysBach -- you are either a ccdproc maintainer or someone who is working hard, separately from this, to improve image combination speed. Keep an eye out for an email invite later today to talk about whether we can pool efforts on this or not.

I don't need a detailed review of this -- but if you have objections to this stop-gap approach for improving performance, please speak up!

@saimn

saimn commented Jul 24, 2020

Great to see progress on this. I started to do some comparison with what we have in DRAGONS, but it is still a work in progress.

About bottleneck, since I had the opportunity to look at it more closely recently (astropy/astropy#10553 (comment)):

  • It only has optimized algorithms for little-endian data; for big-endian data (e.g. FITS) it falls back to the np.nan* functions.
  • And currently the fallback to numpy comes with a pretty bad memory leak.

Other than that, 👍 to using the nan* functions instead of masked arrays; they are much faster.
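The equivalence behind that suggestion can be checked directly: filling masked pixels with NaN and calling np.nanmean gives the same answer as a MaskedArray mean, while avoiding the MaskedArray overhead. A small sketch:

```python
import numpy as np

# Compare the two approaches: a MaskedArray mean versus filling
# masked pixels with NaN and calling np.nanmean.
rng = np.random.default_rng(0)
stack = rng.normal(size=(10, 4, 4))
mask = rng.random(stack.shape) > 0.8  # mask ~20% of pixels

masked_mean = np.ma.masked_array(stack, mask=mask).mean(axis=0).filled(np.nan)

filled = stack.copy()
filled[mask] = np.nan
nan_mean = np.nanmean(filled, axis=0)

assert np.allclose(masked_mean, nan_mean, equal_nan=True)
```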

@saimn

saimn commented Jul 24, 2020

Looking at the code in more detail, it seems that ccdproc converts the data to np.float64 by default, unless a dtype is specified. So the good news is that it will not be affected by the bottleneck issue. But in terms of performance it would be faster to use the original data's dtype.
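A sketch of that point: FITS data arrives big-endian, and bottleneck's fast paths only handle native (little-endian, on common hardware) dtypes. An unconditional float64 conversion sidesteps the issue, but a cheaper option would be a native-byte-order copy at the original precision. The `>f4` array here stands in for data freshly read from a FITS file:

```python
import numpy as np

# Big-endian float32, as astropy.io.fits typically returns it.
big_endian = np.ones((4, 4), dtype='>f4')

# Native-order copy at the same 4-byte precision, rather than an
# unconditional upcast to float64.
native = big_endian.astype(big_endian.dtype.newbyteorder('='))

assert native.dtype.isnative
assert native.itemsize == 4  # still float32, not promoted to float64
```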

@mwcraig
Member Author

mwcraig commented Jul 30, 2020

One other bottleneck note: there are some precision issues with sums of float32: pydata/bottleneck#193
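The issue can be reproduced without bottleneck itself: bottleneck sums float32 data with a naive running float32 accumulator, while numpy's sum uses pairwise summation. np.cumsum is used below to emulate the sequential left-to-right accumulation:

```python
import numpy as np

values = np.full(1_000_000, 0.1, dtype=np.float32)
exact = 1_000_000 * float(values[0])  # true sum of the stored values

pairwise = float(np.sum(values))       # numpy: pairwise summation
naive = float(np.cumsum(values)[-1])   # sequential float32 accumulation

# The sequential float32 sum drifts far from the true value;
# the pairwise sum stays close.
print(exact, pairwise, naive)
```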

@mwcraig mwcraig force-pushed the refactor-average-final branch from 08ef65b to 698edd1 Compare September 28, 2020 01:23
@mwcraig mwcraig closed this Nov 30, 2020
@mwcraig mwcraig reopened this Nov 30, 2020
@mwcraig mwcraig force-pushed the refactor-average-final branch 3 times, most recently from 388cbbd to 7f0c9cc Compare December 2, 2020 14:16
@mwcraig mwcraig force-pushed the refactor-average-final branch from f329196 to f942355 Compare February 10, 2021 14:11
Base automatically changed from master to main March 16, 2021 13:40
@mwcraig mwcraig marked this pull request as ready for review March 19, 2021 20:01
@mwcraig mwcraig marked this pull request as draft March 19, 2021 20:01
@mwcraig mwcraig force-pushed the refactor-average-final branch 3 times, most recently from 3560821 to dcafaab Compare March 22, 2021 14:12
@mwcraig mwcraig marked this pull request as ready for review May 19, 2021 00:30
@mwcraig mwcraig closed this May 19, 2021
@mwcraig mwcraig reopened this May 19, 2021
@mwcraig mwcraig force-pushed the refactor-average-final branch 3 times, most recently from a8f395f to d581703 Compare May 19, 2021 00:53
@mwcraig
Member Author

mwcraig commented May 19, 2021

@saimn @ysBach -- any chance either of you can take a look at this? It has become a little convoluted because I'm trying to make sure I don't change the API. This is only a small step towards improving performance, but it is a step...

For median, the only improvement comes when bottleneck is installed, because np.nanmedian just calls np.ma.median after setting up a mask internally.

Once this is wrapped up (hopefully this week) I'd like to come back to further improvements, likely the second week of June.
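Treating bottleneck as an optional dependency, as the to-do list describes, amounts to a conditional dispatch: use bn.nanmedian when it imports, otherwise fall back to np.nanmedian. A minimal sketch (the function name is illustrative, not ccdproc's API):

```python
import numpy as np

try:
    import bottleneck as bn
    _nanmedian = bn.nanmedian
except ImportError:
    _nanmedian = np.nanmedian

def median_combine(stack):
    """Median-combine a stack of images along the first axis,
    ignoring NaN (i.e. masked) pixels."""
    return _nanmedian(np.asarray(stack, dtype=np.float64), axis=0)

stack = np.array([[[1.0, 2.0]], [[3.0, np.nan]], [[5.0, 6.0]]])
combined = median_combine(stack)  # NaNs excluded pixel by pixel
```

The fallback gives identical results either way; bottleneck only changes the speed.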

@mwcraig mwcraig force-pushed the refactor-average-final branch 2 times, most recently from 3163156 to ee57383 Compare May 22, 2021 23:47
mwcraig added 14 commits May 23, 2021 21:39
Yes, I changed one test to get this to pass. Note, though, that the test was of the return value of a completely masked result.

Also fix a couple of small sphinx-related issues.

Includes using bottleneck for performance when it is available.

Implement weighted sum and test weighted sum. This includes factoring out the guts of the weighted sum for use in a couple of combination methods. Putting it all in one place is the current practice.

The worry is that users may be passing in functions that expect masked data and we don't want to break that. It would be an API change that requires a new major release.
@mwcraig mwcraig force-pushed the refactor-average-final branch from ee57383 to 29df104 Compare May 24, 2021 02:40
@mwcraig mwcraig merged commit ac8fcfb into astropy:main May 24, 2021
@saimn

saimn commented May 25, 2021

Coming a bit late, but I don't see any issue (and my knowledge of ccdproc's combine code is limited). 🎉

Successfully merging this pull request may close these issues.

combine uncertainty calculated on unscaled data