[Math] Fix NaN from powf() domain error. #2067

stolk · 2024-09-30T08:00:00Z

This change prevents calling powf(x,y) with negative x.

The SIMD versions using ssePower() already seem to be resistent to negative pixel values, but the scalar version was not.

This would cause a SIGFPE on apps that have floating point exceptions enabled.

FIXES: #2066

linux-foundation-easycla · 2024-09-30T08:00:05Z

The committers listed above are authorized under a signed CLA.

✅ login: stolk / name: Bram Stolk (3c3e867, 6eff96c)

doug-walker

Thanks Bram, good catch. What do you think about moving the std::pow call down into the ternary operator, to avoid adding the extra std::max call?

In addition, I think this would fix the unit test errors. (You're clamping the value but the ternary expression needs the unclamped value.)

stolk · 2024-09-30T17:49:53Z

Thanks Bram, good catch. What do you think about moving the std::pow call down into the ternary operator, to avoid adding the extra std::max call?

Hi Doug,

I advise against the ternary operator...

I checked, and the compilers (gcc, clang) actually produce more efficient code with std::max than with ?:

See this godbolt output.

stolk · 2024-09-30T17:52:13Z

...not authorized...

I expected having a CLA in place for OpenImageIO with ASWF, would transfer over to OpenColorIO?

stolk · 2024-09-30T17:56:50Z

In addition, I think this would fix the unit test errors. (You're clamping the value but the ternary expression needs the unclamped value.)

Oops, did I break the unit test? I'll revisit.

doug-walker · 2024-09-30T17:59:11Z

I was not proposing to replace the max with a new ternary. I was suggesting moving the pow call down into the existing ternary. As I mentioned, your unit tests are failing, so some action is needed.

The CLAs for all ASWF projects are managed through the same account but need to be signed separately for each project.

stolk · 2024-09-30T18:06:03Z

I was not proposing to replace the max with a new ternary. I was suggesting moving the pow call down into the existing ternary. As I mentioned, your unit tests are failing, so some action is needed.

Thanks Doug, I will redo this PR.

Strangely, the unmodified main branch fails the unit test as well. I have to find out what is going on here.

$ make test
Running tests...
Test project /home/stolk/src/OpenColorIO/build
    Start 1: test_utils_exec
1/6 Test #1: test_utils_exec ..................   Passed    0.01 sec
    Start 2: test_cpu
2/6 Test #2: test_cpu .........................***Failed    5.78 sec
    Start 3: test_cpu_no_accel
3/6 Test #3: test_cpu_no_accel ................***Failed    5.41 sec
    Start 4: test_cpu_avx512
4/6 Test #4: test_cpu_avx512 ..................***Failed    5.99 sec
    Start 5: test_gpu
5/6 Test #5: test_gpu .........................***Failed    8.83 sec
    Start 6: test_python
6/6 Test #6: test_python ......................   Passed    1.38 sec

33% tests passed, 4 tests failed out of 6

Does the main branch HEAD pass the unit tests for you? Thanks.

doug-walker · 2024-09-30T18:08:14Z

All the tests should pass, except for the GPU tests on some mac platforms currently.

This change prevents calling powf(x,y) with negative x. The SIMD versions using ssePower() already seem to be resistent to negative pixel values, but the scalar version was not. This would cause a SIGFPE on apps that have floating point exceptions enabled. FIXES: AcademySoftwareFoundation#2066 Signed-off-by: Bram Stolk <[email protected]>

stolk · 2024-09-30T18:33:13Z

I have changed this PR to be less invasive, and only correct the value passed to powf() and not the entire pixel value as used in the rest of the function.

Note: I cannot get the unit test to pass on my machine, even when using HEAD of main. Pushed the PR to see if CI will pass.

KevinJW · 2024-10-02T10:54:24Z

just to note, micro optimisation difference comes from the testing vs return order, if you flip the ternary around you can get the same code ... https://godbolt.org/z/Wzafa3nsb

stolk · 2024-10-10T20:49:11Z

Can a maintainer please approve the CI run, and then merge? Thanks.

doug-walker

Hi Bram, what do you think about changing this:

out[0] = pixel[0]<=red[3] ? pixel[0] * red[4] : data[0];

to this:

out[0] = pixel[0]<=red[3] ? pixel[0] * red[4] :
std::pow(pixel[0] * red[0] + red[1], red[2]);

That way it avoids the extra branch.

stolk · 2024-10-11T16:54:25Z

Hi Bram, what do you think about changing this:

out[0] = pixel[0]<=red[3] ? pixel[0] * red[4] : data[0];

to this:

out[0] = pixel[0]<=red[3] ? pixel[0] * red[4] : std::pow(pixel[0] * red[0] + red[1], red[2]);

That way it avoids the extra branch.

No, that does not address the issue.

We should not be feeding negative numbers to the base of a powf().

My change makes sure of that, your version does not.

doug-walker · 2024-10-18T02:12:10Z

Hi Bram,

Is the reason that you don't like my proposal because you are worried that pixel[0] * red[0] + red[1] will be negative? In practice, that is never the case. In theory, someone may be able to set parameter values that would place the breakpoint below zero, but if that is the edge case you are worried about, it would be better dealt with in the GammaOpData.cpp:GammaOpData::validateParameters function rather than inside the performance critical pixel loop.

stolk mentioned this pull request Sep 30, 2024

[test] Add switch to imageinout_test for enabling floating point exceptions. AcademySoftwareFoundation/OpenImageIO#4463

Merged

doug-walker reviewed Sep 30, 2024

View reviewed changes

stolk force-pushed the fix_fpe branch from 03d245d to 3c3e867 Compare September 30, 2024 18:29

Merge branch 'main' into fix_fpe

6eff96c

doug-walker reviewed Oct 11, 2024

View reviewed changes

doug-walker requested a review from cozdas October 11, 2024 19:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Math] Fix NaN from powf() domain error. #2067

[Math] Fix NaN from powf() domain error. #2067

stolk commented Sep 30, 2024

linux-foundation-easycla bot commented Sep 30, 2024 •

edited

Loading

doug-walker left a comment •

edited

Loading

stolk commented Sep 30, 2024

stolk commented Sep 30, 2024

stolk commented Sep 30, 2024

doug-walker commented Sep 30, 2024

stolk commented Sep 30, 2024

doug-walker commented Sep 30, 2024

stolk commented Sep 30, 2024

KevinJW commented Oct 2, 2024

stolk commented Oct 10, 2024

doug-walker left a comment

stolk commented Oct 11, 2024

doug-walker commented Oct 18, 2024

[Math] Fix NaN from powf() domain error. #2067

Are you sure you want to change the base?

[Math] Fix NaN from powf() domain error. #2067

Conversation

stolk commented Sep 30, 2024

linux-foundation-easycla bot commented Sep 30, 2024 • edited Loading

doug-walker left a comment • edited Loading

Choose a reason for hiding this comment

stolk commented Sep 30, 2024

stolk commented Sep 30, 2024

stolk commented Sep 30, 2024

doug-walker commented Sep 30, 2024

stolk commented Sep 30, 2024

doug-walker commented Sep 30, 2024

stolk commented Sep 30, 2024

KevinJW commented Oct 2, 2024

stolk commented Oct 10, 2024

doug-walker left a comment

Choose a reason for hiding this comment

stolk commented Oct 11, 2024

doug-walker commented Oct 18, 2024

linux-foundation-easycla bot commented Sep 30, 2024 •

edited

Loading

doug-walker left a comment •

edited

Loading