Slowdown after switch to `crypto-bigint` #490

fjarri · 2025-03-05T22:49:49Z

Benchmarks slowed down several times after #394. What could be the reason?

Performance improvements for boxed uints (RustCrypto/crypto-bigint#777) and the corresponding changes in crypto-primes (entropyxyz/crypto-primes#78) don't really change the results much according to my tests.

sign test time is split ~50/50 between Montgomery exponentiation (with all the time spent in almost_montgomery_mul(), the lowest level function) and BoxedUint::inv_mod(); decrypt is almost exclusively exponentiation. Both of these are the calls in RSA itself; crypto-primes calls take negligible time.

So it seems that either these two functions are somehow much slower than num-bigint (possible, yet unlikely, and straightforward to test), or somehow #394 changed the algorithm to apply them more times than necessary.

The text was updated successfully, but these errors were encountered:

fjarri · 2025-03-05T23:44:36Z

So one difference that I see is that modpow() (and Montgomery multiplication in general) in num-bigint-dig was not, in fact, constant-time. But I would not expect just that account for such a huge difference.

tarcieri · 2025-03-05T23:53:07Z

Yeah, that's one difference: num_bigint::BigUint has a normalize function which automatically strips leading zeros which is called all over the place.

However, that could also be a clue to the slowdown, namely there may be places where fixed-but-dynamic precision BoxedUints are using a larger size/precision than necessary for a given key size, causing the number of iterations looping through limbs to be much larger than num-bigint (i.e. because they're larger than necessary).

That's my best guess anyway. I'm not sure to sleuth out exactly where that might be happening other than tediously going through line-by-line and comparing the implementations or otherwise looking for unnecessary leading zeros (via e.g. dbg! inspection)

dignifiedquire · 2025-03-05T23:53:25Z

The difference comes from the fact that we now do a lot of operation over numbers twice as large as they need to be, at least that is what I remember when I last checked.

if each prime has x bits, we end up doing operations on numbers of the size of 2x bits in a bunch of places, when often times something much smaller would suffice.

fjarri · 2025-03-06T00:04:05Z

Yep, I just reached that conclusion as well. There are several instances of exponentiation modulo p or q in rsa_decrypt() with the moduli kept in a BoxedUint the size of p * q. That still leaves the inversion, but it's probably the same problem there.

fjarri · 2025-03-06T00:08:57Z

Btw if anyone wants to deal with this issue, feel free, I don't think I will have enough free time in the coming weeks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Slowdown after switch to `crypto-bigint` #490

Slowdown after switch to `crypto-bigint` #490

fjarri commented Mar 5, 2025 •

edited

Loading

fjarri commented Mar 5, 2025

tarcieri commented Mar 5, 2025 •

edited

Loading

dignifiedquire commented Mar 5, 2025

fjarri commented Mar 6, 2025

fjarri commented Mar 6, 2025

Slowdown after switch to crypto-bigint #490

Slowdown after switch to crypto-bigint #490

Comments

fjarri commented Mar 5, 2025 • edited Loading

fjarri commented Mar 5, 2025

tarcieri commented Mar 5, 2025 • edited Loading

dignifiedquire commented Mar 5, 2025

fjarri commented Mar 6, 2025

fjarri commented Mar 6, 2025

Slowdown after switch to `crypto-bigint` #490

Slowdown after switch to `crypto-bigint` #490

fjarri commented Mar 5, 2025 •

edited

Loading

tarcieri commented Mar 5, 2025 •

edited

Loading