
Enable parallel big-M calculation for gdp.mbigm transformation #3641

Open

wants to merge 24 commits into main

Conversation

@sadavis1 (Contributor) commented Jun 23, 2025

Fixes # (n/a)

Summary/Motivation:

The multiple big-M transformation tends to slow down as model size increases because the number of required subsolver runs grows linearly or quadratically. This change parallelizes the M calculation using Python's multiprocessing module. The threading module was tried first but, due to previously discussed issues, was replaced with multiprocessing.
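At a high level the change looks something like the following minimal sketch; the function and job names here are hypothetical, not the PR's actual code:

import multiprocessing as mp

def _compute_M(constraint_name, disjunct_name):
    # In the real transformation each job solves a small subproblem with the
    # configured subsolver; here it just returns a placeholder value.
    return (constraint_name, disjunct_name, 0.0)

def calculate_Ms(jobs, threads):
    if threads in (0, 1):
        # Single-threaded fallback: no pool, no pickling requirement.
        return [_compute_M(*job) for job in jobs]
    ctx = mp.get_context('spawn')  # the start method is configurable (see below)
    with ctx.Pool(processes=threads) as pool:
        # starmap unpacks each (constraint, disjunct) job tuple into arguments
        return pool.starmap(_compute_M, jobs)

if __name__ == '__main__':
    print(calculate_Ms([('c1', 'd1'), ('c1', 'd2')], threads=2))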

Changes proposed in this PR:

  • Rework the flow of the gdp.mbigm transformation: calculate all M values at once using a multiprocessing.Pool.starmap call.
  • Switch the gdp.mbigm big-M calculation from using the primal bound to using the dual bound, since the primal bound is not mathematically correct in the presence of numerical error (whereas any dual bound is valid).
  • Add configuration option 'use_primal_bound' to re-enable the old behavior, in case a solver that cannot provide a dual bound (such as ipopt) is used.
  • Add configuration option 'threads' to control the number of multiprocessing workers (see the usage sketch after this list):
    • By default, use os.cpu_count() - 1 workers. Note that this default is potentially harmful; for example, it may consume more Gurobi license tokens at once, and on Windows the model must now be pickleable (see below).
    • When set to 0 or 1, do not use multiprocessing and revert to fully single-threaded operation.
  • Add configuration option 'process_spawn_mechanism' to determine how worker processes are spawned (a sketch of the default selection follows this list):
    • Each start method described in the Python docs is available.
    • The default is chosen dynamically: 'spawn' on Windows, 'fork' on Unix, unless we can detect that multiple threads are running, in which case we use 'forkserver' instead.
    • When using 'spawn' or 'forkserver', models must be pickled in order to hand them to the worker processes. We depend on dill in this case, since models often contain nested functions and, in my testing, do not reliably pickle without it. I think this code leads to a nested pickle, but I'm not sure whether anything can be done about that.
  • Fix a bug in bigm_mixin.py (name 'logger' used without being defined)
  • contrib/solver/factory.py: Set the name class attribute of the LegacySolverWrapper-derived class to the solver's legacy_name. This is so that code like this:
from pyomo.environ import SolverFactory
solver = SolverFactory('gurobi_direct_v2')
solver_effective_copy = SolverFactory(solver.name, options=solver.options)

will behave as intended (as solvers do not reliably pickle even with dill). This does not affect code that gets the original contrib Solver directly from the pyomo.contrib.solver.common.factory.SolverFactory.

  • Add a check that we are not being passed a contrib Solver object unless it is a LegacySolverWrapper (such an object does have a .solve attribute, but the code will choke later in several places if we do not reject it).
  • Add tests to ensure that big-M calculation functions properly with the different start methods. These are essentially copy-pastes of test_calculated_Ms_correct with parameters to the solve() call altered.
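For reference, here is a hedged usage sketch of the new options on a toy GDP model. The option names come from the list above, but the exact keyword spelling, the defaults noted in comments, and the 'solver' argument are assumptions rather than code copied from this PR:

from pyomo.environ import (
    ConcreteModel, Objective, SolverFactory, TransformationFactory, Var
)
from pyomo.gdp import Disjunction

m = ConcreteModel()
m.x = Var(bounds=(0, 10))
m.disjunction = Disjunction(expr=[[m.x <= 2], [m.x >= 8]])
m.obj = Objective(expr=m.x)

TransformationFactory('gdp.mbigm').apply_to(
    m,
    solver=SolverFactory('gurobi'),        # subsolver used for the M subproblems
    threads=4,                             # default: os.cpu_count() - 1
    use_primal_bound=False,                # default in this PR: use the dual bound
    process_spawn_mechanism='forkserver',  # or 'spawn' / 'fork'
)

Setting threads to 0 or 1 skips the pool entirely, which also sidesteps the pickling requirement on Windows.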
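And a small sketch of the dynamic default start-method selection described above; the helper name is hypothetical and the thread-count check is just one plausible way to detect a multithreaded process:

import sys
import threading

def _default_start_method():
    # 'spawn' is the only option that behaves reliably on Windows.
    if sys.platform.startswith('win'):
        return 'spawn'
    # fork() in an already-multithreaded process is unsafe, so prefer
    # 'forkserver' when we can see more than one live thread.
    if threading.active_count() > 1:
        return 'forkserver'
    return 'fork'

print(_default_start_method())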

Since this is a performance change, I ran a test on the medium-sized instance gdp_col from GDPlib, using baron as the subsolver.
(Figure: test_gdp_col_fork, transformation time vs. number of workers using the 'fork' start method)
It looks roughly like f(x) = 1/x, if you squint in such a way that you cannot see the bottom of the chart. This instance transformed in 145 seconds on the current main branch, so this is not a regression in the single-threaded case. Naturally, things are slightly slower when using 'spawn', but I do at least make sure we only pickle the model once per worker (and hopefully only once total? I'm not sure how multiprocessing works on the inside, but it really should cache these).

I also tested the small instance jobshop, to ensure nothing horrible happened.
(Figure: test_jobshop_fork, transformation time for the small jobshop instance)
On the current main branch, this model transforms in 0.36 seconds so again there is no regression.

Finally, there seems to be a bug when using this transformation with gurobi_direct v1. It works fine with the other solvers I've tried, so I suspect it's a bug in that interface, but I haven't tracked it down yet. This combination also has errors on the current main branch, but they're different errors, so it's hard to know if I've changed anything in that regard.

Legal Acknowledgement

By contributing to this software project, I have read the contribution guide and agree to the following terms and conditions for my contribution:

  1. I agree my contributions are submitted under the BSD license.
  2. I represent I am authorized to make the contributions and grant the license. If my employer has rights to intellectual property that includes these contributions, I represent that I have received permission to make contributions and grant the required license on behalf of that employer.

@emma58 self-requested a review June 23, 2025 19:26
@@ -23,7 +27,7 @@ def _convert_M_to_tuple(M, constraint, disjunct=None):
         else:
             try:
                 M = (-M, M)
-            except:
+            except Exception:
Contributor:
You touched this line so now I get to snark. It's good practice, if catching a general exception, to raise the original exception as well so folks can directly inspect it.

Contributor Author:
Don't we already do this? There is a raise statement after we log.

        (transBlock, algebraic_constraint) = self._setup_transform_disjunctionData(
            obj, root_disjunct
        )

    def _transform_disjunctionDatas(
Contributor:
Is it possible to break this up? This function is huge.

Contributor Author:
It's even worse than it looks, because _setup_jobs_for_disjunction is basically just the inner body of this function, transposed out so it would be less offensively indented (notice it mutates something like four of its parameters). The problem is that the whole transformation is basically a big ball of state until it's done, and I don't know whether it can really avoid being that way. Emma's version was a lot nicer because it handled the disjunctions one by one, but I can't do that if I want to use threads effectively.

All that said, I will look and see if I can separate any more of this out in a reasonably clean way.

Contributor Author:

I moved the multiprocessing pool setup into its own instance method, which slightly improves the situation.

@sadavis1 (Contributor Author):
After discussion today, I have reverted the change to the LegacySolverWrapper class names and switched from sending the solver name to sending the solver class in order to recreate the solver. This assumes that all solver classes (besides contrib solvers without the wrapper, which we reject for other reasons) can be correctly constructed with the single named argument options.

Also, using this on Windows now depends on dill even more, because solvers can be instances of nested classes, so that's another dill.dumps(). I went ahead and completed the trio by dill-ing the options parameter too -- who knows, maybe there's a way to pass a nested function into one.
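To illustrate why dill is needed at all (an assumed minimal example, not code from this PR): the standard pickle module rejects nested functions and classes, while dill serializes them by value:

import pickle
import dill

def make_rule(bound):
    # Nested functions like this are common as constraint rules on models.
    def rule(x):
        return x <= bound
    return rule

rule = make_rule(10)

try:
    pickle.dumps(rule)
except (AttributeError, pickle.PicklingError) as err:
    print(f"pickle fails: {err}")

restored = dill.loads(dill.dumps(rule))  # dill captures the closure by value
print(restored(7))  # True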
