Conversation
> ⚠️ Caution: Review failed. The pull request is closed.

📝 Walkthrough

Added two public `Config` options to `NaiveMPIWorkDistributor` (`use_immediate_recv`, `max_result_size`), replaced direct `MPI_Probe`/`MPI_Recv` calls with `MPICommunicator` `probe`/`recv_any` helpers, split the receive flow into immediate and probe-based paths with `process_result_message`, updated tests, and made the Hierarchical distributor `Config` public.
Sequence Diagram

```mermaid
sequenceDiagram
    participant Master as MasterDistributor
    participant Comm as MPICommunicator
    participant Worker as Worker
    Note left of Master: Receive loop (immediate or probe-based)
    Master->>Comm: recv_any(buffer) or probe()
    alt Immediate receive (use_immediate_recv = true)
        Comm-->>Master: STATUS (tag=RESULT / REQUEST)
        alt RESULT
            Master->>Master: process_result_message(status, buffer)
            Master->>Master: resize/store ResultT, update indices
            Master->>Master: push worker_id (status.source) to free queue
        else REQUEST
            Master->>Master: prepare task for worker
            Master->>Worker: send task (via Comm)
        end
    else Probe-based receive (use_immediate_recv = false)
        Master->>Comm: probe()
        Comm-->>Master: STATUS
        alt STATUS.tag == RESULT
            Master->>Comm: recv(buffer from status.source)
            Master->>Master: process_result_message(status, buffer)
            Master->>Master: push worker_id to free queue
        else STATUS.tag == REQUEST
            Master->>Comm: recv_empty<T>(status.source)
            Master->>Master: assign/send task to worker
        end
    end
```

Estimated code review effort: 🎯 4 (Complex), ⏱️ ~45 minutes
Codecov Report

✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff             @@
##              main       #7      +/-  ##
===========================================
- Coverage   100.00%   99.48%   -0.52%
===========================================
  Files           13       13
  Lines          568      584      +16
  Branches        66       63       -3
===========================================
+ Hits           568      581      +13
- Misses           0        3       +3
```
Pull request overview
This pull request adds an immediate receive mode to the `NaiveMPIWorkDistributor` class. Instead of using the probe-then-receive pattern (which requires two MPI calls per message), immediate receive mode uses a single `recv_any` call with a predetermined buffer size, potentially improving performance by reducing MPI call overhead.
Changes:
- Added `use_immediate_recv` and `max_result_size` configuration options to `NaiveMPIWorkDistributor`
- Implemented immediate receive logic with separate handling for fixed-size and variable-size result types
- Added helper methods to `MPICommunicator`: `probe()`, `recv_any()`, `recv_empty_message_any()`, and `adjust_recv_bytes_received()`
- Added comprehensive test coverage for immediate receive mode with various result types and configurations
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.
| File | Description |
|---|---|
| test/mpi/test_distributers.cpp | Added new test fixture and four test cases to verify immediate receive mode with different result types (primitives, vectors), priority queues, and statistics tracking |
| include/dynampi/mpi/mpi_communicator.hpp | Added MPI wrapper methods for probe, immediate receive, and statistics adjustment to support the new receive mode |
| include/dynampi/impl/naive_distributor.hpp | Implemented immediate receive mode in the worker message handling logic with separate code paths for fixed-size and resizable result types, including statistics correction logic |
```cpp
int max_result_size = 1024;  // Maximum expected size for RESULT messages when using immediate
                             // recv. Must be large enough to hold the largest expected RESULT
                             // message. If a message exceeds this size, behavior is undefined.
```
The comment states "If a message exceeds this size, behavior is undefined", but the actual behavior when a RESULT message exceeds `max_result_size` would be a buffer overflow or truncation, which is a critical safety issue. Consider adding runtime validation to detect when messages exceed this size and either throw an error or log a warning. The documentation should also be clearer about what actually happens (e.g., "message will be truncated" or "will cause undefined behavior/crash").
Suggested change:

```diff
-int max_result_size = 1024;  // Maximum expected size for RESULT messages when using immediate
-                             // recv. Must be large enough to hold the largest expected RESULT
-                             // message. If a message exceeds this size, behavior is undefined.
+int max_result_size = 1024;  // Maximum expected size (in bytes) for RESULT messages when using
+                             // immediate recv. Must be large enough to hold the largest possible
+                             // RESULT message produced by the worker function. This class does
+                             // not perform runtime validation of this limit; if a RESULT message
+                             // is larger than max_result_size, the underlying MPI calls may
+                             // truncate the message or cause memory corruption/crashes depending
+                             // on the MPI implementation and surrounding code.
```
```cpp
int count;
DYNAMPI_MPI_CHECK(MPI_Get_count, (&status, result_type::value, &count));
// Resize buffer to actual received count (may be less than max_result_size)
result_type::resize(buffer, count);
```
When receiving variable-size messages (`resize_required` types) in immediate receive mode, if the incoming RESULT message is larger than `max_result_size`, MPI will truncate the message to fit the buffer, potentially causing data loss or corruption. Consider adding validation after receiving: compare `count` (from `MPI_Get_count`) with `max_result_size` and throw an error or log a warning when truncation is detected. This would help catch configuration errors where `max_result_size` is set too small.
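A minimal sketch of the check suggested above, with the MPI specifics stripped away so it can be tested in isolation. The helper name `check_possible_truncation` is hypothetical; `count` and `max_result_size` follow the snippet above.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical guard: if MPI_Get_count reports that the received element count
// filled the entire buffer, the message may have been truncated to fit, so we
// fail loudly instead of silently storing a corrupted result.
inline void check_possible_truncation(int count, int max_result_size) {
    if (count >= max_result_size) {
        throw std::runtime_error(
            "RESULT message filled the receive buffer (count=" + std::to_string(count) +
            ", max_result_size=" + std::to_string(max_result_size) +
            "); it may have been truncated");
    }
}
```

The `>=` comparison is deliberately conservative: a message that exactly fills the buffer cannot be distinguished from a larger one that was cut off.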
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@include/dynampi/mpi/mpi_communicator.hpp`:
- Around lines 230-235: `adjust_recv_bytes_received` can underflow when `bytes_to_subtract > _statistics.bytes_received`. Update the function (inside the `statistics_mode != StatisticsMode::None` guard) to clamp or assert: if `bytes_to_subtract >= _statistics.bytes_received`, set `_statistics.bytes_received = 0` (or log/assert in debug builds); otherwise subtract `bytes_to_subtract` from `_statistics.bytes_received`. This prevents `size_t` wraparound.
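The clamping described above can be sketched in isolation. `RecvStatistics` here is a hypothetical stand-in for the communicator's real statistics struct; only the subtraction logic is the point.

```cpp
#include <cstddef>

// Hypothetical stand-in for the communicator's statistics record.
struct RecvStatistics {
    std::size_t bytes_received = 0;
};

// Clamped subtraction: never lets the unsigned counter wrap around when
// bytes_to_subtract exceeds the running total.
inline void adjust_recv_bytes_received(RecvStatistics& stats, std::size_t bytes_to_subtract) {
    if (bytes_to_subtract >= stats.bytes_received) {
        stats.bytes_received = 0;  // clamp to zero instead of wrapping
    } else {
        stats.bytes_received -= bytes_to_subtract;
    }
}
```

In a debug build one could additionally `assert(bytes_to_subtract <= stats.bytes_received)` so that over-subtraction is caught during testing while release builds stay safe.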
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@include/dynampi/impl/naive_distributor.hpp`:
- Around lines 271-300: the immediate receive path can call `recv_any(buffer)` before knowing the tag, causing an MPI datatype mismatch when a REQUEST (nullptr) message arrives. Change the `_config.use_immediate_recv` branch to first obtain the `MPI_Status`/tag (via `_communicator.recv_empty_message_any(status)` or otherwise probing), then:
  - if `status.MPI_TAG == Tag::RESULT`, allocate/resize the `ResultT` buffer (using `result_type::resize(_config.max_result_size)` when `resize_required`), call `_communicator.recv_any(buffer)`, obtain the element count with `MPI_Get_count`, and call `process_result_message(status, std::move(buffer), count)`;
  - if `status.MPI_TAG == Tag::REQUEST`, call `_communicator.recv_empty_message()` (or otherwise handle the empty receive) and proceed as the probe-based code does.

  Update both the `resize_required` and non-resize paths to follow this flow.
🧹 Nitpick comments (3)

include/dynampi/mpi/mpi_communicator.hpp (1)

**179-195: Consider extracting shared statistics tracking logic.** The statistics update block (lines 186-193) is nearly identical to the one in `recv()` (lines 162-169). Consider extracting a private helper to reduce duplication.

♻️ Example refactor:

```cpp
 private:
  template <typename mpi_type>
  void update_recv_statistics(const MPI_Status& status) {
    if constexpr (statistics_mode != StatisticsMode::None) {
      _statistics.recv_count++;
      int actual_count;
      DYNAMPI_MPI_CHECK(MPI_Get_count, (&status, mpi_type::value, &actual_count));
      int size;
      MPI_Type_size(mpi_type::value, &size);
      _statistics.bytes_received += actual_count * size;
    }
  }
```

include/dynampi/impl/naive_distributor.hpp (2)
**33-36: Clarify truncation behavior in documentation.** The comment states behavior is "undefined" if a message exceeds `max_result_size`, but MPI actually returns `MPI_ERR_TRUNCATE` on truncation, which `DYNAMPI_MPI_CHECK` would catch and likely abort. Consider clarifying that exceeding this size will cause an error, not silent corruption.

```diff
-  int max_result_size = 1024;  // Maximum expected size for RESULT messages when using immediate
-                               // recv. Must be large enough to hold the largest expected RESULT
-                               // message. If a message exceeds this size, behavior is undefined.
+  int max_result_size = 1024;  // Maximum expected size for RESULT messages when using immediate
+                               // recv. Must be large enough to hold the largest expected RESULT
+                               // message. If a message exceeds this size, MPI_Recv will fail
+                               // with MPI_ERR_TRUNCATE.
```
**247-261: Reuse `idx_for_worker()` and remove the redundant resize.**

- Line 249 duplicates the logic already in `idx_for_worker(status.MPI_SOURCE)`.
- Lines 256-258: resizing `_results[task_idx]` before move assignment is wasteful; the move will overwrite the destination entirely, discarding any pre-allocated storage.

♻️ Proposed fix:

```diff
 void process_result_message(const MPI_Status& status, ResultT&& result, int count) {
-  using result_type = MPI_Type<ResultT>;
-  int worker_idx = status.MPI_SOURCE - (status.MPI_SOURCE > _config.manager_rank);
+  int worker_idx = idx_for_worker(status.MPI_SOURCE);
   int64_t task_idx = _worker_current_task_indices[worker_idx];
   _worker_current_task_indices[worker_idx] = -1;
   assert(task_idx >= 0 && "Task index should be valid");
   if (static_cast<uint64_t>(task_idx) >= _results.size()) {
     _results.resize(task_idx + 1);
   }
-  if constexpr (result_type::resize_required) {
-    result_type::resize(_results[task_idx], count);
-  }
   _results[task_idx] = std::move(result);
   _results_received++;
 }
```
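The rank-to-index arithmetic being factored out above can be checked in isolation. The free function below is a hypothetical mirror of `idx_for_worker`, with the manager rank passed explicitly rather than read from `_config`:

```cpp
// Hypothetical mirror of idx_for_worker: ranks above the manager are shifted
// down by one so that worker indices stay contiguous once the manager's own
// rank is skipped.
inline int idx_for_worker(int source_rank, int manager_rank) {
    return source_rank - (source_rank > manager_rank ? 1 : 0);
}
```

For example, with the manager at rank 0, workers at ranks 1..N map to indices 0..N-1; with the manager at rank 2, ranks 0, 1, 3 map to indices 0, 1, 2.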
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>