feat(driver): do not schedule the same timeout twice by cason · Pull Request #1501 · circlefin/malachite

cason · 2026-02-24T15:04:12Z

Closes: #1500

As per title.

This can happen to several reasons, but mostly because TimeoutPrecommit is re-scheduled at every round step change.

The solution consists on creating a scheduled_timeouts vector at the driver, and only adding a new timeout to it if not already present. The vector is cleaned upon a new height, and old entries are removed upon a new height.

cason · 2026-02-24T17:00:37Z

code/crates/core-driver/src/driver.rs

+        if timeout.round < self.round() || self.scheduled_timeouts.contains(&timeout) {
+            return;
+        }
+        // XXX: test if the driver produces **non**-consensus timeouts


Should I change this to a warning or something like that? Apparently, we don't panic - at least in the test that have ran.

I would perhaps change this to a debug_assert!

romac · 2026-02-25T15:58:12Z

code/crates/core-driver/src/driver.rs

    /// The certificate that justifies moving to the `enter_round` specified in the `EnterRoundCertificate.
    pub round_certificate: Option<EnterRoundCertificate<Ctx>>,
+
+    scheduled_timeouts: Vec<Timeout>,


Let's add a doc comment here to explain this field's purpose

code/crates/core-driver/src/driver.rs

code/crates/core-types/src/timeout.rs

romac · 2026-02-25T16:01:29Z

code/crates/core-driver/src/driver.rs

+        if timeout.round < self.round() || self.scheduled_timeouts.contains(&timeout) {
+            return;
+        }
+        // XXX: test if the driver produces **non**-consensus timeouts


I would perhaps change this to a debug_assert!

Co-authored-by: Romain Ruetschi <github@romac.me>

nenadmilosevic95

Looks good! Left two minor comments. In general, I like the driver-level dedup — simple and effective. IMHO the state machine check might be even cleaner, since the Tendermint pseudo-code states "for the first time" and the state machine should align with it, but this works well too.

nenadmilosevic95 · 2026-02-26T18:22:21Z

code/crates/core-driver/src/driver.rs

    }

+    fn lift_timeout_output(&mut self, timeout: Timeout, outputs: &mut Vec<Output<Ctx>>) {
+        if timeout.round < self.round() || self.scheduled_timeouts.contains(&timeout) {


Minor: Can this ever be true timeout.round < self.round()? Timeouts should only be for the current round. If purely defensive, I'd add a warn! here so we notice if it ever triggers.

Good point.

nenadmilosevic95 · 2026-02-26T18:23:43Z

code/crates/core-driver/src/driver.rs


+        // Remove useless timeouts from previous rounds
+        self.scheduled_timeouts
+            .retain(|timeout| timeout.round >= round);


Similar here, could this just be clear()? It is minor ofc, I just want to understand if we expect to see any future rounds here, since I don't see how

Also a good point. But the concern here is to limit the growth of this set.

I wonder if we can schedule timeouts for future rounds - otherwise, clear() is the way to go.

cason · 2026-02-27T09:45:53Z

IMHO the state machine check might be even cleaner, since the Tendermint pseudo-code states "for the first time" and the state machine should align with it, but this works well too.

I will try to implement this version.

cason · 2026-02-27T10:51:06Z

IMHO the state machine check might be even cleaner, since the Tendermint pseudo-code states "for the first time" and the state machine should align with it, but this works well too.

I will try to implement this version.

#1508

cason requested review from ancazamfir and romac as code owners February 24, 2026 15:04

cason added 2 commits February 24, 2026 16:04

driver: do not schedule the same timeout twice

ccd02fc

driver: remove duplicated timeouts from tests

4d031cf

cason force-pushed the duplicated-timeouts branch from 652ed50 to 4d031cf Compare February 24, 2026 15:05

This comment was marked as outdated.

Sign in to view

github-actions bot added the need-triage This issue needs to be triaged label Feb 24, 2026

github-actions bot closed this Feb 24, 2026

XXX: does the driver produce non-consensus timeouts?

cb9c578

romac reopened this Feb 24, 2026

romac removed the need-triage This issue needs to be triaged label Feb 24, 2026

cason commented Feb 24, 2026

View reviewed changes

romac requested changes Feb 25, 2026

View reviewed changes

cason and others added 2 commits February 25, 2026 23:24

Apply suggestions from @romac

7cd5092

Co-authored-by: Romain Ruetschi <github@romac.me>

Addressing comments from @romac

023fe6f

cason requested a review from romac February 25, 2026 22:29

Cargo fmt

8262369

romac requested a review from nenadmilosevic95 February 26, 2026 14:30

Merge branch 'main' into duplicated-timeouts

76ec324

nenadmilosevic95 approved these changes Feb 26, 2026

View reviewed changes

cason mentioned this pull request Feb 27, 2026

feat(smr): do not schedule the same timeout twice #1508

Open

Conversation

cason commented Feb 24, 2026 • edited by romac Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nenadmilosevic95 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cason commented Feb 27, 2026

Uh oh!

cason commented Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cason commented Feb 24, 2026 •

edited by romac

Loading