Skip to content

sweepbatcher: notify caller about confirmations #919

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 22 commits into
base: master
Choose a base branch
from

Conversation

starius
Copy link
Collaborator

@starius starius commented Apr 9, 2025

Add fields ConfChan and ConfErrChan to sweepbatcher.SpendNotifier type which is a part of SweepRequest passed to AddSweep method.

This is needed to reuse confirmation notifications on the calling side the same way it is done for spending notifications.

Also fixed many small bugs and issues and added missing tests for notifications sent through channels of SpendNotifier.

Fixed TODO left from #891 re-add sweeps to new batches only after fully confirmed.
In case of a reorg sweeps should not go to another batch but stay in the current batch until it is fully confirmed. Only after that the remaining sweeps are re-added to another batch. Field sweep.completed is now set to true only for fully-confirmed sweeps.

Fixed OnChainFeePortion values. There were two mistakes. In case of a swap with multiple sweeps only the fee of the first sweep of a swap was accounted. Another issue is that rounding diff (the remainder) was attributed to all the sweeps rather than to the first (primary) sweep of the batch. The sweep to attribute the remainder was chosen by comparing SignatureScript which is always empty. New approach is to find the primary sweep and to compare its outpoint directly.

Pull Request Checklist

  • Update release_notes.md if your PR contains major features, breaking changes or bugfixes

@starius starius marked this pull request as ready for review April 9, 2025 04:06
@starius starius requested review from bhandras, sputn1ck and hieblmi April 9, 2025 04:15
@@ -213,8 +213,8 @@ type dbBatch struct {
// ID is the unique identifier of the batch.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe stupid question, but missing lots of context. This commit changes more than the mock, and feels a bit complicated, can you maybe elaborate what the changes to the other files are?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I updated the commit message:

sweepbatcher: align dbBatch type with DB schema

Previously, dbBatch had a State field (enum: Open, Closed, Confirmed), but in
the database it is represented as a boolean Confirmed. The Closed state was
stored the same way as Open. This wasn't an issue in practice, since an Open
batch is quickly transitioned to Closed after startup.

However, the in-memory mock stores plain dbBatch instances, leading to
inconsistent behavior between the mock and the real DB-backed store. This
commit updates dbBatch to match the database representation by replacing

Also added a note to the description of Closed const:

// Closed is the state in which the batch is no longer able to accept
// new sweeps. NOTE: this state exists only in-memory. In the database
// it is stored as Open and converted to Closed after a spend
// notification arrives (quickly after start of Batch.Run).
Closed batchState = 1

Copy link
Member

@bhandras bhandras left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good. Perhaps we don't even need the "closed" state?

// If the sweep's notifier is empty then this means that a swap
// is not waiting to read an update from it, so we can skip
// the notification part.
if s.notifier == nil || *s.notifier == (SpendNotifier{}) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why check *s.notifier == (SpendNotifier{} ?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was a copy-paste. I updated this line:

if s.notifier == nil || s.notifier.SpendErrChan == nil {

@@ -259,7 +259,7 @@ func convertBatchRow(row sqlc.SweepBatch) *dbBatch {
}

if row.Confirmed {
batch.State = batchOpen
batch.State = batchConfirmed
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I assume this just resulted in the batches restarting (and then getting to confirmed state again after spend notification received)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, and then after confirmation notification (after 3 confs).

Open -(spend notif)-> Closed -(conf notif)-> Confirmed


// Then we get the total amount that was swept by the batch.
totalSwept, err := b.store.TotalSweptAmount(ctx, parentBatchID)
if err != nil {
cancel()
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm, this is a great fix, but how did we ever not get cancelled?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This code path is executed only when a fully confirmed sweep is added again to the sweeper. This may happen if the daemon was down when the sweep was fully confirmed (got 3rd confirmation). This is not the main code path.

// manages to be added during this time, it will be
// detected as missing when analyzing the spend
// notification and will be added to new batch.
batch.state = Open
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But this also means that any new sweeps could be added, and then we just would try to publish the batch again (and fail as inputs are spent). It's robust so not a huge issue, but perhaps we could avoid.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Avoiding this would require a DB migration, which doesn’t seem worthwhile just to improve this edge case.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SGTM!

@starius
Copy link
Collaborator Author

starius commented Apr 13, 2025

Looks good. Perhaps we don't even need the "closed" state?

I think it makes sense to keep Closed to make this thing easier to understand / debug.

Also maybe greedy batch selection can work worse if we let it add to Closed.

@starius starius requested review from sputn1ck and bhandras April 13, 2025 21:15
@levmi levmi removed the request for review from hieblmi April 15, 2025 15:29
@starius starius marked this pull request as draft April 26, 2025 04:35
@starius
Copy link
Collaborator Author

starius commented Apr 26, 2025

Converting to draft. I'm working to add more commits here and to rebase it on top of #891

@starius starius changed the title sweepbatcher: notify caller about confirmations [WIP] sweepbatcher: notify caller about confirmations Apr 26, 2025
@starius starius force-pushed the conf-chan branch 2 times, most recently from a67a666 to e234022 Compare April 29, 2025 03:52
@starius starius changed the title [WIP] sweepbatcher: notify caller about confirmations sweepbatcher: notify caller about confirmations Apr 29, 2025
@starius starius marked this pull request as ready for review April 29, 2025 03:53
@starius starius force-pushed the conf-chan branch 3 times, most recently from 5217a80 to 494ebfb Compare May 2, 2025 04:40
@lightninglabs-deploy
Copy link

@sputn1ck: review reminder
@bhandras: review reminder

@levmi levmi requested review from hieblmi and removed request for sputn1ck May 6, 2025 14:22
starius added 19 commits May 8, 2025 17:44
Forgot to return the calculated total value.
Support sending errors to error channels returned by RegisterSpendNtfn and
RegisterConfirmationsNtfn.
Function monitorSpendAndNotify used to cancel the context passed to
RegisterSpendNtfn right after starting the goroutine processing results.
Spend notifications were missed.

Now the context is canceled when the goroutine finishes.
The loop always had exactly one iteration.
If monitorConfirmations fails, we still want to persist the state to DB.
Previously, dbBatch had a State field (enum: Open, Closed, Confirmed), but in
the database it is represented as a boolean Confirmed. The Closed state was
stored the same way as Open. This wasn't an issue in practice, since an Open
batch is quickly transitioned to Closed after startup.

However, the in-memory mock stores plain dbBatch instances, leading to
inconsistent behavior between the mock and the real DB-backed store. This
commit updates dbBatch to match the database representation by replacing
the State field with a Confirmed boolean.
Add fields ConfChan and ConfErrChan to SpendNotifier type which is a part of
SweepRequest passed to AddSweep method.

This is needed to reuse confirmation notifications on the calling side the same
way it is done for spending notifications.
This is needed to have multiple spending registrations running and to send
a notification to a specific spending registration.
In case of a reorg sweeps should not go to another batch but stay in the current
batch until it is fully confirmed. Only after that the remaining sweeps are
re-added to another batch.

Field sweep.completed is now set to true only for fully-confirmed sweeps.
There were two mistakes.

In case of a swap with multiple sweeps only the fee of the first sweep
of a swap was accounted.

Rounding diff (the remainder) was attributed to all the sweeps rather than to
the first (primary) sweep of the batch. The sweep to attribute the remainder
was chosen by comparing SignatureScript which is always empty. New approach is
to find the primary sweep and to compare its outpoint directly.
This is needed because sweepbatcher can use this channel in multiple
select statements to unblock itself if the caller cancels.
It doesn't need loopdb, so remove that code.
Make sure that broadcasted tx has feeRate >= minRelayFee.
Make sure that feeRate of broadcasted tx doesn't decrease.
@starius starius requested a review from sputn1ck May 8, 2025 21:29
Copy link
Member

@bhandras bhandras left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Really nice set of fixes! Making sweep-batcher more robust one step at a time 🚀
LGTM 🎉

// new sweeps.
// new sweeps. NOTE: this state exists only in-memory. In the database
// it is stored as Open and converted to Closed after a spend
// notification arrives (quickly after start of Batch.Run).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

// manages to be added during this time, it will be
// detected as missing when analyzing the spend
// notification and will be added to new batch.
batch.state = Open
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SGTM!


spendDetail := SpendDetail{
Tx: spendTx,
OnChainFeePortion: getFeePortionPaidBySweep(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Alternatively we could communicate the fee portion once the sweep is fully-confirmed. wdyt?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants