Avoid change streams on the storage database #276


Merged: 16 commits merged on Jun 17, 2025

Conversation

rkistner
Contributor

@rkistner rkistner commented Jun 12, 2025

Background

The MongoDB storage adapter relied on change streams to detect changes to:

  1. Read checkpoints.
  2. Write checkpoints (implemented in Optimize write checkpoints lookups #230).

The idea is that each API process is "notified" of changes, so that:

  1. There is low overhead when the instance is idle.
  2. Latency is short.

The issue

The issue is that change streams can have high overhead on a cluster. A change stream effectively reads all changes in the oplog, then filters them down to the watched collection/pipeline.

This is fine when you only have a small number of open change streams, or low write volumes. However, we have cases where 100+ change streams are open at a time. Even though the watched collections are not modified often, combine that with a 20k/s write rate (which can happen when reprocessing sync rules) and you suddenly end up with 100k document scans/s, even though very few documents are returned.

I cannot find any good documentation on this - the performance impact is not mentioned in the MongoDB docs. But this script demonstrates the issue quite clearly: https://gist.github.com/rkistner/0d898880b0a0a48d1557f64e01992795

I also suspect that MongoDB has an optimization for this issue in Flex clusters, but that code is private unfortunately.

The fix

The fix is to not use watch/change streams in the storage adapter. The actual implementation is different for read and write checkpoints.

Read checkpoints

Diff for this part

We implement this similarly to the NOTIFY functionality we use for Postgres storage:

  1. Each time a read checkpoint is committed, we write an empty document to a new checkpoint_events capped collection.
  2. The API processes watch this collection for changes, by using a tailable cursor.
  3. When a change is seen, it fetches the latest state from the sync_rules collection.

An alternative would be to simply poll sync_rules. However, this method has lower latency, and lower overhead when the instance is mostly idle.

Tailable cursors are an under-documented feature, but it does appear to work well for this case. It gives functionality similar to change streams, with better efficiency, at the cost of requiring explicit writes to the collection.
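The notify-then-fetch flow above can be simulated in plain TypeScript. This is an illustrative sketch of the pattern only, not the actual adapter code: the committer writes an event (standing in for the insert into checkpoint_events), blocked watchers wake up (standing in for the tailable cursor), and each watcher then fetches the latest state rather than trusting the event itself.

```typescript
// In-memory stand-in for the checkpoint_events capped collection:
// notify() resolves all currently-blocked waiters, wait() blocks
// until the next notification.
class CheckpointEvents {
  private waiters: Array<() => void> = [];

  // Equivalent of inserting an empty document into checkpoint_events.
  notify(): void {
    const pending = this.waiters;
    this.waiters = [];
    for (const resolve of pending) resolve();
  }

  // Equivalent of a tailable cursor blocking until a new document arrives.
  wait(): Promise<void> {
    return new Promise((resolve) => this.waiters.push(resolve));
  }
}

// Shared state standing in for the sync_rules collection.
let latestCheckpoint = 0;
const events = new CheckpointEvents();

function commitReadCheckpoint(id: number): void {
  latestCheckpoint = id; // persist the checkpoint first...
  events.notify(); // ...then write the notification event
}

async function runWatcher(count: number): Promise<number[]> {
  const seen: number[] = [];
  for (let i = 0; i < count; i++) {
    await events.wait();
    // On wake-up, fetch the latest state instead of reading an event payload.
    seen.push(latestCheckpoint);
  }
  return seen;
}

async function demo(): Promise<number[]> {
  const watcher = runWatcher(3);
  for (const id of [1, 2, 3]) {
    // Yield so the watcher is blocked in wait() before we commit.
    await new Promise((r) => setTimeout(r, 0));
    commitReadCheckpoint(id);
  }
  return watcher;
}

const demoResult = demo();
demoResult.then((seen) => console.log(seen)); // prints [ 1, 2, 3 ]
```

The key property, as in the real implementation, is that the event carries no data: a spurious or coalesced wake-up is harmless because the watcher always re-reads the authoritative state.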

Write checkpoints

Diff for this part

For write checkpoints, we now use the same mechanism as for bucket_data and parameter_data: On each new read checkpoint, we read all the write checkpoints created after the previous read checkpoint.
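Once each write checkpoint carries an op_id assigned in the same batch as other data, "all write checkpoints created after the previous read checkpoint" reduces to a range query. A minimal sketch of that lookup, with illustrative types and names that are assumptions rather than the actual implementation:

```typescript
// Illustrative shape of a custom write checkpoint record: the op_id is
// assigned in the same batch/transaction as bucket_data/parameter_data.
interface CustomWriteCheckpoint {
  user_id: string;
  checkpoint: number;
  op_id: number;
}

// Equivalent of an indexed range query:
// op_id > previous read checkpoint AND op_id <= current read checkpoint.
function checkpointsSince(
  all: CustomWriteCheckpoint[],
  previousReadOpId: number,
  currentReadOpId: number
): CustomWriteCheckpoint[] {
  return all.filter(
    (c) => c.op_id > previousReadOpId && c.op_id <= currentReadOpId
  );
}

const stored: CustomWriteCheckpoint[] = [
  { user_id: "u1", checkpoint: 1, op_id: 10 },
  { user_id: "u2", checkpoint: 4, op_id: 25 },
  { user_id: "u1", checkpoint: 2, op_id: 31 },
];

// Only op_id 25 falls in the (10, 30] range.
console.log(checkpointsSince(stored, 10, 30).map((c) => c.user_id)); // prints [ 'u2' ]
```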

What makes this a larger change is that:

  1. We did not previously record sufficient info to look up the write checkpoints between two read checkpoints.
  2. Managed write checkpoints are persisted in a completely different way from custom write checkpoints, so this requires separate implementations for each.
    a. Custom write checkpoints are now persisted in the same batch/transaction as other data, and get a matching op_id.
    b. Managed write checkpoints get a processed_at_lsn field, populated when a read checkpoint is committed. We may change this to also use an op_id in the future, but that would complicate the current implementation a bit.
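The managed-checkpoint side can be sketched as follows. The processed_at_lsn field name comes from the PR description, but the record shape, the stamping logic, and the plain string LSN comparison are simplified assumptions for illustration:

```typescript
// Illustrative shape of a managed write checkpoint record.
interface ManagedWriteCheckpoint {
  user_id: string;
  lsn: string; // LSN at which the write checkpoint was created
  processed_at_lsn: string | null; // set once a read checkpoint covers it
}

// Called when a read checkpoint is committed at readLsn. Real LSN
// comparison is source-database specific; string comparison stands in here.
function commitReadCheckpointAt(
  checkpoints: ManagedWriteCheckpoint[],
  readLsn: string
): void {
  for (const c of checkpoints) {
    if (c.processed_at_lsn === null && c.lsn <= readLsn) {
      c.processed_at_lsn = readLsn;
    }
  }
}

// A managed write checkpoint becomes visible once processed_at_lsn is set.
function processed(
  checkpoints: ManagedWriteCheckpoint[]
): ManagedWriteCheckpoint[] {
  return checkpoints.filter((c) => c.processed_at_lsn !== null);
}

const pending: ManagedWriteCheckpoint[] = [
  { user_id: "u1", lsn: "0/100", processed_at_lsn: null },
  { user_id: "u2", lsn: "0/300", processed_at_lsn: null },
];

// A read checkpoint at 0/200 covers u1's checkpoint but not u2's.
commitReadCheckpointAt(pending, "0/200");
console.log(processed(pending).map((c) => c.user_id)); // prints [ 'u1' ]
```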

This reverts big parts of #230, but does not go back to the old approach. This actually results in less code and a simpler architecture overall.

@rkistner rkistner requested a review from stevensJourney June 12, 2025 15:02

changeset-bot bot commented Jun 12, 2025

🦋 Changeset detected

Latest commit: d5abe45

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 11 packages
Name Type
@powersync/service-module-postgres-storage Minor
@powersync/service-module-mongodb-storage Minor
@powersync/service-core-tests Minor
@powersync/service-module-postgres Minor
@powersync/service-core Minor
@powersync/service-schema Minor
@powersync/service-module-mongodb Patch
@powersync/service-module-mysql Patch
@powersync/service-image Minor
@powersync/service-module-core Patch
test-client Patch


@rkistner rkistner requested a review from Copilot June 17, 2025 11:18
Contributor

@Copilot Copilot AI left a comment


Pull Request Overview

The primary purpose of this PR is to remove change streams from the storage adapter and adopt a more efficient, capped-collection/tailable-cursor–based mechanism for checkpoint notifications. Key changes include the complete removal of Demultiplexer and its tests, refactoring of the write checkpoint APIs (in both Postgres and MongoDB), and updates in test and migration files to support the new notification mechanism.

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated no comments.

Show a summary per file
File Description
packages/service-core/test/src/demultiplexer.test.ts Removed Demultiplexer tests as the functionality has been deprecated.
packages/service-core/src/streams/streams-index.ts Removed export of Demultiplexer to reflect its removal.
packages/service-core/src/storage/WriteCheckpointAPI.ts Dropped unused interfaces/methods relating to change stream-based checkpoint watching.
modules/module-mongodb-storage/src/storage/implementation/MongoWriteCheckpointAPI.ts Refactored methods to replace watch methods with a new checkpoint change API and updated error types.
modules/module-mongodb-storage/src/storage/implementation/db.ts Added methods for checkpoint notifications and creation of a capped collection for events.
Other files Various test and migration files updated to use the new batching and notification mechanisms.
Comments suppressed due to low confidence (2)

modules/module-mongodb-storage/src/storage/implementation/MongoWriteCheckpointAPI.ts:60

  • Consider using a more explicit boolean check for the existence of 'sync_rules_id'. For example, replace the condition with 'if (!('sync_rules_id' in filters))' for improved clarity.
if (false == 'sync_rules_id' in filters) {

modules/module-mongodb-storage/src/storage/implementation/MongoWriteCheckpointAPI.ts:65

  • Consider replacing 'if (false == 'heads' in filters)' with 'if (!('heads' in filters))' to clearly express the intended check.
if (false == 'heads' in filters) {

@rkistner rkistner marked this pull request as ready for review June 17, 2025 11:42
Collaborator

@stevensJourney stevensJourney left a comment


The logic here looks good to me.

@rkistner rkistner merged commit d235f7b into main Jun 17, 2025
21 checks passed
@rkistner rkistner deleted the write-checkpoint-polling branch June 17, 2025 14:02