
Conversation

@joostjager
Contributor

Update MonitorUpdatingPersister and MonitorUpdatingPersisterAsync to queue persist operations in memory instead of writing immediately to disk. The Persist trait methods now return ChannelMonitorUpdateStatus::InProgress and the actual writes happen when flush() is called.

This fixes a race condition that could cause channel force closures: previously, if the node crashed after writing channel monitors but before writing the channel manager, the monitors would be ahead of the manager on restart. By deferring monitor writes until after the channel manager is persisted (via flush()), we ensure the manager is always at least as up-to-date as the monitors.

Key changes:

  • Add PendingWrite enum to represent queued write/remove operations
  • Add pending_writes queue to MonitorUpdatingPersisterAsyncInner
  • Add flush() to Persist trait and ChainMonitor
  • Update Persist impl to queue writes and return InProgress
  • Call flush() in background processor after channel manager persistence
  • Remove unused event_notifier from AsyncPersister
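
A rough sketch of the queuing pattern described above, reusing the `PendingWrite` and `pending_writes` names from the key changes; the surrounding struct and methods are illustrative placeholders under assumption, not the actual MonitorUpdatingPersister internals:

```rust
use std::sync::Mutex;

/// A queued persistence operation, buffered until `flush()` is called.
/// The field layout here is assumed for illustration.
enum PendingWrite {
	/// Write serialized monitor (or monitor update) bytes under a key.
	Write { key: String, data: Vec<u8> },
	/// Remove a previously written key.
	Remove { key: String },
}

/// Simplified stand-in for the persister's pending-write queue.
struct QueuedPersister {
	pending_writes: Mutex<Vec<PendingWrite>>,
}

impl QueuedPersister {
	/// Called from the Persist methods: buffer the operation so the caller can
	/// return ChannelMonitorUpdateStatus::InProgress instead of writing now.
	fn queue(&self, op: PendingWrite) {
		self.pending_writes.lock().unwrap().push(op);
	}

	/// Called after the channel manager has been persisted: drain the queue so
	/// the caller can apply each operation to the underlying KVStore.
	fn flush(&self) -> Vec<PendingWrite> {
		std::mem::take(&mut *self.pending_writes.lock().unwrap())
	}
}
```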

@ldk-reviews-bot

👋 Hi! I see this is a draft PR.
I'll wait to assign reviewers until you mark it as ready for review.
Just convert it out of draft status when you're ready for review!

use core::mem;
use core::ops::Deref;
use core::pin::{pin, Pin};
use core::pin::pin;
Collaborator

I think we should be able to do this without touching persist.rs.

Contributor Author

Where would we do it then? Queue in ChainMonitor, in KVStore, or somewhere else?

Contributor Author

ChainMonitor would need to store the actual monitor data to defer writes, not just track update IDs as it does now. This means either cloning expensive ChannelMonitor objects or storing serialized bytes, which leaks persistence format details into ChainMonitor?
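
For illustration, deferring inside ChainMonitor would mean buffering something like the following per queued write (a purely hypothetical type, not an existing LDK struct), which is exactly the tradeoff described above:

```rust
/// Hypothetical: what ChainMonitor would have to hold per deferred write.
/// `M` stands in for the monitor type to keep the sketch self-contained.
enum DeferredMonitorWrite<M> {
	/// Option 1: clone the full monitor; ChannelMonitor is large, so cloning
	/// it on every update is expensive.
	ClonedMonitor(M),
	/// Option 2: serialize eagerly; cheap to hold, but ChainMonitor then knows
	/// the persister's on-disk format (key naming, encoding, versioning).
	SerializedBytes { key: String, bytes: Vec<u8> },
}
```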

@joostjager joostjager self-assigned this Jan 15, 2026
@joostjager joostjager force-pushed the mon-barrier branch 2 times, most recently from 2b85294 to 93ff6c9 on January 16, 2026 09:35
Update MonitorUpdatingPersister and MonitorUpdatingPersisterAsync to
queue persist operations in memory instead of writing immediately to
disk. The Persist trait methods now return
ChannelMonitorUpdateStatus::InProgress and the actual writes happen
when flush() is called.

This fixes a race condition that could cause channel force closures:
previously, if the node crashed after writing channel monitors but
before writing the channel manager, the monitors would be ahead of
the manager on restart. By deferring monitor writes until after the
channel manager is persisted (via flush()), we ensure the manager is
always at least as up-to-date as the monitors.

The flush() method takes an optional count parameter to flush only a
specific number of queued writes. The background processor captures
the queue size before persisting the channel manager, then flushes
exactly that many writes afterward. This prevents flushing monitor
updates that arrived after the manager state was captured.
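
In background-processor terms the ordering looks roughly like the sketch below; `MonitorQueue`, `persist_round`, and the stub closure are placeholders for the real ChainMonitor/persister plumbing, not LDK APIs:

```rust
/// Placeholder for the persister's queue as seen through ChainMonitor.
struct MonitorQueue {
	queued_writes: Vec<Vec<u8>>,
}

impl MonitorQueue {
	fn pending_write_count(&self) -> usize {
		self.queued_writes.len()
	}

	/// Flush only the first `count` writes; anything queued later waits for
	/// the next persistence round.
	fn flush(&mut self, count: usize) {
		for write in self.queued_writes.drain(..count) {
			// Hand `write` to the underlying KVStore here.
			let _ = write;
		}
	}
}

fn persist_round(queue: &mut MonitorQueue, persist_channel_manager: impl FnOnce()) {
	// 1. Capture the queue size before the manager state is serialized.
	let count = queue.pending_write_count();
	// 2. Persist the channel manager; it reflects at least `count` monitor updates.
	persist_channel_manager();
	// 3. Flush exactly those writes, keeping the on-disk manager at least as
	//    up-to-date as the on-disk monitors.
	queue.flush(count);
}
```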

Key changes:
- Add PendingWrite enum to represent queued write/remove operations
- Add pending_writes queue to MonitorUpdatingPersisterAsyncInner
- Add pending_write_count() and flush(count) to Persist trait and ChainMonitor
- ChainMonitor::flush() calls channel_monitor_updated for each completed write
- Update Persist impl to queue writes and return InProgress
- Call flush() in background processor after channel manager persistence
- Remove unused event_notifier from AsyncPersister

Co-Authored-By: Claude Opus 4.5 <[email protected]>
