Draining read, take 2 by jasnell · Pull Request #6265 · cloudflare/workerd

jasnell · 2026-03-05T23:06:30Z

Fixes up multiple issues with the draining read implementation, and more importantly, places the change behind an autogate.

Fixes UAF caused by an errant state change
Fixes an unbounded read in certain cases
Fixes handling of closed and canceled states

Along the way,

~~Fixes a bug in the @all-autogates variant that caused it to be ineffective for any test using TestFixture~~
Add an opencode skill for more effectively running bazel tests (was super necessary in this case)
Refined the investigate command/skill a bit to help keep it focused.

ask-bonk

I'm Bonk, and I've done a quick review of your PR.

This PR fixes multiple safety issues (UAF, unbounded reads, closed/canceled state handling) in the draining read path for standard streams, places the new pumpTo implementation behind an autogate (ENABLE_DRAINING_READ_ON_STANDARD_STREAMS), and fixes a real bug where @all-autogates test variants were ineffective when TestFixture called initAutogate({}) early. Excellent test coverage with regression tests for each bug class.

Issues

(Medium) Duplicated code between ValueQueue and ByteQueue drainingRead()

The post-pump-loop logic (lines 253-303 for ValueQueue, lines 723-770 for ByteQueue) is nearly identical across the two implementations: !impl.state.isActive() check, impl.queue == kj::none check, isClosing finalization, and the normal data-return path. This is a maintenance hazard — if a bug is found in one path, it must be fixed in both. Not a blocker but worth noting as a follow-up to extract the shared post-pump logic into a common helper.
(Low) benchChunkCounterStatic is a file-scope mutable global

In bench-pumpto.c++, benchChunkCounterStatic (line ~133 of the new file) is a file-scoped static size_t that gets reset in createConfiguredStream() and mutated via raw pointer in pull callbacks. Google Benchmark can run iterations in parallel in some configurations. This is unlikely to cause issues with the current --benchmark_min_time defaults, but a per-invocation kj::heap<size_t>() or a member of a benchmark fixture would be more robust. Not blocking since the benchmark is tags = ["manual"].

.opencode/package.json

ask-bonk · 2026-03-05T23:30:38Z

I have read the CLA Document and I hereby sign the CLA

Co-authored-by: jasnell <jasnell@users.noreply.github.com>

This reverts commit c9309c6.

…uction The pumpToImpl coroutine uses DrainingReader which calls `consumer->drainingRead()``. That call may trigger `onConsumerWantsData` -> `forcePull` -> `pull` callback -> synchronous close/error -> `deferTransitionTo<Closed>`. Previously, both `ValueReadable::drainingRead()` and the caller (`ReadableStreamJsController::drainingRead`) each called `beginOperation()`/`endOperation()` independently. The inner `endOperation()` in ValueReadable/ByteReadable fired the deferred state transition before the caller's `wrapDrainingRead` could set up `.then()` callbacks on the returned promise. Since those callbacks capture `this` (the Consumer), the transition destroyed the Consumer out from under them triggering the UAF. The fix removes `beginOperation()`/`endOperation()` from `ValueReadable::drainingRead()` and `ByteReadable::drainingRead()`, and moves the single `beginOperation()` call to before `consumer->drainingRead()` at each call site in `ReadableStreamJsController::drainingRead`. The matching `endOperation()` remains in the `.then()`/`.catch()` callbacks of `wrapDrainingRead`, ensuring the deferred state change only fires after the Consumer's this-capturing callbacks have already run. `js.tryCatch` wraps each call site for exception safety. The tests do not perfectly catch the UAF even with ASAN because the conditions are extremely timing-sensitive and I haven't yet found a way to reproduce the exact timing reliably in workerd.

anonrig · 2026-03-06T17:05:58Z

src/workerd/tests/BUILD.bazel


+# Benchmark for PumpToReader (ReadableStream::pumpTo path in standard.c++).
+# Run before and after DrainingReader adoption to measure improvement.
+#   bazel run --config=opt //src/workerd/tests:bench-pumpto


or just bench pumpto

jasnell requested review from anonrig, danlapid and harrishancock March 5, 2026 23:06

jasnell requested review from a team as code owners March 5, 2026 23:06

ask-bonk bot reviewed Mar 5, 2026

View reviewed changes

This comment was marked as duplicate.

Sign in to view

ask-bonk bot requested a review from a team as a code owner March 5, 2026 23:22

This comment was marked as outdated.

Sign in to view

jasnell commented Mar 5, 2026

View reviewed changes

.opencode/package.json Outdated Show resolved Hide resolved

jasnell commented Mar 5, 2026

View reviewed changes

.opencode/package.json Outdated Show resolved Hide resolved

This comment was marked as outdated.

Sign in to view

ask-bonk bot added a commit that referenced this pull request Mar 5, 2026

CLA comment posted on PR #6265.

fa9c248

Co-authored-by: jasnell <jasnell@users.noreply.github.com>

github-actions bot added a commit that referenced this pull request Mar 5, 2026

@ask-bonk[bot] has signed the CLA in #6265

22841f6

jasnell force-pushed the jasnell/fixup-drainingread branch from fa9c248 to e41bcda Compare March 5, 2026 23:32

jasnell added 10 commits March 6, 2026 08:06

Refine the investigation skill further

1cc04a5

Revert "Revert "Update standard.c++ pumpTo impl to use draining read""

2cb56ab

This reverts commit c9309c6.

Handle synchronous cancel better in draining read

1fe3609

Fixup drainingRead memory limits

f78a972

Fixup error propagation in drainingRead

d740382

Cleanup closing state when drainingRead completes

c12392f

Add bazel test hygiene opencode skill

abf3df5

Add streams draining read autogate

ffc0b22

Restore original PumpToReader and place pumpToImpl behind autogate

cc5c650

jasnell force-pushed the jasnell/fixup-drainingread branch from e41bcda to cc5c650 Compare March 6, 2026 16:06

anonrig reviewed Mar 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draining read, take 2#6265

Draining read, take 2#6265
jasnell wants to merge 10 commits intomainfrom
jasnell/fixup-drainingread

jasnell commented Mar 5, 2026 •

edited

Loading

Uh oh!

ask-bonk bot left a comment

Uh oh!

This comment was marked as duplicate.

This comment was marked as outdated.

Uh oh!

Uh oh!

This comment was marked as outdated.

ask-bonk bot commented Mar 5, 2026

Uh oh!

This comment was marked as outdated.

anonrig Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jasnell commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ask-bonk bot left a comment

Choose a reason for hiding this comment

Issues

Uh oh!

This comment was marked as duplicate.

This comment was marked as outdated.

Uh oh!

Uh oh!

This comment was marked as outdated.

ask-bonk bot commented Mar 5, 2026

Uh oh!

This comment was marked as outdated.

anonrig Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jasnell commented Mar 5, 2026 •

edited

Loading