ChannelClosedException when server closes connection right after releasing the last HTTP/2 stream #6258
Motivation and Context
1. The AWS service returns the final part of the response.
2. ResponseHandler.finalizeResponse is invoked to complete processing of the response.
3. The client sends an RST_STREAM frame to acknowledge the completion.
4. It then calls release on the channel pool. Since the channel pool consists of multiple layers, this invocation eventually reaches HttpOrHttp2ChannelPool.release.
5. HttpOrHttp2ChannelPool.release needs to access the protocolImpl field, which may only be accessed safely from the pool's event loop. To ensure thread safety, it submits a task to perform the release on that event loop.
6. Meanwhile, the server receives the reset frame and immediately closes the connection (why it does so is unclear; that is a question for the service developers).
7. This triggers channelInactive on the Http2ConnectionHandler.
8. As a result, MultiplexedChannelRecord.closeAndExecuteOnChildChannels is called. It detects that there are still unreleased child channels, because the task submitted in step 5 has not yet been executed. This leads to a ClosedChannelException being thrown and logged as an error.
9. The release task finally runs on the pool's event loop, but a little too late.
It may also be the root cause of #2914.
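To make the race more concrete, here is a minimal, hypothetical sketch of the hand-off described in steps 4–9. The class and member names (`RacySketch`, `unreleasedChildren`, `childAcquired`, `closeAndFailUnreleasedChildren`, `protocolResolved`) are illustrative only and do not match the SDK's actual `HttpOrHttp2ChannelPool` / `MultiplexedChannelRecord` code; the point is the ordering between the task submitted in step 5 and the close path in step 8.

```java
import java.nio.channels.ClosedChannelException;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

import io.netty.channel.Channel;
import io.netty.channel.EventLoop;
import io.netty.channel.pool.ChannelPool;
import io.netty.util.concurrent.Future;
import io.netty.util.concurrent.Promise;

// Illustrative only -- not the SDK's real implementation.
final class RacySketch {

    private final EventLoop poolEventLoop;
    private ChannelPool protocolImpl;  // before the fix: confined to poolEventLoop, not volatile
    private final Set<Channel> unreleasedChildren = ConcurrentHashMap.newKeySet();

    RacySketch(EventLoop poolEventLoop) {
        this.poolEventLoop = poolEventLoop;
    }

    // Called on poolEventLoop once protocol negotiation picks the HTTP/2 pool.
    void protocolResolved(ChannelPool resolvedPool) {
        this.protocolImpl = resolvedPool;
    }

    // Called when a child stream is handed out to a request.
    void childAcquired(Channel childChannel) {
        unreleasedChildren.add(childChannel);
    }

    // Step 5: the release is bounced onto the pool's event loop so that
    // protocolImpl is only ever touched from that thread.
    Future<Void> release(Channel childChannel) {
        Promise<Void> promise = poolEventLoop.newPromise();
        poolEventLoop.execute(() -> {
            unreleasedChildren.remove(childChannel);
            protocolImpl.release(childChannel, promise);
        });
        return promise;
    }

    // Steps 7-8: the parent HTTP/2 connection went inactive. If this runs before
    // the task submitted in release(), the child stream released in step 4 still
    // looks "in use" here, so it is failed with ClosedChannelException and the
    // failure is logged as an error.
    void closeAndFailUnreleasedChildren() {
        for (Channel child : unreleasedChildren) {
            child.pipeline().fireExceptionCaught(new ClosedChannelException());
            child.close();
        }
    }
}
```

With this shape, whether the error shows up depends entirely on whether the event loop happens to run the release task before or after channelInactive is processed, which matches the intermittent nature of the issue.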
Modifications
By declaring protocolImpl as volatile, we ensure that its assignment within the pool's event loop is visible to other threads, so the field can be read safely without first hopping onto that event loop. The underlying BetterFixedChannelPool is thread-safe: its mutable state is managed exclusively within its dedicated event loop, and for release operations the state updates are performed via a future listener after the underlying pool completes the release. This avoids the concurrency issue observed with HttpOrHttp2ChannelPool.
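As a rough sketch of what the volatile field enables (again using hypothetical names, not the PR's actual diff), the release path can read the already-assigned delegate from any thread and hand off immediately, instead of first scheduling a task on the pool's event loop:

```java
import io.netty.channel.Channel;
import io.netty.channel.pool.ChannelPool;
import io.netty.util.concurrent.Future;
import io.netty.util.concurrent.Promise;

// Illustrative only -- not the PR's exact code.
final class FixedSketch {

    private volatile ChannelPool protocolImpl;  // written once on the pool's event loop

    Future<Void> release(Channel childChannel, Promise<Void> promise) {
        ChannelPool delegate = protocolImpl;    // volatile read: safe from any thread once assigned
        if (delegate == null) {
            // Protocol not negotiated yet; placeholder handling for this sketch only.
            childChannel.close();
            return promise.setFailure(new IllegalStateException("Protocol not yet resolved"));
        }
        // The delegate (BetterFixedChannelPool in the HTTP/2 case) is thread-safe:
        // it updates its own state via a future listener after the release completes.
        return delegate.release(childChannel, promise);
    }
}
```

This removes the extra scheduling hop that opened the window described in step 8, since the release is handed to the thread-safe delegate on the calling thread instead of waiting behind the pool's event loop.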
Tests
Reproducing the issue consistently in a test environment is challenging, as it relies on the precise timing and order of event handler invocations.
We’ve been running the fix without the volatile keyword on protocolImpl in one of our production services for some time. It has been working well, as evidenced by a noticeable drop in error logs:

The deployment with the volatile keyword added is now live. I’ll share an update on the results shortly.