
Conversation

@kingluo (Contributor) commented Jun 26, 2024


close #2117

The leak only occurs when si_wq is full and we continue to process the current skb (which may contain the remaining SSL record), sk->sk_receive_queue, and any skbs that arrive later. The leak is never triggered if we reset the connection and stop processing data immediately. Such a fix is reasonable even apart from the leak, since si_wq is unlikely to become full without flooding.

@kingluo requested review from const-t and krizhanovsky, Jun 26, 2024 10:46
@const-t (Contributor) commented Jul 5, 2024

In general I don't have any corrections; however, I have a few suggestions:

  1. I suggest doing a connection reset on a failed tfw_connection_send() for each protocol, not only for HTTP/2.
  2. Let's replace the DBG message with a warning when pushing to si_wq fails. It's an important event, since we disconnect the client, so it would be good to know about it. It may also be worth adding a statistics counter for this event.
  3. Maybe we should find the exact place where we leak on si_wq overflow, just so we know; perhaps it can be reproduced in another way that is not known at this moment.

@krizhanovsky Please see this comment, we need to know your opinion.

@krizhanovsky modified the milestone: 0.9 - LA, Jul 22, 2024
@kingluo (Contributor, Author) commented Jul 24, 2024

  3. Maybe we should find the exact place where we leak on si_wq overflow, just so we know; perhaps it can be reproduced in another way that is not known at this moment.

It's not so easy to do this. Maybe we should keep that issue open for later investigation.

@const-t (Contributor) left a comment

LGTM. Only one question, maybe we should do the same thing for websockets as well?

@kingluo (Contributor, Author) commented Aug 5, 2024

LGTM. Only one question, maybe we should do the same thing for websockets as well?

Yes, other places should be fixed, too. I'll try to cover them later.

@krizhanovsky (Contributor) left a comment

I have a lot of questions about this PR. Also from #2117 :

The root cause of the tls error is that si_wq (which has a default budget of 10, but even 1,000,000 is not enough to flood a single connection) cannot tolerate the high rate of ping acks being sent and returns -EBUSY

Why? si_wq is supposed to be a very fast lock-free RB and we wake up a target processor on insertions. What's the reason for the slowness on the read (processing) side? There is something fundamentally broken if a Python script can flood the lock-free in-kernel network processing.

Probably it's OK to reset TCP connections that we can't handle, but we should not involve security event handling for this.

Having #1940 (comment) in mind, I'd propose to postpone the fix until #1940
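The queue semantics at the center of this debate can be sketched as follows. This is a toy userspace model, not Tempesta's actual tfw_wq implementation: a bounded queue whose producer fails fast with -EBUSY instead of blocking when the budgeted consumer has not kept up. The names `wq_push` and `wq_pop_budget` and the capacity are illustrative assumptions.

```c
#include <errno.h>
#include <stdatomic.h>
#include <stddef.h>

#define WQ_SIZE 4  /* deliberately tiny capacity so overflow is easy to show */

struct wq {
	_Atomic size_t head, tail;
	int items[WQ_SIZE];
};

/* Producer side: fail with -EBUSY instead of blocking when the ring is full. */
static int wq_push(struct wq *q, int item)
{
	size_t h = atomic_load(&q->head);
	size_t t = atomic_load(&q->tail);

	if (h - t >= WQ_SIZE)
		return -EBUSY;
	q->items[h % WQ_SIZE] = item;
	atomic_store(&q->head, h + 1);
	return 0;
}

/* Consumer side: drain at most `budget` items per run, like a softirq budget. */
static int wq_pop_budget(struct wq *q, int budget)
{
	int n = 0;

	while (n < budget) {
		size_t t = atomic_load(&q->tail);

		if (t == atomic_load(&q->head))
			break;
		atomic_store(&q->tail, t + 1);
		n++;
	}
	return n;
}
```

The point of contention is visible directly: however fast the push path is, a consumer limited by a per-run budget will eventually return -EBUSY to the producer if insertions outpace the drain rate.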

tls_state_to_str(tls->state), r,
r == -EBADMSG ? "(bad ciphertext)" : "");
return r;
return T_BLOCK_WITH_RST;
Contributor:

It seems we're going to block the client, not just reset its connection. As noted in lib/log.h, this return code is for a security event, not for OOM, which might have different causes.

@kingluo (Contributor, Author) commented Aug 15, 2024

That's purely misleading naming: all I want is to send an RST, but we only have the T_BLOCK_WITH_RST constant. Maybe we should introduce a dedicated constant for a plain RST.

Contributor:

Yeah, if we need to reset connections, then we do need a designated RST constant and appropriate workflow handling of the return codes.
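The split being proposed might look like this. The enum values and the `handle_ret` dispatcher are hypothetical, a sketch of the idea rather than Tempesta's actual API; only the name T_BLOCK_WITH_RST comes from the discussion above.

```c
/* Hypothetical return-code split; T_RST is the proposed new constant. */
enum tfw_ret {
	T_OK = 0,
	T_RST,             /* proposed: plain TCP reset, no security event */
	T_BLOCK_WITH_RST,  /* existing: block the client AND reset */
};

enum action {
	ACT_NONE  = 0,
	ACT_RST   = 1 << 0,  /* send TCP RST */
	ACT_BLOCK = 1 << 1,  /* record a security event / block the client */
};

/* Map a return code to the actions the caller should take. */
static int handle_ret(enum tfw_ret r)
{
	switch (r) {
	case T_RST:
		return ACT_RST;
	case T_BLOCK_WITH_RST:
		return ACT_RST | ACT_BLOCK;
	default:
		return ACT_NONE;
	}
}
```

With such a split, an OOM-style overflow path could return T_RST and reset the connection without also raising a security event against the client.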

}

if (unlikely(SS_CONN_TYPE(sk) & Conn_Reset))
return;
Contributor:

Introducing a Conn_Reset state that is used in only one function seems like a workaround. I'd prefer a clearer solution. It seems this state is only needed to avoid executing skb processing while the socket is queued for closing; don't we already have enough information about the socket state at that moment?

@kingluo (Contributor, Author) commented Aug 15, 2024

It is not a workaround but a bugfix: in case of errors, we should exit the call chain and stop handling the involved TLS records, the unrolled skb, sk->sk_receive_queue, and future sk_data_ready callbacks from the kernel (that's why we need to reset, not close, the socket). Otherwise it's undefined behavior. And yes, we didn't cover the RST case (as opposed to close) in that function; that's exactly why I made the changes here.
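The guard being argued for amounts to this shape of early return. A minimal sketch, assuming a hypothetical `struct conn` with a flags field standing in for `SS_CONN_TYPE(sk) & Conn_Reset`; none of these names are the actual Tempesta types.

```c
#include <stdbool.h>

enum { CONN_RESET = 1 << 0 };  /* stand-in for the Conn_Reset bit */

struct conn {
	unsigned int flags;
	int processed;  /* count of skbs handled, for illustration */
};

/*
 * Receive path: once the connection is marked for reset, every queued
 * and future skb is dropped unprocessed, terminating the call chain.
 */
static bool conn_rx(struct conn *c, int nskbs)
{
	if (c->flags & CONN_RESET)
		return false;  /* exit immediately; nothing else is touched */
	c->processed += nskbs;
	return true;
}
```

The behavior under discussion: after the error path sets the flag, subsequent sk_data_ready-style invocations become no-ops instead of continuing to process (and potentially leak) buffered records.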

Contributor:

The new socket state used only in one function looks awkward.

Can we handle this with a real socket state? Maybe we can even avoid calling sk_data_ready by setting some socket state and/or doing a partial close/reset?

Co-authored-by: Alexander Krizhanovsky <[email protected]>
@kingluo (Contributor, Author) commented Aug 15, 2024

Why? si_wq is supposed to be a very fast lock-free RB and we wake up a target processor on insertions. What's the reason for the slowness on the read (processing) side? There is something fundamentally broken if a Python script can flood the lock-free in-kernel network processing.

The bottleneck is not the locking but the efficiency of work processing in the NET_TX softirq. Sending is clearly much slower than enqueuing, perhaps due to the TLS encryption. Another suspicious point: when we handle sending for the same socket on another CPU, we will most likely block on the sk lock, because the receiving softirq is busy handling many skbs on that sk and producing many ping ACKs, which finally overflows the queue.
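The rate imbalance described above can be modeled in a few lines. This is a toy simulation under assumed numbers, not measurements from Tempesta: a producer enqueues `prod_rate` items per tick while a budgeted consumer drains at most `drain_budget`, and we ask when the queue depth exceeds its capacity.

```c
/*
 * Returns the tick at which queue depth first exceeds `cap` (the point
 * where a push would get -EBUSY), or -1 if the consumer keeps up.
 */
static int ticks_until_overflow(int cap, int prod_rate, int drain_budget)
{
	int depth = 0, tick = 0;

	if (prod_rate <= drain_budget)
		return -1;  /* drain rate matches production: no overflow */

	for (;;) {
		tick++;
		depth += prod_rate;  /* RX softirq enqueues a burst of ACKs */
		if (depth > cap)
			return tick;  /* queue full: push would fail */
		/* budgeted TX side drains at most drain_budget per tick */
		depth -= depth < drain_budget ? depth : drain_budget;
	}
}
```

Whatever the queue capacity, any sustained gap between the enqueue rate and the per-tick drain budget makes overflow a question of when, not if, which matches the observation that even a large si_wq would eventually return -EBUSY under a ping flood.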

@krizhanovsky (Contributor):
Closed in favor of #2257



Development

Successfully merging this pull request may close these issues: Tls errors under ping flood

4 participants