proc_loadavg: Sleep in loadavg thread should not slow down reloads #679

DeyanSG · 2025-03-17T13:50:09Z

This resolves the issue with reload described in #678

Blub · 2025-03-19T07:42:49Z

The signal handler will set need_reload=1, which is a bit of a weird side effect.
It should be okay, since when we reach this code, we're either in do_reload() where that's already the case, or in main() during shutdown, but personally I don't like depending on this kind of knowledge, as it is a maintenance burden.
(I wonder if we should introduce a more specific signal there (SIGALRM is common to wake up sleeps I think, or some RT signal)

mihalicyn · 2025-03-19T12:49:09Z

Hello @DeyanSG,

first of all, thanks for your report and proposed fix!

From #678:

This led us to conclude that we missed the wake-up time for the timer started by the usleep in the loadavg thread. We calculated that it would take about 70 minutes for the timer to overflow and reach the same value again, allowing normal operations to continue.

you are referring to this part of code:

	clock_t time1, time2;

	for (;;) {
		if (loadavg_stop == 1)
			return NULL;

		time1 = clock();
<...>
		if (loadavg_stop == 1)
			return NULL;

		time2 = clock();
		usleep(FLUSH_TIME * 1000000 -
		       (int)((time2 - time1) * 1000000 / CLOCKS_PER_SEC));

right?

Please, can you test the following simple fix:

--- a/src/proc_loadavg.c
+++ b/src/proc_loadavg.c
@@ -504,6 +504,7 @@ static void *load_begin(void *arg)
        int first_node, sum;
        struct load_node *f;
        clock_t time1, time2;
+       int sleep_time;
 
        for (;;) {
                if (loadavg_stop == 1)
@@ -542,8 +543,9 @@ static void *load_begin(void *arg)
                        return NULL;
 
                time2 = clock();
-               usleep(FLUSH_TIME * 1000000 -
-                      (int)((time2 - time1) * 1000000 / CLOCKS_PER_SEC));
+               sleep_time = FLUSH_TIME - (int)((time2 - time1) / CLOCKS_PER_SEC);
+               if ((sleep_time > 0) && (sleep_time <= FLUSH_TIME))
+                       usleep(sleep_time * 1000000);
        }
 }

Kind regards,
Alex

Blub · 2025-03-19T13:18:51Z

Ah yes, catching the overflow definitely makes sense. We'd still be a bit delayed, but that's much less problematic than a near-endless sleep call and can still be addressed separately.

DeyanSG · 2025-03-20T12:14:10Z

Hi @mihalicyn and @Blub,

Thanks for looking into this.

It seems I somehow missed the overflow in the code and went on to look into the kernel code. Thanks for spotting this, @mihalicyn. We will deploy a patched version today that includes the suggested fix and some additional logging in case we skip the sleep, so we can see all the values.

The issue is tricky to reproduce, as we do not trigger these live migrations. They are part of some GCE maintenance, and so far, the issue during a migration seems to occur rarely. Nevertheless, we will do our best to test and see if we can determine whether this patch will solve the issue.

@Blub, I tested with SIGALRM and SIGVTALRM. However, sending these to the thread seems to kill the whole process. I am also not sure how good of an idea it is to set a custom handler for these signals. I also tested with SIGUSR2, and even without installing a handler for this signal, it seems to work correctly. Let me know if you are keen on merging this with SIGUSR2 (or another signal), or if we will stick to just the solution provided by @mihalicyn if it works.

Regards,
Deyan

Blub · 2025-03-20T12:52:47Z

SIGUSR2 is already used up by existing lxcfs code.

I think installing a handler for SIGALRM should be fine, we can choose ourselves how such signals should be dealt with, and SIGALRM is commonly for timeouts (eg. using the alarm(2) call which is meant to interrupt (without killing the process). (Eg. there are no file-locking methods with a timeout, so the only way to try such locks with a time out is to set a timer or alarm or emulate such a thing via threads+sleep)).
If we ever need that behavior we'd want to use timers (timer_create(2)) instead of alarm(2) anyway, as with timers you can target specific threads.

The thread might be sleeping. Make sure to interrupt so we can reload faster. Signed-off-by: Deyan Doychev <[email protected]>

DeyanSG · 2025-03-21T13:23:33Z

Hi @Blub ,

I've modified it to use SIGALRM as suggested and did a quick test. It seems to work fine this way.

Feel free to suggest more improvements.

Regards,
Deyan

DeyanSG force-pushed the pthread-kill branch from 3d5fc6a to 50408e7 Compare March 17, 2025 14:00

mihalicyn self-requested a review March 18, 2025 14:20

DeyanSG force-pushed the pthread-kill branch from 50408e7 to 8365920 Compare March 21, 2025 13:15

proc_loadavg: Sleep in loadavg thread should not slow down reloads

85802c0

The thread might be sleeping. Make sure to interrupt so we can reload faster. Signed-off-by: Deyan Doychev <[email protected]>

DeyanSG force-pushed the pthread-kill branch from 8365920 to 85802c0 Compare March 21, 2025 13:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proc_loadavg: Sleep in loadavg thread should not slow down reloads #679

proc_loadavg: Sleep in loadavg thread should not slow down reloads #679

DeyanSG commented Mar 17, 2025

Blub commented Mar 19, 2025

mihalicyn commented Mar 19, 2025

Blub commented Mar 19, 2025

DeyanSG commented Mar 20, 2025

Blub commented Mar 20, 2025

DeyanSG commented Mar 21, 2025

proc_loadavg: Sleep in loadavg thread should not slow down reloads #679

Are you sure you want to change the base?

proc_loadavg: Sleep in loadavg thread should not slow down reloads #679

Conversation

DeyanSG commented Mar 17, 2025

Blub commented Mar 19, 2025

mihalicyn commented Mar 19, 2025

Blub commented Mar 19, 2025

DeyanSG commented Mar 20, 2025

Blub commented Mar 20, 2025

DeyanSG commented Mar 21, 2025