
device: Allow buffer memory growth to be limited at run time #69

Open
gitlankford wants to merge 3 commits into master

Conversation

gitlankford

The infinite memory growth allowed by the default PreallocatedBuffersPerPool setting causes processes to be OOM-killed on low-memory devices, even when a soft limit is set with GOMEMLIMIT. Specifically, running Tailscale on a Linux device (OpenWrt, MIPS, 128 MB RAM) under heavy load will exhaust all memory and get the process OOM-killed. Allowing this value to be overridden, as is already done in the iOS build, makes it possible to cap memory growth and prevent the OOM kill.

See the Tailscale issue thread for further info:
tailscale/tailscale#7272

Signed-off-by: Seth Lankford <[email protected]>
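
For illustration, a minimal sketch of how a caller could apply the cap, assuming this change lands and device.PreallocatedBuffersPerPool becomes a settable package-level variable (the value 512 is an arbitrary example, not a recommendation):

package main

import "golang.zx2c4.com/wireguard/device"

func main() {
	// Cap each buffer pool at 512 preallocated buffers before any
	// device is created; the current default of 0 leaves growth unbounded.
	device.PreallocatedBuffersPerPool = 512

	// ... set up the TUN device and bind, then call device.NewDevice as usual ...
}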
@gitlankford
Author

Thank you for your patience.

I'm sure there are other ways to solve this issue (specific low-memory builds, etc.).
This method follows the prior art of the iOS build.
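
For reference, that prior art is a build-tagged constants file; a paraphrased sketch of wireguard-go's device/queueconstants_ios.go (not the exact upstream file) looks roughly like this:

//go:build ios

package device

// The iOS build pins the pool size instead of allowing unbounded growth,
// whereas the default build sets this to 0 (grow without limit).
const PreallocatedBuffersPerPool = 1024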

@zx2c4
Member

zx2c4 commented Mar 3, 2023

Not sure it's a good idea to add random knobs like this for third parties to twiddle and trip over. I wonder if there's some heuristic that could be used instead, which would always work and dynamically scale accordingly? ratelimiter.c in the kernel does this, for example, with something pretty kludgy but it does work:

int wg_ratelimiter_init(void)
{
        mutex_lock(&init_lock);
        if (++init_refcnt != 1)
                goto out;

        entry_cache = KMEM_CACHE(ratelimiter_entry, 0);
        if (!entry_cache)
                goto err;

        /* xt_hashlimit.c uses a slightly different algorithm for ratelimiting,
         * but what it shares in common is that it uses a massive hashtable. So,
         * we borrow their wisdom about good table sizes on different systems
         * dependent on RAM. This calculation here comes from there.
         */
        table_size = (totalram_pages() > (1U << 30) / PAGE_SIZE) ? 8192 :
                max_t(unsigned long, 16, roundup_pow_of_two(
                        (totalram_pages() << PAGE_SHIFT) /
                        (1U << 14) / sizeof(struct hlist_head)));
        max_entries = table_size * 8;

        table_v4 = kvcalloc(table_size, sizeof(*table_v4), GFP_KERNEL);
        if (unlikely(!table_v4))
                goto err_kmemcache;

#if IS_ENABLED(CONFIG_IPV6)
        table_v6 = kvcalloc(table_size, sizeof(*table_v6), GFP_KERNEL);
        if (unlikely(!table_v6)) {
                kvfree(table_v4);
                goto err_kmemcache;
        }
#endif

        queue_delayed_work(system_power_efficient_wq, &gc_work, HZ);
        get_random_bytes(&key, sizeof(key));
out:
        mutex_unlock(&init_lock);
        return 0;

err_kmemcache:
        kmem_cache_destroy(entry_cache);
err:
        --init_refcnt;
        mutex_unlock(&init_lock);
        return -ENOMEM;
}
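
For comparison, here is a rough Go sketch of that same heuristic; the function name and the idea of feeding it total RAM (read from, say, /proc/meminfo) are assumptions for illustration, not existing wireguard-go API:

package main

import "fmt"

// scaledBuffersPerPool mirrors the kernel calculation above: machines
// with more than 1 GiB of RAM get a fixed cap, everything else gets a
// power of two scaled to total memory, with a floor of 16.
func scaledBuffersPerPool(totalRAMBytes uint64) uint64 {
	if totalRAMBytes > 1<<30 {
		return 8192
	}
	n := totalRAMBytes / (1 << 14) / 8 // 8 ~ sizeof(struct hlist_head) on 64-bit
	if n < 16 {
		n = 16
	}
	p := uint64(1) // round up to the next power of two
	for p < n {
		p <<= 1
	}
	return p
}

func main() {
	// A 128 MiB router like the one in this PR would get a cap of 1024.
	fmt.Println(scaledBuffersPerPool(128 << 20))
}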

@gitlankford
Author

Thanks @zx2c4 - I am happy to experiment with that sort of dynamic scaling, but am less confident I can do it up to your standards given my limited Go experience. Perhaps that could be a future workstream?

In my experiments with different values, I found that when PreallocatedBuffersPerPool was too high, the GC would work overtime until the process was OOM-killed; when it was "just right", the GC would only run every 2 minutes as part of the forced GC cycle (an indirect measurement of the memory situation).
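
In case it's useful, a tiny sketch of taking that indirect measurement from inside the process (runtime.ReadMemStats is standard library; the 30-second polling interval is arbitrary, and GODEBUG=gctrace=1 gives similar information without code changes):

package main

import (
	"log"
	"runtime"
	"time"
)

func main() {
	var m runtime.MemStats
	for range time.Tick(30 * time.Second) {
		runtime.ReadMemStats(&m)
		// NumGC climbing much faster than the ~2-minute forced-GC
		// cadence suggests the pools are pushing the heap hard.
		log.Printf("GC cycles: %d, heap in use: %d KiB", m.NumGC, m.HeapInuse/1024)
	}
}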

The other thing to note is that the router I'm running this on is dedicated to this VPN, so memory usage is otherwise extremely stable. That is probably the most common setup for devices of this size (also: no swap).

@qdm12

qdm12 commented May 10, 2024

Any news on this? It looks like someone else has hit the same issue over here: qdm12/gluetun#2036
