-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
req_body_file leaks file handles #534
Comments
Are you using the development version? I can't reproduce the problem: library(httr2)
path <- tempfile()
writeLines("Hello", path)
showConnections()
#> description class mode text isopen can read can write
#> 3 "output" "textConnection" "wr" "text" "opened" "no" "yes"
resp <- request("https://example.com/") |>
req_body_file(path) |>
req_perform()
showConnections()
#> description class mode text isopen can read can write
#> 3 "output" "textConnection" "wr" "text" "opened" "no" "yes" Created on 2024-09-04 with reprex v2.1.0 |
Yeah, same happens using the latest httr2 Git commit. It might have to do with the curl version. I found this commit, introduced in curl 8.7.0 (ie. March 2024): curl/curl@9369c30 which refactors a lot of the file transfer code. In particular, it adds a piece of code that caps the /* respect length limitations */
if(ctx->total_len >= 0) {
curl_off_t remain = ctx->total_len - ctx->read_len;
if(remain <= 0)
blen = 0;
else if(remain < (curl_off_t)blen)
blen = (size_t)remain;
}
nread = 0;
if(data->state.fread_func && blen) {
Curl_set_in_callback(data, true);
nread = data->state.fread_func(buf, 1, blen, data->state.in);
Curl_set_in_callback(data, false);
ctx->has_used_cb = TRUE;
} I don't see any equivalent code in the removed lines, so I guess the behaviour has changed slightly, and where it used to pass the size of the full outgoing buffer, it now only passes the size of how many bytes are actually needed. For reference, this is my curl version and sessionInfo:
|
Can you please send me the results of my reprex run on your computer? |
So we can ensure that open connections are always closed. Fixes #534.
Reprex below. I've had to run reprex with
library(httr2)
path <- tempfile()
writeLines("Hello", path)
showConnections()
#> description class mode text isopen can read can write
#> 3 "output" "textConnection" "wr" "text" "opened" "no" "yes"
resp <- request("https://example.com/") |>
req_body_file(path) |>
req_perform()
showConnections()
#> description class mode text isopen can read can write
#> 3 "output" "textConnection" "wr" "text" "opened" "no" "yes" Created on 2024-09-04 with reprex v2.1.0 Standard output and standard errorWarning message:
In .Internal(gc(verbose, reset, full)) :
closing unused connection 4 (/tmp/nix-shell.Of66Tb/RtmpPnh7Wt/filef07445c59a7d) I can run a slightly different reprex, which uses library(httr2)
path <- tempfile()
writeLines("Hello", path)
sapply(getAllConnections(), \(x) summary.connection(x)$description)
#> [1] "stdin" "stdout" "stderr" "output"
resp <- request("https://example.com/") |>
req_body_file(path) |>
req_perform()
sapply(getAllConnections(), \(x) summary.connection(x)$description)
#> [1] "stdin"
#> [2] "stdout"
#> [3] "stderr"
#> [4] "output"
#> [5] "/tmp/nix-shell.Of66Tb/Rtmp7pXUfP/filefd4a7ce78bdc"
gc()
#> used (Mb) gc trigger (Mb) max used (Mb)
#> Ncells 707211 37.8 1429215 76.4 835805 44.7
#> Vcells 1282302 9.8 8388608 64.0 2003164 15.3
sapply(getAllConnections(), \(x) summary.connection(x)$description)
#> [1] "stdin" "stdout" "stderr" "output" Created on 2024-09-04 with reprex v2.1.0 Standard output and standard errorWarning message:
In .Internal(gc(verbose, reset, full)) :
closing unused connection 4 (/tmp/nix-shell.Of66Tb/Rtmp7pXUfP/filefd4a7ce78bdc) |
Thanks! I hacked together a quick fix in #542. I need to think about it a bit more to make sure it's the right approach, but I think it should solve your problem (and is a safer approach in general). |
Thanks for the quick resolution! I can confirm that the |
So we can ensure that open connections are always closed. Fixes #534.
When using
httr2::req_body_file
,httr2
opens a connection to the file being uploaded but never closes it. Eventually the handle gets closed by the garbage collector, but a warning gets printed onto the console.Looking at the source, I see some code that is supposed to close the handle on a short read:
httr2/R/req-body.R
Lines 241 to 246 in 06e9133
However from what I can tell by adding print statements, libcurl never tries to read more than necessary, so short reads never happen, even on a successful file transfer. For small enough files,
nbytes
is always just equal tosize
, and tolength(out)
One possible solution would be to accumulate the number of bytes read so far (the sum of all the
length(out)
), and when that equalssize
then close the file. This would work in the happy case where the connection is not closed early.I think a more foolproof solution would be to record the connection handle in the request object, and have
req_perform
close it when the request is complete.Example reproduction:
The call to
gc()
is used to force the GC to run, but even without it the warning message gets printed eventually.The text was updated successfully, but these errors were encountered: