-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Understanding memory usage and performance of furrr::future_apply
#260
Comments
Can you also post the code you used to do the memory profiling? |
Sure, here it is:
|
@DavisVaughan any news on this? |
I'm having the same issue. |
Same for me as well. I also recently several gigabytes of temp files had been created and never cleaned up and parallelized functions do not complete as quickly as they used to. I've been using furrr for years with excellent performance, this is unusual. It feels like something else changed in the R ecosystem that's impacting furrr. I may have to switch to Crew (powered by mirai). It's a shame because nothing comes close to furrr in terms of syntactic sugar and ease of use. |
I am facing some issues parallelizing processes with
furrr::future_apply
.This is the setting I am having issues with:
When I profile memory and time for these 4 plans this is what I get:
I have launched 4 different jobs from R studio server, while I was profiling all memory used for processes with my user in a separate job to get data for the graph.
This is the outpu of my
sessionInfo())
of the parallelization jobs:Is this behavior normal? I did not expected the steep increase in memory for all the plans, other than the increase in time when I increase the number of workers.
I also tested the
sys.sleep(1)
function in parallel, and I got the result I expected, time decreases as I increase workers.What I am trying to parallelize is far more complex than this, i.e. a series of nested wrapped functions that do some training for some time series models and inference writing a csv and not returning anything.
I fill like I am losing something very simple but yet I cannot wrap my head around it, what concerns me the most is the memory increase, as it would be a very memory intensive function.
The text was updated successfully, but these errors were encountered: