-
Notifications
You must be signed in to change notification settings - Fork 17
Store intermediate data in a directory named after the compute ID #413
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Saving temporary data from individual executions into different directories would be useful for benchmarking. This requires the ops to know both the Currently the |
I think it's worse than that - the intermediate directories paths are created (not on the filesystem, just as strings) as the array functions are called, but before the computation is run - and therefore before the I think the easier way to solve the original problem in cubed-dev/cubed-benchmarks#10 would be to just get the intermediate array paths from the DAG. |
Thinking about this more, it would be possible to change |
So should we perhaps instead create only the known part of the directory path (i.e. the "prefixes") during plan construction time, and then join the
So then the |
Oh I didn't see your comment when I wrote mine - I think we're suggesting basically the same thing. I agree this is probably overkill to get cubed-dev/cubed-benchmarks#10 working, but I do think being able to distinguish different run directories might be useful in other contexts (e.g. perhaps an external tool whose job is to periodically purge temporary data from older runs). |
Yes, that would be useful. |
Originally posted by @TomNicholas in cubed-dev/cubed-benchmarks#10 (comment)
The text was updated successfully, but these errors were encountered: