Generating datasets and running the pipeline #6

Open
sidgairo18 opened this issue Nov 2, 2024 · 0 comments
Hi @KupynOrest,

Thanks for the work.

I have a question about generating the datasets. From the pipeline, it seems that each sample is generated individually (sequentially) and that the images cannot be batched (I might be wrong about this). Each new augmentation can take a few minutes to generate, even on a GPU.

How can this be parallelised so that datasets are generated faster? Otherwise, generating even ~10k images can take over a day, or even a few days.
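
To make the question concrete, one generic way to parallelise per-sample work like this, assuming the samples are independent of each other, is to shard the input list across separate processes and run one process per GPU. The sketch below is not taken from this repository: `generate_one`, the `--shard-id` and `--num-shards` flags, and the script name are hypothetical placeholders.

```python
import argparse


def shard(items, num_shards, shard_id):
    """Return the round-robin slice of `items` owned by shard `shard_id`."""
    if not 0 <= shard_id < num_shards:
        raise ValueError("shard_id must be in [0, num_shards)")
    # Every num_shards-th item, starting at shard_id; shards never overlap.
    return items[shard_id::num_shards]


def generate_one(sample_id):
    # Hypothetical placeholder for the real per-sample pipeline call.
    return f"augmented_{sample_id}"


def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--shard-id", type=int, required=True)
    parser.add_argument("--num-shards", type=int, required=True)
    parser.add_argument("--num-samples", type=int, default=10)
    args = parser.parse_args()

    all_ids = list(range(args.num_samples))
    # Each process only touches its own slice of the sample ids.
    for sample_id in shard(all_ids, args.num_shards, args.shard_id):
        print(generate_one(sample_id))


if __name__ == "__main__":
    main()
```

Each process would then be launched separately and pinned to its own GPU, for example `CUDA_VISIBLE_DEVICES=0 python generate_shard.py --shard-id 0 --num-shards 4`, and so on for shards 1 to 3. This does not speed up a single sample; it only divides the wall-clock time by roughly the number of GPUs available.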

Would be grateful for your response.
