Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

separating error correction for each fastq then combining at the end #756

Open
johnathanlo opened this issue Jan 10, 2025 · 0 comments
Open

Comments

@johnathanlo
Copy link

Hi there,

We are running into some memory issues while running hifiasm, probably due to the large amount of data. Our genome is around 2.2 Gb sequenced to about 150x coverage on 3 cells. This means there are 3 fastq files around 220 GB each. Unfortunately we are mostly limited to compute nodes with < 500 GB memory and the memory usage for the error correction step seems to be well in excess of 500 GB.

FYI, read quality looks good:
linear_plot

Would it be possible to separately generate error corrected reads for each batch of raw reads and then recombine them for assembly? Would you advise against this?

Thanks,
John

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant