Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Version 0.15 / bp.p_ctg.gfa versus bp.hap?.p_ctg.gfa N50 #106

Open
chklopp opened this issue Apr 27, 2021 · 2 comments
Open

Version 0.15 / bp.p_ctg.gfa versus bp.hap?.p_ctg.gfa N50 #106

chklopp opened this issue Apr 27, 2021 · 2 comments

Comments

@chklopp
Copy link

chklopp commented Apr 27, 2021

Hi,

For several assemblies we found better N50 metrics for bp.p_ctg.gfa than both bp.hap?.p_ctg.gfa.
Why are why losing continuity when separating haplotypes?
Is it pertinent to scaffold bp.hap?.p_ctg.gfa with bp.p_ctg.gfa?

@chhylp123
Copy link
Owner

bp.hap?.p_ctg.gfa dose not join different haplotigs while bp.p_ctg.gfa joins some of them. Therefore bp.p_ctg.gfa usually has higher continuity at the expense of much fragmented a_ctg.gfa. We think fragmented a_ctg.gfa is not such useful so that half of information might be lost. Another difference is that bp.p_ctg.gfa tends to keep sex chromosomes at once. Just curious: how do you scaffold bp.hap?.p_ctg.gfa with bp.p_ctg.gfa?

By the way, version 0.15 has some minor bugs so that it's better to re-output assemblies with v0.15.1 or current github HEAD.

@chklopp
Copy link
Author

chklopp commented Apr 27, 2021

The easiest way is to align bp.hap?.p_ctg.gfa.fa on bp.p_ctg.gfa.fa with dgenies (http://dgenies.toulouse.inra.fr) and export the query organized as the reference.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants