Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting highly imbalanced haplotypes using trio-binning #776

Open
amara86 opened this issue Feb 10, 2025 · 1 comment
Open

Getting highly imbalanced haplotypes using trio-binning #776

amara86 opened this issue Feb 10, 2025 · 1 comment

Comments

@amara86
Copy link

amara86 commented Feb 10, 2025

Hi
I am attempting to generate trio-binned assemblies using Hifiasm and have followed this workflow:

Processed Illumina HiSeq short reads from Parent 1 using fastp for adapter removal and quality filtering (Total: 304,817,131 bp).
Processed 10X Genomics linked-read data from Parent 2 using fastp (Total: 88,641,287 bp).
Collected HiFi reads from an F1 hybrid offspring for assembly.
To perform trio-binning, I generated parental k-mers using yak, but the number of unique k-mers was highly imbalanced:

Parent 1: 34,008,379 k-mers
Parent 2: 7,669,877 k-mers
This imbalance resulted in highly uneven haplotype assemblies:

One haplotype: ~948MB
Other haplotype: ~5MB
To address this, I tried generating k-mers using KMC, which yielded more balanced unique k-mers:

Parent 1: 308,155,996 k-mers
Parent 2: 280,131,731 k-mers
However, despite this, the assembled haplotypes remain highly imbalanced.

Questions:

  1. Is there a way to normalize k-mer counts in yak to achieve a more balanced distribution?
  2. Can we convert k-mer generated by (KMC in txt format) to yak format?
    Regards,
    Amara
@chhylp123
Copy link
Owner

Hifiasm accepts -3 and -4 if you can partition reads by yourself. If you can do it with KMC output, that might work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants