Adapter Trimming on PE reads have different number of reads output #186

Open
bshim181 opened this issue Mar 6, 2024 · 1 comment

bshim181 commented Mar 6, 2024

Hello,

I used Trim Galore to trim Illumina adapters from my paired-end (PE) sequencing reads with this command:

trim_galore -o output --fastqc --paired $R1_file $R2_file

When I ran the output FASTQ files through BBMerge for insert-length analysis,
I received this error message:
"There appear to be different numbers of reads in the paired input files.
The pairing may have been corrupted by an upstream process."

When I checked the length of the Trim Galore outputs for one of the samples, R1 and R2 differed. Would I have to specify --retain_unpaired for these to match up? I assumed that running Trim Galore with --paired would keep the read pairs intact.

314343816 Sample1_R1_001_val_1.fq
299531972 Sample1_R2_001_val_2.fq

@FelixKrueger
Owner

This is unusual, I would say. Trim Galore has a built-in check that makes it die (i.e. terminate) if the R1 and R2 input files are truncated, and the two reads of a pair are always handled together. There is no need to specify anything else.

Is there a chance that the files got corrupted during copying or the like? Are they gzip compressed, or are you showing the number of lines in the example above? Do you still have the log file?
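
To narrow down where the pairing breaks, one option is to count the reads in both output files and compare read IDs record by record. The following is only a minimal sketch and not part of Trim Galore or BBMerge: the helper names (`open_fastq`, `read_ids`, `check_pairing`) are hypothetical, and it assumes plain 4-line FASTQ records, optionally gzip compressed, with an optional `/1` or `/2` suffix on the read IDs.

```python
import gzip
from itertools import zip_longest


def open_fastq(path):
    """Open a FASTQ file as text, detecting gzip by its magic bytes."""
    with open(path, "rb") as fh:
        is_gzip = fh.read(2) == b"\x1f\x8b"
    return gzip.open(path, "rt") if is_gzip else open(path, "rt")


def read_ids(path):
    """Yield the read ID from every 4th line (assumes 4-line records)."""
    with open_fastq(path) as fh:
        for line_no, line in enumerate(fh):
            if line_no % 4 == 0:
                fields = line[1:].split()  # drop '@', keep text before whitespace
                yield fields[0] if fields else ""


def strip_mate_suffix(read_id):
    """Remove a trailing /1 or /2 so mates can be compared directly."""
    if read_id.endswith(("/1", "/2")):
        return read_id[:-2]
    return read_id


def check_pairing(r1_path, r2_path):
    """Return (reads_in_r1, reads_in_r2, index_of_first_mismatch_or_None)."""
    n1 = n2 = 0
    first_mismatch = None
    pairs = zip_longest(read_ids(r1_path), read_ids(r2_path))
    for i, (id1, id2) in enumerate(pairs):
        if id1 is not None:
            n1 += 1
        if id2 is not None:
            n2 += 1
        if first_mismatch is None and (
            id1 is None
            or id2 is None
            or strip_mate_suffix(id1) != strip_mate_suffix(id2)
        ):
            first_mismatch = i  # 0-based record index where pairing breaks
    return n1, n2, first_mismatch
```

Calling `check_pairing("Sample1_R1_001_val_1.fq", "Sample1_R2_001_val_2.fq")` would return the two read counts and the 0-based index of the first record at which the files disagree, or `None` if they agree throughout. If the two figures quoted in the original post are line counts (an assumption; the post does not say), dividing by 4 gives 78,585,954 reads in R1 and 74,882,993 in R2.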
