-
Notifications
You must be signed in to change notification settings - Fork 150
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
valX files vs trimmed files? diff output same code? #162
Comments
If you still have files called As a side note, if this trimming is for methylation alignments, I would recommend the trimming setting described here: http://felixkrueger.github.io/Bismark/bismark/library_types/#em-seq-neb |
Related questions specific to EM-Seq:
Thanks. |
You don't necessarily have to use Trim Galore, but yes some trimming is recommended. the nf-core/methylseq pipeline has an EM-seq switch which should work equally: |
I think this still uses Trim Galore under the hood
|
Hi,
I want to trim EM-SEQ fastq files.
I used the same code, first for a single pair, and then for a batch.
The code for the first pair was:
trim_galore --2colour 20 --illumina -o trim --paired V00001_R1.fastq.gz V00001_R2.fastq.gz
and the output was:
V00001_R1_val_1.fq.gz
V00001_R2_val_2.fq.gz
The summary stated trimming mode - paired end:
SUMMARISING RUN PARAMETERS
Input filename: V00001_R1.fastq.gz
Trimming mode: paired-end
Trim Galore version: 0.6.10
Cutadapt version: 1.18
Number of cores used for trimming: 1
Quality encoding type selected: ASCII+33
Adapter sequence: 'AGATCGGAAGAGC' (Illumina TruSeq, Sanger iPCR; user defined)
2-colour high quality G-trimming enabled, with quality cutoff: --nextseq-trim=20
Maximum trimming error rate: 0.1 (default)
Minimum required adapter overlap (stringency): 1 bp
Minimum required sequence length for both reads before a sequence pair gets removed: 20 bp
Output file will be GZIP compressed
Then, for a second pair I used the code:
trim_galore --2colour 20 --illumina --output_dir=trim -j 4 --paired V00021_R1.fastq.gz V00021_R2.fastq.gz
The output files were:
V00021_R1_trimmed.fq.gz
V00021_R2_trimmed.fq.gz
And the summary:
SUMMARISING RUN PARAMETERS
Input filename: V00021_R1.fastq.gz
Trimming mode: paired-end
Trim Galore version: 0.6.10
Cutadapt version: 1.18
Python version: could not detect
Number of cores used for trimming: 4
Quality encoding type selected: ASCII+33
Adapter sequence: 'AGATCGGAAGAGC' (Illumina TruSeq, Sanger iPCR; user defined)
2-colour high quality G-trimming enabled, with quality cutoff: --nextseq-trim=20
Maximum trimming error rate: 0.1 (default)
Minimum required adapter overlap (stringency): 1 bp
Minimum required sequence length for both reads before a sequence pair gets removed: 20 bp
Output file will be GZIP compressed
Why the first pair had the prefix val* while the second just trimmed?
Is there something in the code that I didn't know or was it an effect of using multithreaded mode?
Thanks;
The text was updated successfully, but these errors were encountered: