-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incomplete result from minimap2-aligned bam files #395
Comments
Additionally, the UCSC reference genome (hg38.fa) and annotation file (hg38.knownGene.gtf) were used in the code. |
Hi Kenan, Thanks for reaching out. I am not yet too sure what could be causing this, as the aligner theoretically should not impact the output format of Bambu. Could you please share the lines of code you used to generate these two screenshots so I can get a better picture of what might be happening. You can rename all sample files if you need to maintain privacy. |
Hi Andre, #bambu project |
Hi Kenen, Thanks for this, I noticed in the first screen shot you use mcols(newAnnotations) which isn't a variable you included in the code you posted, whereas the second is of ont_iso_analysis. Could I ask where newAnnotations comes from. Kind Regards, |
Thanks Andre! Best, |
Hi Kenen, I tried some more tests on my side and havn't figured out how this could occur. Perhaps you could sample your two bam files down to an easily transferable size, check the issue still occurs, and if so and it is not sensitive data, upload them here and I can try run them to see if I can replicate the issue. This will make it a lot easier for me to troubleshoot. Kind Regards, |
Oh I think I see the issue
Let me know how it goes. |
Hi Kenan, Thanks for checking that. The warning number 3 is the important one here. Do you expect no spliced reads in your dataset? Given this is a standard tumor tissue in human I assume that is unlikely. Common causes for this is if the alignment was done without splice aware mode. Minimap2 is recommended to be run like this ./minimap2 -ax splice -uf -k14 ref.fa reads.fa > aln.sam for nanopore reads. Alternatively alignments need to be done to the genome and not the transcriptome, which would also cause unspliced reads, however given you said the scaffold names are "chr1" this is less likely. Regarding point 1. NDR = 1 and quant = FALSE are very different. Setting quant to false stops Bambu from quantifying the output and will just return the filtered annotations (Filtered by NDR value). NDR = 1 runs the discovery step at its most sensitive meaning it will return all annotations regardless of their NDR value. If it doesn't trouble you, could you let me know what part of the manual this was, so that I can adjust it so it is clearer for future users! Let me know if the alignment was the issue or if you do only expect un-spliced reads. |
Thanks, Andre. |
Hello, I've been testing bambu on a dataset i am interested in and also get the warning ""The current filtering criteria filters out all new read classes, please consider less stringent criteria" I'm running it as follows
and get the following warnings/errors
The system I'm working in has predominantly single exon transcripts and so I am wondering if I need to configure bambu differently to deal with this? |
Hi, Yes by default Bambu excludes all single exon transcript. Please see this section of our documentation https://github.com/GoekeLab/bambu#Including-single-exons. You need to add this parameter to prevent them being filtered out. Kind Regards, |
That is super helpful. thank you very much! |
Hi Zhang, Since we have not heard back I hope this solved you issue and will close it for now. If it hasn't let us know please write back with a detailed description of what is occurring and preferentially a reproducible example and we can reopen it. |
Hi Andre, |
Ok sorry to hear about the killed jobs, if you are still having issues once it finishes I can reopen this. |
Hi Andre, Great thanks for your assistance, and thanks again for the brilliant software. |
Glad to hear everything is resolved! I hope you get some nice results! |
Hi,
I have been using bambu lately for novel isoform identification from long-read RNA-seq data these days. And it is efficient and useful.
However, an issue happened when performing novel isoform identification from minimap2 aligned bam files. The metadata only included 4 columns, and the novelTranscript information is lacking. Hence, the novel isoforms can not be identified.
But running bambu from STAR-aligned bam files could be well performed.
I hope I have described this issue clearly. Any hint on how to solve the issue is very appreciated!
Best,
Kenan
The text was updated successfully, but these errors were encountered: