1- Metadata-Version: 2.2
1+ Metadata-Version: 2.4
22Name: REDItools3
3- Version: 3.3
3+ Version: 3.5
44Author: Ernesto Picardi
55Author-email: Adam Handen <adam.handen@gmail.com>
6+ License-Expression: GPL-3.0-or-later
67Project-URL: homepage, https://github.com/BioinfoUNIBA/REDItools3
78Project-URL: repository, https://github.com/BioinfoUNIBA/REDItools3
89Project-URL: issues, https://github.com/BioinfoUNIBA/REDItools3/issues
910Keywords: bioinformatics,RNA,RNA-editing
1011Classifier: Development Status :: 5 - Production/Stable
1112Classifier: Intended Audience :: Developers
1213Classifier: Intended Audience :: Science/Research
13- Classifier: License :: OSI Approved :: GNU General Public License (GPL)
1414Classifier: Operating System :: MacOS :: MacOS X
1515Classifier: Operating System :: Unix
1616Classifier: Programming Language :: Python :: 3.7
@@ -20,6 +20,7 @@ Description-Content-Type: text/markdown
2020License-File: LICENSE
2121Requires-Dist: pysam>=0.22.0
2222Requires-Dist: sortedcontainers>=2.4.0
23+ Dynamic: license-file
2324
2425# REDItools3
2526A new REDItools implementation to speed-up the RNA editing profiling in massive RNAseq data
@@ -34,3 +35,65 @@ Use the whl file under the dist directory.
3435# Usage
3536Once installed, reditools can be run from the commandline.
3637`python -m reditools`
38+
39+ ## Tools
40+
41+ ### analyze
42+ This is the core reditools function: detecting editing events from one or more BAM file.
43+
44+ The output is a tab separated table with these columns:
45+ | Field | Description |
46+ | --- | --- |
47+ | Region | Chromosome or contig |
48+ | Position | Position in the region |
49+ | Reference | Base from the reference sequence |
50+ | Strand | DNA strand (+, -, or \*) |
51+ | Coverage-q30 | How many reads had a quality of at least 30 |
52+ | MeanQ | Mean read quality |
53+ | BaseCount[A,C,G,T] | Total count of each base found |
54+ | AllSubs | All the detected substitutions |
55+ | Frequency | Ratio of non-reference bases to reference bases |
56+ | gCoverage-q30 | Genomic Coverage-q30 (see `annotate`) |
57+ | gMeanQ | Genomic MeanQ (see `annotate`) |
58+ | gBaseCount[A,C,G,T] | Genomic BaseCount (see `annotate`) |
59+ | gAllSubs | Genomic variants (see `annotate`) |
60+ | gFrequency | Genomic variant frequency (see `annotate`) |
61+
62+ The last 5 columns will always be blank (`-`). They are reserved for output
63+ from the `annotate` tool.
64+
65+ ### annotate
66+ Annotate RNA editing output with variant detection from genomic data.
67+
68+ `annotate` takes two reditools output files and fills in the last five columns
69+ of the first file with positional matches from the second.
70+
71+ For example, this RNA file:
72+ ```
73+ Region Position Reference Strand Coverage-q30 MeanQ BaseCount[A,C,G,T] AllSubs Frequency gCoverage-q30 gMeanQ gBaseCount[A,C,G,T] gAllSubs gFrequency
74+ chr1 1115715 C * 2 38.00 [0, 2, 0, 0] - 0.00 - - - - -
75+ chr1 1115716 A * 2 38.00 [2, 0, 0, 0] - 0.00 - - - - -
76+ ```
77+
78+ With this DNA file:
79+ ```
80+ Region Position Reference Strand Coverage-q30 MeanQ BaseCount[A,C,G,T] AllSubs Frequency gCoverage-q30 gMeanQ gBaseCount[A,C,G,T] gAllSubs gFrequency
81+ chr1 1115716 A * 2 38.00 [2, 0, 0, 0] - 0.00 - - - - -
82+ chr1 1115717 C * 2 38.00 [0, 2, 0, 0] - 0.00 - - - - -
83+ ```
84+
85+ Produces:
86+ ```
87+ Region Position Reference Strand Coverage-q30 MeanQ BaseCount[A,C,G,T] AllSubs Frequency gCoverage-q30 gMeanQ gBaseCount[A,C,G,T] gAllSubs gFrequency
88+ chr1 1115715 C * 2 38.00 [0, 2, 0, 0] - 0.00 - - - - -
89+ chr1 1115716 A * 2 38.00 [2, 0, 0, 0] - 0.00 2 38.00 [2, 0, 0, 0] - 0.00
90+ ```
91+
92+ ### find-repeats
93+ Identify repetitive elements in a FASTQ file.
94+
95+ ### index
96+ Compute RNA editing index from reditools `analyze` output
97+ ([PMDI: 31636457](https://pubmed.ncbi.nlm.nih.gov/31636457/)).
98+ The `index` tool computes the editing indices for all possible variants, not
99+ just A-to-I (listed as A-G in the output).
0 commit comments