Releases: KennaLab/rvat
Releases · KennaLab/rvat
v.0.3.3
v.0.3.1
- Includes new mutationPlot function to create mutation plots that visualize variant-level and gene-level association results along a transcript structure. Optionally, custom tracks such as protein domains or mutation clusters can be overlaid.
- The mutationPlot function is demonstrated in the updated spatial clustering tutorial.
- Includes a new method getRanges for
varSetListandvarSetFilethat retrieves genomic ranges for variant sets. Can be used to retrieve either genomic coordinates or other user-defined coordinates such as CDS coordinates genearated with themapCDSmethod.
v.0.3.0
Large update! In particular, changes have been made to:
- Documentation and tutorials
- All methods/classes/functions now contain examples in their documentation.
- New tutorials are included on variant annotation, gene set analysis and spatial clustering.
Existing workflows have been updated with new features/functions .
- Association testing
- RVAT results now include a
ORfield indicating the odds-ratio for logistic tests (firth and glm). Importantly: For firth and flm testseffect/effectSE/effectCIlower/effectCIupperare now on the log-scale! - Effect alleles (i.e. the allele to which the effect estimate refers) and non-effect alleles are now included in the results of single variant tests.
- RVAT results now include a
- Metadata
- GDB now includes additional metadata, including a unique identifier, the genome build and creation date and can be retrieved using corresponding getters (e.g.
getGenomeBuildandgetGdbId). The genome build can be set when building the gdb (genomeBuildparameter), and will then be used correctly assign ploidies on the sex-chromosomes in downstream analyses (so no need to set thecheckPloidyparameter anymore, although this is still an option). The gdb identifier can be useful to track which gdb was used to generate results, as the id will be included in the metadata of downstream files such as results files and varSetFiles (see below). Also, it is used methods such asassocTestandsummariseGenoto check whether supplied varSetFile/varSetList files were generated from the specified gdb. - Files generated using RVAT, such as rvatResult and varSetFiles, now include a header with metadata, including the RVAT version used to generate the file, the gdb identifier, the genome build and the creation date. This metadata will be included when loading or connecting to these files with the corresponding RVAT methods (e.g.
rvbResult(<file>)andvarSetFile(<file>)). Note that if you read in RVAT results using non-RVAT functions (e.g.read.table), then you will have to skip lines that start with '#'.
- GDB now includes additional metadata, including a unique identifier, the genome build and creation date and can be retrieved using corresponding getters (e.g.
- Robustness, fixes & efficiency
- Unit tests are now included that cover the majority of code.
- RVAT now passes R CMD CHECK without errors / warnings.
- Complete rewrite of the RVAT command-line interface. This is mostly an 'under-the-hood' update to increase robustness and maintainability. For users it is relevant that 1) CLI + help pages are now in-sync with interactive version 2) parameters can now be provided without an '=', e.g.
--gdb <path to gdb>rather than--gdb=<path to gdb> - Increased efficiency: significant speed-up of extracting ranges (
rangesparameter ingetGTandgetAnno). - Several bug fixes / improvements.
- Miscellanous
- Annotations can now be included directly when loading genotypes using
getGT()by setting theannoparameter, or settinginludeVarInfo = TRUEto include the 'var' table. If 'REF' and 'ALT' alleles are included in the annotations, 'effectAllele' and 'otherAllele' are assigned in thegenoMatrixand automatically updated when alleles are flipped. effectAllele and otherAllele are passed on as a fields in single variant results. subsetGdb: Now has aVAR_idparameter to directly subset VAR_ids and atablesparameter to specify which tables to keep.getCohortnow returns records only for the samples included in the respective cohort, rather than running NA fields for all excluded samples. ThekeepAllparameter can be set toTRUEto return all records, like in previous versions.uploadAnno: now has akeepUnmappedflag, if set to TRUE (default) variants that do not map to gdb are discarded.writeVcfnow includes aincludeVarIdparameter that controls whether VAR_ids should be included in the 'ID' field. This parameter defaults toFALSE, in which case the 'ID' field from the 'var' table is included. Note: this is a change relative to the previous version, in which VAR_ids were written to the 'ID' field by default. Also, FORMAT field not included anymore when generating a sites-only vcf (includeGeno=TRUE).- 'AF' field in rowData slot of genoMatrix is removed.
mergeAggregateFilesis now split into two methods:mergeAggregateFilesandcollapseAggregateFiles.mergeAggregateFilesbehaves identically to formermergeAggregateFilesmethod withcollapse = FALSE, whereascollapseAggregateFilesbehaved identically to formermergeAggregateFilesmethod withcollapse = TRUE.
- Annotations can now be included directly when loading genotypes using