| Literature DB >> 35134827 |
Daniel L Cameron1,2, Ruining Dong1,2, Anthony T Papenfuss1,2,3,4.
Abstract
StructuralVariantAnnotation is an R/Bioconductor package that provides a framework for decoupling downstream analysis of structural variant breakpoints from upstream variant calling methods. It standardizes the representational format from BEDPE, or any of the three different notations supported by VCF into a breakpoint GRanges data structure suitable for use by the wider Bioconductor ecosystem. It handles both transitive breakpoints and duplication/insertion notational differences of identical variants-both common scenarios when comparing short/long read-based call sets that confound downstream analysis. StructuralVariantAnnotation provides the caller-agnostic foundation needed for a R/Bioconductor ecosystem of structural variant annotation, classification, and interpretation tools able to handle both simple and complex genomic rearrangements. AVAILABILITY: StructuralVariantAnnotation is implemented in R and available for download as the Bioconductor StructuralVariantAnnotation package. Details can be found at https://www.bioconductor.org/packages/release/bioc/html/StructuralVariantAnnotation.htmlIt has been released under a GPL license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.Entities:
Year: 2022 PMID: 35134827 PMCID: PMC8963309 DOI: 10.1093/bioinformatics/btac042
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.StructuralVariantAnnotation provides a potential foundation for a caller-agnostic structural variant tool ecosystem. (a) StructuralVariantAnnotation simplifies downstream analysis by decoupling the caller from the analysis. (b) Structural variants are converted to a common breakpoint GRanges notation regardless of how they are represented in VCF. Strand indicates breakend orientation. Inversions are decomposed into their constituent breakpoints. (c) Breakpoints are composed of a pair of connected breakends. Single breakend variants are breakpoints where only one side can be placed. (d) Long read callers frequently report duplications as insertions. StructuralVariantAnnotation can identify when an insertion and duplication represent the same genomic rearrangement. (e) StructuralVariantAnnotation can identify transitive breakpoints—spurious breakpoints that span two or more breakpoints in a complex genomic rearrangement