| Literature DB >> 27576531 |
Mercedeh Movassagh1,2, Nawaf Alomran1,3, Prakriti Mudvari1, Merve Dede1, Cem Dede1, Kamran Kowsari1,4, Paula Restrepo1, Edmund Cauley5, Sonali Bahl5, Muzi Li1,3, Wesley Waterhouse1, Krasimira Tsaneva-Atanasova6, Nathan Edwards3, Anelia Horvath7,5.
Abstract
We introduce RNA2DNAlign, a computational framework for quantitative assessment of allele counts across paired RNA and DNA sequencing datasets. RNA2DNAlign is based on quantitation of the relative abundance of variant and reference read counts, followed by binomial tests for genotype and allelic status at SNV positions between compatible sequences. RNA2DNAlign detects positions with differential allele distribution, suggesting asymmetries due to regulatory/structural events. Based on the type of asymmetry, RNA2DNAlign outlines positions likely to be implicated in RNA editing, allele-specific expression or loss, somatic mutagenesis or loss-of-heterozygosity (the first three also in a tumor-specific setting). We applied RNA2DNAlign on 360 matching normal and tumor exomes and transcriptomes from 90 breast cancer patients from TCGA. Under high-confidence settings, RNA2DNAlign identified 2038 distinct SNV sites associated with one of the aforementioned asymetries, the majority of which have not been linked to functionality before. The performance assessment shows very high specificity and sensitivity, due to the corroboration of signals across multiple matching datasets. RNA2DNAlign is freely available from http://github.com/HorvathLab/NGS as a self-contained binary package for 64-bit Linux systems.Entities:
Mesh:
Year: 2016 PMID: 27576531 PMCID: PMC5159535 DOI: 10.1093/nar/gkw757
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971