Literature DB >> 31786209

Detecting, Categorizing, and Correcting Coverage Anomalies of RNA-Seq Quantification.

Cong Ma1, Carl Kingsford2.   

Abstract

Because of incomplete reference transcriptomes, incomplete sequencing bias models, or other modeling defects, algorithms to infer isoform expression from RNA sequencing (RNA-seq) sometimes do not accurately model expression. We present a computational method to detect instances where a quantification algorithm could not completely explain the input reads. Our approach identifies regions where the read coverage significantly deviates from expectation. We call these regions "expression anomalies." We further present a method to attribute their cause to either the incompleteness of the reference transcriptome or algorithmic mistakes. We detect anomalies for 30 GEUVADIS and 16 Human Body Map samples. By correcting anomalies when possible, we reduce the number of falsely predicted instances of differential expression. Anomalies that cannot be corrected are suspected to indicate the existence of isoforms unannotated by the reference. We detected 88 common anomalies of this type and find that they tend to have a lower-than-expected coverage toward their 3' ends.
Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  RNA-seq; anomaly detection; expression quantification; unannotated isoform

Mesh:

Substances:

Year:  2019        PMID: 31786209      PMCID: PMC6938679          DOI: 10.1016/j.cels.2019.10.005

Source DB:  PubMed          Journal:  Cell Syst        ISSN: 2405-4712            Impact factor:   10.304


  40 in total

1.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

2.  GRIM-19 in Health and Disease.

Authors:  Valdemar Máximo; Jorge Lima; Paula Soares; André Silva; Inês Bento; Manuel Sobrinho-Simões
Journal:  Adv Anat Pathol       Date:  2008-01       Impact factor: 3.875

3.  RNA-Seq gene expression estimation with read mapping uncertainty.

Authors:  Bo Li; Victor Ruotti; Ron M Stewart; James A Thomson; Colin N Dewey
Journal:  Bioinformatics       Date:  2009-12-18       Impact factor: 6.937

4.  Fast and accurate approximate inference of transcript expression from RNA-seq data.

Authors:  James Hensman; Panagiotis Papastamoulis; Peter Glaus; Antti Honkela; Magnus Rattray
Journal:  Bioinformatics       Date:  2015-08-26       Impact factor: 6.937

5.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

6.  Discovery and functional prioritization of Parkinson's disease candidate genes from large-scale whole exome sequencing.

Authors:  Iris E Jansen; Hui Ye; Sasja Heetveld; Marie C Lechler; Helen Michels; Renée I Seinstra; Steven J Lubbe; Valérie Drouet; Suzanne Lesage; Elisa Majounie; J Raphael Gibbs; Mike A Nalls; Mina Ryten; Juan A Botia; Jana Vandrovcova; Javier Simon-Sanchez; Melissa Castillo-Lizardo; Patrizia Rizzu; Cornelis Blauwendraat; Amit K Chouhan; Yarong Li; Puja Yogi; Najaf Amin; Cornelia M van Duijn; Huw R Morris; Alexis Brice; Andrew B Singleton; Della C David; Ellen A Nollen; Shushant Jain; Joshua M Shulman; Peter Heutink
Journal:  Genome Biol       Date:  2017-01-30       Impact factor: 13.583

Review 7.  RNA-Seq differential expression analysis: An extended review and a software tool.

Authors:  Juliana Costa-Silva; Douglas Domingues; Fabricio Martins Lopes
Journal:  PLoS One       Date:  2017-12-21       Impact factor: 3.240

8.  Gene co-expression analysis for functional classification and gene-disease predictions.

Authors:  Sipko van Dam; Urmo Võsa; Adriaan van der Graaf; Lude Franke; João Pedro de Magalhães
Journal:  Brief Bioinform       Date:  2018-07-20       Impact factor: 11.622

9.  Analysis of alternative cleavage and polyadenylation in mature and differentiating neurons using RNA-seq data.

Authors:  Aysegul Guvenek; Bin Tian
Journal:  Quant Biol       Date:  2018-09-05

10.  The Pfam protein families database in 2019.

Authors:  Sara El-Gebali; Jaina Mistry; Alex Bateman; Sean R Eddy; Aurélien Luciani; Simon C Potter; Matloob Qureshi; Lorna J Richardson; Gustavo A Salazar; Alfredo Smart; Erik L L Sonnhammer; Layla Hirsh; Lisanna Paladin; Damiano Piovesan; Silvio C E Tosatto; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

1.  Effects of transcriptional noise on estimates of gene and transcript expression in RNA sequencing experiments.

Authors:  Ales Varabyou; Steven L Salzberg; Mihaela Pertea
Journal:  Genome Res       Date:  2020-12-23       Impact factor: 9.043

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.