Literature DB >> 30503520

OUTRIDER: A Statistical Method for Detecting Aberrantly Expressed Genes in RNA Sequencing Data.

Felix Brechtmann1, Christian Mertes1, Agnė Matusevičiūtė1, Vicente A Yépez2, Žiga Avsec2, Maximilian Herzog1, Daniel M Bader1, Holger Prokisch3, Julien Gagneur4.   

Abstract

RNA sequencing (RNA-seq) is gaining popularity as a complementary assay to genome sequencing for precisely identifying the molecular causes of rare disorders. A powerful approach is to identify aberrant gene expression levels as potential pathogenic events. However, existing methods for detecting aberrant read counts in RNA-seq data either lack assessments of statistical significance, so that establishing cutoffs is arbitrary, or rely on subjective manual corrections for confounders. Here, we describe OUTRIDER (Outlier in RNA-Seq Finder), an algorithm developed to address these issues. The algorithm uses an autoencoder to model read-count expectations according to the gene covariation resulting from technical, environmental, or common genetic variations. Given these expectations, the RNA-seq read counts are assumed to follow a negative binomial distribution with a gene-specific dispersion. Outliers are then identified as read counts that significantly deviate from this distribution. The model is automatically fitted to achieve the best recall of artificially corrupted data. Precision-recall analyses using simulated outlier read counts demonstrated the importance of controlling for covariation and significance-based thresholds. OUTRIDER is open source and includes functions for filtering out genes not expressed in a dataset, for identifying outlier samples with too many aberrantly expressed genes, and for detecting aberrant gene expression on the basis of false-discovery-rate-adjusted p values. Overall, OUTRIDER provides an end-to-end solution for identifying aberrantly expressed genes and is suitable for use by rare-disease diagnostic platforms.
Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  RNA sequencing; aberrant gene expression; normalization; outlier detection; rare disease

Mesh:

Substances:

Year:  2018        PMID: 30503520      PMCID: PMC6288422          DOI: 10.1016/j.ajhg.2018.10.025

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  29 in total

1.  A Bayesian missing value estimation method for gene expression profile data.

Authors:  Shigeyuki Oba; Masa-aki Sato; Ichiro Takemasa; Morito Monden; Ken-ichi Matsubara; Shin Ishii
Journal:  Bioinformatics       Date:  2003-11-01       Impact factor: 6.937

2.  Robust inference in the negative binomial regression model with an application to falls data.

Authors:  William H Aeberhard; Eva Cantoni; Stephane Heritier
Journal:  Biometrics       Date:  2014-08-25       Impact factor: 2.571

3.  Auto-association by multilayer perceptrons and singular value decomposition.

Authors:  H Bourlard; Y Kamp
Journal:  Biol Cybern       Date:  1988       Impact factor: 2.086

4.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

5.  Whole exome sequencing of suspected mitochondrial patients in clinical practice.

Authors:  Saskia B Wortmann; David A Koolen; Jan A Smeitink; Lambert van den Heuvel; Richard J Rodenburg
Journal:  J Inherit Metab Dis       Date:  2015-03-04       Impact factor: 4.982

6.  Guidelines for investigating causality of sequence variants in human disease.

Authors:  D G MacArthur; T A Manolio; D P Dimmock; H L Rehm; J Shendure; G R Abecasis; D R Adams; R B Altman; S E Antonarakis; E A Ashley; J C Barrett; L G Biesecker; D F Conrad; G M Cooper; N J Cox; M J Daly; M B Gerstein; D B Goldstein; J N Hirschhorn; S M Leal; L A Pennacchio; J A Stamatoyannopoulos; S R Sunyaev; D Valle; B F Voight; W Winckler; C Gunter
Journal:  Nature       Date:  2014-04-24       Impact factor: 49.962

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants.

Authors:  Xin Li; Alexis Battle; Konrad J Karczewski; Zach Zappala; David A Knowles; Kevin S Smith; Kim R Kukurba; Eric Wu; Noah Simon; Stephen B Montgomery
Journal:  Am J Hum Genet       Date:  2014-09-04       Impact factor: 11.025

9.  The Ensembl Variant Effect Predictor.

Authors:  William McLaren; Laurent Gil; Sarah E Hunt; Harpreet Singh Riat; Graham R S Ritchie; Anja Thormann; Paul Flicek; Fiona Cunningham
Journal:  Genome Biol       Date:  2016-06-06       Impact factor: 13.583

10.  The UCSC Genome Browser database: 2018 update.

Authors:  Jonathan Casper; Ann S Zweig; Chris Villarreal; Cath Tyner; Matthew L Speir; Kate R Rosenbloom; Brian J Raney; Christopher M Lee; Brian T Lee; Donna Karolchik; Angie S Hinrichs; Maximilian Haeussler; Luvina Guruvadoo; Jairo Navarro Gonzalez; David Gibson; Ian T Fiddes; Christopher Eisenhart; Mark Diekhans; Hiram Clawson; Galt P Barber; Joel Armstrong; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

View more
  30 in total

1.  ORE identifies extreme expression effects enriched for rare variants.

Authors:  F Richter; G E Hoffman; K B Manheimer; N Patel; A J Sharp; D McKean; S U Morton; S DePalma; J Gorham; A Kitaygorodksy; G A Porter; A Giardini; Y Shen; W K Chung; J G Seidman; C E Seidman; E E Schadt; B D Gelb
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

2.  Transcriptome-directed analysis for Mendelian disease diagnosis overcomes limitations of conventional genomic testing.

Authors:  David R Murdock; Hongzheng Dai; Lindsay C Burrage; Jill A Rosenfeld; Shamika Ketkar; Michaela F Müller; Vicente A Yépez; Julien Gagneur; Pengfei Liu; Shan Chen; Mahim Jain; Gladys Zapata; Carlos A Bacino; Hsiao-Tuan Chao; Paolo Moretti; William J Craigen; Neil A Hanchard; Brendan Lee
Journal:  J Clin Invest       Date:  2021-01-04       Impact factor: 14.808

3.  Mendelian Gene Discovery: Fast and Furious with No End in Sight.

Authors:  Michael J Bamshad; Deborah A Nickerson; Jessica X Chong
Journal:  Am J Hum Genet       Date:  2019-09-05       Impact factor: 11.025

4.  A form of muscular dystrophy associated with pathogenic variants in JAG2.

Authors:  Sandra Coppens; Alison M Barnard; Sanna Puusepp; Sander Pajusalu; Katrin Õunap; Dorianmarie Vargas-Franco; Christine C Bruels; Sandra Donkervoort; Lynn Pais; Katherine R Chao; Julia K Goodrich; Eleina M England; Ben Weisburd; Vijay S Ganesh; Sanna Gudmundsson; Anne O'Donnell-Luria; Mait Nigul; Pilvi Ilves; Payam Mohassel; Teepu Siddique; Margherita Milone; Stefan Nicolau; Reza Maroofian; Henry Houlden; Michael G Hanna; Ros Quinlivan; Mehran Beiraghi Toosi; Ehsan Ghayoor Karimiani; Sabine Costagliola; Nicolas Deconinck; Hazim Kadhim; Erica Macke; Brendan C Lanpher; Eric W Klee; Anna Łusakowska; Anna Kostera-Pruszczyk; Andreas Hahn; Bertold Schrank; Ichizo Nishino; Masashi Ogasawara; Rasha El Sherif; Tanya Stojkovic; Isabelle Nelson; Gisèle Bonne; Enzo Cohen; Anne Boland-Augé; Jean-François Deleuze; Yao Meng; Ana Töpf; Catheline Vilain; Christina A Pacak; Marie L Rivera-Zengotita; Carsten G Bönnemann; Volker Straub; Penny A Handford; Isabelle Draper; Glenn A Walter; Peter B Kang
Journal:  Am J Hum Genet       Date:  2021-04-15       Impact factor: 11.025

5.  Detection of aberrant gene expression events in RNA sequencing data.

Authors:  Vicente A Yépez; Christian Mertes; Michaela F Müller; Daniela Klaproth-Andrade; Leonhard Wachutka; Laure Frésard; Mirjana Gusic; Ines F Scheller; Patricia F Goldberg; Holger Prokisch; Julien Gagneur
Journal:  Nat Protoc       Date:  2021-01-18       Impact factor: 13.491

Review 6.  Patient derived stem cells for discovery and validation of novel pathogenic variants in inherited retinal disease.

Authors:  Nathaniel K Mullin; Andrew P Voigt; Jessica A Cooke; Laura R Bohrer; Erin R Burnight; Edwin M Stone; Robert F Mullins; Budd A Tucker
Journal:  Prog Retin Eye Res       Date:  2020-10-29       Impact factor: 21.198

7.  AdaTiSS: A Novel Data-Adaptive Robust Method for Identifying Tissue Specificity Scores.

Authors:  Meng Wang; Lihua Jiang; Michael P Snyder
Journal:  Bioinformatics       Date:  2021-06-19       Impact factor: 6.931

8.  Integrating whole-genome sequencing with multi-omic data reveals the impact of structural variants on gene regulation in the human brain.

Authors:  Ricardo A Vialle; Katia de Paiva Lopes; David A Bennett; John F Crary; Towfique Raj
Journal:  Nat Neurosci       Date:  2022-03-14       Impact factor: 28.771

Review 9.  How Machine Learning and Statistical Models Advance Molecular Diagnostics of Rare Disorders Via Analysis of RNA Sequencing Data.

Authors:  Lea D Schlieben; Holger Prokisch; Vicente A Yépez
Journal:  Front Mol Biosci       Date:  2021-06-01

Review 10.  Strategies to Uplift Novel Mendelian Gene Discovery for Improved Clinical Outcomes.

Authors:  Eleanor G Seaby; Heidi L Rehm; Anne O'Donnell-Luria
Journal:  Front Genet       Date:  2021-06-17       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.