Literature DB >> 25150836

Normalization of RNA-seq data using factor analysis of control genes or samples.

Davide Risso1, John Ngai2, Terence P Speed3, Sandrine Dudoit4.   

Abstract

Normalization of RNA-sequencing (RNA-seq) data has proven essential to ensure accurate inference of expression levels. Here, we show that usual normalization approaches mostly account for sequencing depth and fail to correct for library preparation and other more complex unwanted technical effects. We evaluate the performance of the External RNA Control Consortium (ERCC) spike-in controls and investigate the possibility of using them directly for normalization. We show that the spike-ins are not reliable enough to be used in standard global-scaling or regression-based normalization procedures. We propose a normalization strategy, called remove unwanted variation (RUV), that adjusts for nuisance technical effects by performing factor analysis on suitable sets of control genes (e.g., ERCC spike-ins) or samples (e.g., replicate libraries). Our approach leads to more accurate estimates of expression fold-changes and tests of differential expression compared to state-of-the-art normalization methods. In particular, RUV promises to be valuable for large collaborative projects involving multiple laboratories, technicians, and/or sequencing platforms.

Entities:  

Mesh:

Year:  2014        PMID: 25150836      PMCID: PMC4404308          DOI: 10.1038/nbt.2931

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  33 in total

1.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

2.  Evaluation of DNA microarray results with quantitative gene expression platforms.

Authors:  Roger D Canales; Yuling Luo; James C Willey; Bradley Austermiller; Catalin C Barbacioru; Cecilie Boysen; Kathryn Hunkapiller; Roderick V Jensen; Charles R Knight; Kathleen Y Lee; Yunqing Ma; Botoul Maqsodi; Adam Papallo; Elizabeth Herness Peters; Karen Poulter; Patricia L Ruppel; Raymond R Samaha; Leming Shi; Wen Yang; Lu Zhang; Federico M Goodsaid
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

3.  Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories.

Authors:  Peter A C 't Hoen; Marc R Friedländer; Jonas Almlöf; Michael Sammeth; Irina Pulyakhina; Seyed Yahya Anvar; Jeroen F J Laros; Henk P J Buermans; Olof Karlberg; Mathias Brännvall; Johan T den Dunnen; Gert-Jan B van Ommen; Ivo G Gut; Roderic Guigó; Xavier Estivill; Ann-Christine Syvänen; Emmanouil T Dermitzakis; Tuuli Lappalainen
Journal:  Nat Biotechnol       Date:  2013-09-15       Impact factor: 54.908

4.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

5.  Synthetic spike-in standards for RNA-seq experiments.

Authors:  Lichun Jiang; Felix Schlesinger; Carrie A Davis; Yu Zhang; Renhua Li; Marc Salit; Thomas R Gingeras; Brian Oliver
Journal:  Genome Res       Date:  2011-08-04       Impact factor: 9.043

6.  mRNA enrichment protocols determine the quantification characteristics of external RNA spike-in controls in RNA-Seq studies.

Authors:  Tao Qing; Ying Yu; Tingting Du; Leming Shi
Journal:  Sci China Life Sci       Date:  2013-02-08       Impact factor: 6.038

7.  Correcting gene expression data when neither the unwanted variation nor the factor of interest are observed.

Authors:  Laurent Jacob; Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2015-08-17       Impact factor: 5.899

Review 8.  Development and applications of single-cell transcriptome analysis.

Authors:  Fuchou Tang; Kaiqin Lao; M Azim Surani
Journal:  Nat Methods       Date:  2011-04       Impact factor: 28.547

9.  Removing technical variability in RNA-seq data using conditional quantile normalization.

Authors:  Kasper D Hansen; Rafael A Irizarry; Zhijin Wu
Journal:  Biostatistics       Date:  2012-01-27       Impact factor: 5.899

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  632 in total

1.  Genome-Wide Methylation Study Identifies an IL-13-induced Epigenetic Signature in Asthmatic Airways.

Authors:  Jessie Nicodemus-Johnson; Katherine A Naughton; Jyotsna Sudi; Kyle Hogarth; Edward T Naurekas; Dan L Nicolae; Anne I Sperling; Julian Solway; Steven R White; Carole Ober
Journal:  Am J Respir Crit Care Med       Date:  2016-02-15       Impact factor: 21.405

2.  The Overlooked Fact: Fundamental Need for Spike-In Control for Virtually All Genome-Wide Analyses.

Authors:  Kaifu Chen; Zheng Hu; Zheng Xia; Dongyu Zhao; Wei Li; Jessica K Tyler
Journal:  Mol Cell Biol       Date:  2015-12-28       Impact factor: 4.272

3.  OCTAD: an open workspace for virtually screening therapeutics targeting precise cancer patient groups using gene expression features.

Authors:  Billy Zeng; Benjamin S Glicksberg; Patrick Newbury; Evgeny Chekalin; Jing Xing; Ke Liu; Anita Wen; Caven Chow; Bin Chen
Journal:  Nat Protoc       Date:  2020-12-23       Impact factor: 13.491

4.  Cardiomyocyte cell cycling, maturation, and growth by multinucleation in postnatal swine.

Authors:  Nivedhitha Velayutham; Christina M Alfieri; Emma J Agnew; Kyle W Riggs; R Scott Baker; Sithara Raju Ponny; Farhan Zafar; Katherine E Yutzey
Journal:  J Mol Cell Cardiol       Date:  2020-07-22       Impact factor: 5.000

5.  Developmental origins of transgenerational sperm DNA methylation epimutations following ancestral DDT exposure.

Authors:  Millissia Ben Maamar; Eric Nilsson; Ingrid Sadler-Riggleman; Daniel Beck; John R McCarrey; Michael K Skinner
Journal:  Dev Biol       Date:  2018-11-27       Impact factor: 3.582

6.  ANPELA: analysis and performance assessment of the label-free quantification workflow for metaproteomic studies.

Authors:  Jing Tang; Jianbo Fu; Yunxia Wang; Bo Li; Yinghong Li; Qingxia Yang; Xuejiao Cui; Jiajun Hong; Xiaofeng Li; Yuzong Chen; Weiwei Xue; Feng Zhu
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

7.  A scaling-free minimum enclosing ball method to detect differentially expressed genes for RNA-seq data.

Authors:  Yan Zhou; Bin Yang; Junhui Wang; Jiadi Zhu; Guoliang Tian
Journal:  BMC Genomics       Date:  2021-06-26       Impact factor: 3.969

8.  Nonparametric expression analysis using inferential replicate counts.

Authors:  Anqi Zhu; Avi Srivastava; Joseph G Ibrahim; Rob Patro; Michael I Love
Journal:  Nucleic Acids Res       Date:  2019-10-10       Impact factor: 16.971

9.  Discordant roles for FGF ligands in lung branching morphogenesis between human and mouse.

Authors:  Soula Danopoulos; Matthew E Thornton; Brendan H Grubbs; Mark R Frey; David Warburton; Saverio Bellusci; Denise Al Alam
Journal:  J Pathol       Date:  2018-12-13       Impact factor: 7.996

10.  Targeting CDK9 Reactivates Epigenetically Silenced Genes in Cancer.

Authors:  Hanghang Zhang; Somnath Pandey; Meghan Travers; Hongxing Sun; George Morton; Jozef Madzo; Woonbok Chung; Jittasak Khowsathit; Oscar Perez-Leal; Carlos A Barrero; Carmen Merali; Yasuyuki Okamoto; Takahiro Sato; Joshua Pan; Judit Garriga; Natarajan V Bhanu; Johayra Simithy; Bela Patel; Jian Huang; Noël J-M Raynal; Benjamin A Garcia; Marlene A Jacobson; Cigall Kadoch; Salim Merali; Yi Zhang; Wayne Childers; Magid Abou-Gharbia; John Karanicolas; Stephen B Baylin; Cynthia A Zahnow; Jaroslav Jelinek; Xavier Graña; Jean-Pierre J Issa
Journal:  Cell       Date:  2018-10-25       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.