Literature DB >> 34015820

Benchmarking association analyses of continuous exposures with RNA-seq in observational studies.

Tamar Sofer1, Nuzulul Kurniansyah1, François Aguet2, Kristin Ardlie2, Peter Durda3, Deborah A Nickerson4, Joshua D Smith5, Yongmei Liu6, Sina A Gharib7, Susan Redline8, Stephen S Rich9, Jerome I Rotter10, Kent D Taylor11.   

Abstract

Large datasets of hundreds to thousands of individuals measuring RNA-seq in observational studies are becoming available. Many popular software packages for analysis of RNA-seq data were constructed to study differences in expression signatures in an experimental design with well-defined conditions (exposures). In contrast, observational studies may have varying levels of confounding transcript-exposure associations; further, exposure measures may vary from discrete (exposed, yes/no) to continuous (levels of exposure), with non-normal distributions of exposure. We compare popular software for gene expression-DESeq2, edgeR and limma-as well as linear regression-based analyses for studying the association of continuous exposures with RNA-seq. We developed a computation pipeline that includes transformation, filtering and generation of empirical null distribution of association P-values, and we apply the pipeline to compute empirical P-values with multiple testing correction. We employ a resampling approach that allows for assessment of false positive detection across methods, power comparison and the computation of quantile empirical P-values. The results suggest that linear regression methods are substantially faster with better control of false detections than other methods, even with the resampling method to compute empirical P-values. We provide the proposed pipeline with fast algorithms in an R package Olivia, and implemented it to study the associations of measures of sleep disordered breathing with RNA-seq in peripheral blood mononuclear cells in participants from the Multi-Ethnic Study of Atherosclerosis.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  RNA-seq; continuous exposure; empirical P-values; non-normality; observational studies

Mesh:

Year:  2021        PMID: 34015820      PMCID: PMC8574950          DOI: 10.1093/bib/bbab194

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  35 in total

Review 1.  Statistical design and the analysis of gene expression microarray data.

Authors:  M K Kerr; G A Churchill
Journal:  Genet Res       Date:  2001-04       Impact factor: 1.588

2.  Estimating p-values in small microarray experiments.

Authors:  Hyuna Yang; Gary Churchill
Journal:  Bioinformatics       Date:  2006-10-30       Impact factor: 6.937

3.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

4.  Racial/Ethnic Differences in Sleep Disturbances: The Multi-Ethnic Study of Atherosclerosis (MESA).

Authors:  Xiaoli Chen; Rui Wang; Phyllis Zee; Pamela L Lutsey; Sogol Javaheri; Carmela Alcántara; Chandra L Jackson; Michelle A Williams; Susan Redline
Journal:  Sleep       Date:  2015-06-01       Impact factor: 5.849

5.  Why weight? Modelling sample and observational level variability improves power in RNA-seq analyses.

Authors:  Ruijie Liu; Aliaksei Z Holik; Shian Su; Natasha Jansz; Kelan Chen; Huei San Leong; Marnie E Blewitt; Marie-Liesse Asselin-Labat; Gordon K Smyth; Matthew E Ritchie
Journal:  Nucleic Acids Res       Date:  2015-04-29       Impact factor: 16.971

6.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

Review 7.  RNA-Seq differential expression analysis: An extended review and a software tool.

Authors:  Juliana Costa-Silva; Douglas Domingues; Fabricio Martins Lopes
Journal:  PLoS One       Date:  2017-12-21       Impact factor: 3.240

8.  Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution.

Authors:  Maarten van Iterson; Erik W van Zwet; Bastiaan T Heijmans
Journal:  Genome Biol       Date:  2017-01-27       Impact factor: 13.583

9.  Normalizing RNA-sequencing data by modeling hidden covariates with prior knowledge.

Authors:  Sara Mostafavi; Alexis Battle; Xiaowei Zhu; Alexander E Urban; Douglas Levinson; Stephen B Montgomery; Daphne Koller
Journal:  PLoS One       Date:  2013-07-18       Impact factor: 3.240

10.  Low oxygen saturation during sleep reduces CD1D and RAB20 expressions that are reversed by CPAP therapy.

Authors:  Tamar Sofer; Ruitong Li; Roby Joehanes; Honghuang Lin; Adam C Gower; Heming Wang; Nuzulul Kurniansyah; Brian E Cade; Jiwon Lee; Stephanie Williams; Reena Mehra; Sanjay R Patel; Stuart F Quan; Yongmei Liu; Jerome I Rotter; Stephen S Rich; Avrum Spira; Daniel Levy; Sina A Gharib; Susan Redline; Daniel J Gottlieb
Journal:  EBioMedicine       Date:  2020-06-05       Impact factor: 8.143

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.