Literature DB >> 24341889

RNA-Seq optimization with eQTL gold standards.

Shannon E Ellis, Simone Gupta, Foram N Ashar, Joel S Bader, Andrew B West, Dan E Arking1.   

Abstract

BACKGROUND: RNA-Sequencing (RNA-Seq) experiments have been optimized for library preparation, mapping, and gene expression estimation. These methods, however, have revealed weaknesses in the next stages of analysis of differential expression, with results sensitive to systematic sample stratification or, in more extreme cases, to outliers. Further, a method to assess normalization and adjustment measures imposed on the data is lacking.
RESULTS: To address these issues, we utilize previously published eQTLs as a novel gold standard at the center of a framework that integrates DNA genotypes and RNA-Seq data to optimize analysis and aid in the understanding of genetic variation and gene expression. After detecting sample contamination and sequencing outliers in RNA-Seq data, a set of previously published brain eQTLs was used to determine if sample outlier removal was appropriate. Improved replication of known eQTLs supported removal of these samples in downstream analyses. eQTL replication was further employed to assess normalization methods, covariate inclusion, and gene annotation. This method was validated in an independent RNA-Seq blood data set from the GTEx project and a tissue-appropriate set of eQTLs. eQTL replication in both data sets highlights the necessity of accounting for unknown covariates in RNA-Seq data analysis.
CONCLUSION: As each RNA-Seq experiment is unique with its own experiment-specific limitations, we offer an easily-implementable method that uses the replication of known eQTLs to guide each step in one's data analysis pipeline. In the two data sets presented herein, we highlight not only the necessity of careful outlier detection but also the need to account for unknown covariates in RNA-Seq experiments.

Entities:  

Mesh:

Year:  2013        PMID: 24341889      PMCID: PMC3890578          DOI: 10.1186/1471-2164-14-892

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  33 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

Review 2.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

3.  GenABEL: an R library for genome-wide association analysis.

Authors:  Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal:  Bioinformatics       Date:  2007-03-23       Impact factor: 6.937

4.  Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories.

Authors:  Peter A C 't Hoen; Marc R Friedländer; Jonas Almlöf; Michael Sammeth; Irina Pulyakhina; Seyed Yahya Anvar; Jeroen F J Laros; Henk P J Buermans; Olof Karlberg; Mathias Brännvall; Johan T den Dunnen; Gert-Jan B van Ommen; Ivo G Gut; Roderic Guigó; Xavier Estivill; Ann-Christine Syvänen; Emmanouil T Dermitzakis; Tuuli Lappalainen
Journal:  Nat Biotechnol       Date:  2013-09-15       Impact factor: 54.908

Review 5.  Computational methods for transcriptome annotation and quantification using RNA-seq.

Authors:  Manuel Garber; Manfred G Grabherr; Mitchell Guttman; Cole Trapnell
Journal:  Nat Methods       Date:  2011-05-27       Impact factor: 28.547

6.  Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies.

Authors:  Andrew E Teschendorff; Joanna Zhuang; Martin Widschwendter
Journal:  Bioinformatics       Date:  2011-04-06       Impact factor: 6.937

Review 7.  Tackling the widespread and critical impact of batch effects in high-throughput data.

Authors:  Jeffrey T Leek; Robert B Scharpf; Héctor Corrada Bravo; David Simcha; Benjamin Langmead; W Evan Johnson; Donald Geman; Keith Baggerly; Rafael A Irizarry
Journal:  Nat Rev Genet       Date:  2010-09-14       Impact factor: 53.242

8.  Genotype imputation with thousands of genomes.

Authors:  Bryan Howie; Jonathan Marchini; Matthew Stephens
Journal:  G3 (Bethesda)       Date:  2011-11-01       Impact factor: 3.154

9.  Brain expression genome-wide association study (eGWAS) identifies human disease-associated variants.

Authors:  Fanggeng Zou; High Seng Chai; Curtis S Younkin; Mariet Allen; Julia Crook; V Shane Pankratz; Minerva M Carrasquillo; Christopher N Rowley; Asha A Nair; Sumit Middha; Sooraj Maharjan; Thuy Nguyen; Li Ma; Kimberly G Malphrus; Ryan Palusak; Sarah Lincoln; Gina Bisceglio; Constantin Georgescu; Naomi Kouri; Christopher P Kolbert; Jin Jen; Jonathan L Haines; Richard Mayeux; Margaret A Pericak-Vance; Lindsay A Farrer; Gerard D Schellenberg; Ronald C Petersen; Neill R Graff-Radford; Dennis W Dickson; Steven G Younkin; Nilüfer Ertekin-Taner
Journal:  PLoS Genet       Date:  2012-06-07       Impact factor: 5.917

10.  Normalizing RNA-sequencing data by modeling hidden covariates with prior knowledge.

Authors:  Sara Mostafavi; Alexis Battle; Xiaowei Zhu; Alexander E Urban; Douglas Levinson; Stephen B Montgomery; Daphne Koller
Journal:  PLoS One       Date:  2013-07-18       Impact factor: 3.240

View more
  12 in total

1.  Comprehensively evaluating cis-regulatory variation in the human prostate transcriptome by using gene-level allele-specific expression.

Authors:  Nicholas B Larson; Shannon McDonnell; Amy J French; Zach Fogarty; John Cheville; Sumit Middha; Shaun Riska; Saurabh Baheti; Asha A Nair; Liang Wang; Daniel J Schaid; Stephen N Thibodeau
Journal:  Am J Hum Genet       Date:  2015-05-14       Impact factor: 11.025

2.  Novel approaches for bioinformatic analysis of salivary RNA sequencing data for development.

Authors:  Karolina Elzbieta Kaczor-Urbanowicz; Yong Kim; Feng Li; Timur Galeev; Rob R Kitchen; Mark Gerstein; Kikuye Koyano; Sung-Hee Jeong; Xiaoyan Wang; David Elashoff; So Young Kang; Su Mi Kim; Kyoung Kim; Sung Kim; David Chia; Xinshu Xiao; Joel Rozowsky; David T W Wong
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

3.  Leveraging genetically simple traits to identify small-effect variants for complex phenotypes.

Authors:  K E Kemper; M D Littlejohn; T Lopdell; B J Hayes; L E Bennett; R P Williams; X Q Xu; P M Visscher; M J Carrick; M E Goddard
Journal:  BMC Genomics       Date:  2016-11-03       Impact factor: 3.969

4.  Dysregulation of Alternative Poly-adenylation as a Potential Player in Autism Spectrum Disorder.

Authors:  Krzysztof J Szkop; Peter I C Cooke; Joanne A Humphries; Viktoria Kalna; David S Moss; Eugene F Schuster; Irene Nobeli
Journal:  Front Mol Neurosci       Date:  2017-09-13       Impact factor: 5.639

5.  Exaggerated CpH methylation in the autism-affected brain.

Authors:  Shannon E Ellis; Simone Gupta; Anna Moes; Andrew B West; Dan E Arking
Journal:  Mol Autism       Date:  2017-02-17       Impact factor: 7.509

Review 6.  Effects of Type 1 Diabetes Risk Alleles on Immune Cell Gene Expression.

Authors:  Ramesh Ram; Grant Morahan
Journal:  Genes (Basel)       Date:  2017-06-21       Impact factor: 4.096

7.  Transcriptome analysis reveals dysregulation of innate immune response genes and neuronal activity-dependent genes in autism.

Authors:  Simone Gupta; Shannon E Ellis; Foram N Ashar; Anna Moes; Joel S Bader; Jianan Zhan; Andrew B West; Dan E Arking
Journal:  Nat Commun       Date:  2014-12-10       Impact factor: 14.919

8.  Sequence-based Association Analysis Reveals an MGST1 eQTL with Pleiotropic Effects on Bovine Milk Composition.

Authors:  Mathew D Littlejohn; Kathryn Tiplady; Tania A Fink; Klaus Lehnert; Thomas Lopdell; Thomas Johnson; Christine Couldrey; Mike Keehan; Richard G Sherlock; Chad Harland; Andrew Scott; Russell G Snell; Stephen R Davis; Richard J Spelman
Journal:  Sci Rep       Date:  2016-05-05       Impact factor: 4.379

9.  Transcriptome analysis of cortical tissue reveals shared sets of downregulated genes in autism and schizophrenia.

Authors:  S E Ellis; R Panitch; A B West; D E Arking
Journal:  Transl Psychiatry       Date:  2016-05-24       Impact factor: 6.222

10.  A Pipeline for High-Throughput Concentration Response Modeling of Gene Expression for Toxicogenomics.

Authors:  John S House; Fabian A Grimm; Dereje D Jima; Yi-Hui Zhou; Ivan Rusyn; Fred A Wright
Journal:  Front Genet       Date:  2017-11-01       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.