Literature DB >> 25964664

Shape analysis of high-throughput transcriptomics experiment data.

Kwame Okrah1, Héctor Corrada Bravo2.   

Abstract

The recent growth of high-throughput transcriptome technology has been paralleled by the development of statistical methodologies to analyze the data they produce. Some of these newly developed methods are based on the assumption that the data observed or a transformation of the data are relatively symmetric with light tails, usually summarized by assuming a Gaussian random component. It is indeed very difficult to assess this assumption for small sample sizes. In this article, we utilize L-moments statistics as the basis of exploratory data analysis, the assessment of distributional assumptions, and the hypothesis testing of high-throughput transcriptomic data. In particular, we use L-moments ratios for assessing the shape (skewness and kurtosis) of high-throughput transcriptome data. Based on these statistics, we propose an algorithm for identifying genes with distributions that are markedly different from the majority in the data. In addition, we also illustrate the utility of this framework to characterize the robustness of distributional assumptions. We apply it to RNA-seq data and find that methods based on the simple [Formula: see text]-test for differential expression analysis using L-moments as weights are robust.
© The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Kurtosis; L-moments ratio diagram; Power; Skewness

Mesh:

Year:  2015        PMID: 25964664      PMCID: PMC4570582          DOI: 10.1093/biostatistics/kxv018

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  19 in total

1.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays.

Authors:  Martin J Aryee; Andrew E Jaffe; Hector Corrada-Bravo; Christine Ladd-Acosta; Andrew P Feinberg; Kasper D Hansen; Rafael A Irizarry
Journal:  Bioinformatics       Date:  2014-01-28       Impact factor: 6.937

Review 4.  Tackling the widespread and critical impact of batch effects in high-throughput data.

Authors:  Jeffrey T Leek; Robert B Scharpf; Héctor Corrada Bravo; David Simcha; Benjamin Langmead; W Evan Johnson; Donald Geman; Keith Baggerly; Rafael A Irizarry
Journal:  Nat Rev Genet       Date:  2010-09-14       Impact factor: 53.242

5.  Chromatin and transcription transitions of mammalian adult germline stem cells and spermatogenesis.

Authors:  Saher Sue Hammoud; Diana H P Low; Chongil Yi; Douglas T Carrell; Ernesto Guccione; Bradley R Cairns
Journal:  Cell Stem Cell       Date:  2014-05-15       Impact factor: 24.633

6.  Evaluating gene expression in C57BL/6J and DBA/2J mouse striatum using RNA-Seq and microarrays.

Authors:  Daniel Bottomly; Nicole A R Walter; Jessica Ezzell Hunter; Priscila Darakjian; Sunita Kawane; Kari J Buck; Robert P Searles; Michael Mooney; Shannon K McWeeney; Robert Hitzemann
Journal:  PLoS One       Date:  2011-03-24       Impact factor: 3.240

7.  ReCount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets.

Authors:  Alyssa C Frazee; Ben Langmead; Jeffrey T Leek
Journal:  BMC Bioinformatics       Date:  2011-11-16       Impact factor: 3.169

8.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

9.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

10.  Gene expression anti-profiles as a basis for accurate universal cancer signatures.

Authors:  Héctor Corrada Bravo; Vasyl Pihur; Matthew McCall; Rafael A Irizarry; Jeffrey T Leek
Journal:  BMC Bioinformatics       Date:  2012-10-22       Impact factor: 3.169

View more
  2 in total

1.  BatchQC: interactive software for evaluating sample and batch effects in genomic data.

Authors:  Solaiappan Manimaran; Heather Marie Selby; Kwame Okrah; Claire Ruberman; Jeffrey T Leek; John Quackenbush; Benjamin Haibe-Kains; Hector Corrada Bravo; W Evan Johnson
Journal:  Bioinformatics       Date:  2016-08-18       Impact factor: 6.937

2.  Per-sample standardization and asymmetric winsorization lead to accurate clustering of RNA-seq expression profiles.

Authors:  Davide Risso; Stefano Maria Pagnotta
Journal:  Bioinformatics       Date:  2021-02-09       Impact factor: 6.937

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.