Literature DB >> 29192968

A regression framework for assessing covariate effects on the reproducibility of high-throughput experiments.

Qunhua Li1, Feipeng Zhang1.   

Abstract

The outcome of high-throughput biological experiments is affected by many operational factors in the experimental and data-analytical procedures. Understanding how these factors affect the reproducibility of the outcome is critical for establishing workflows that produce replicable discoveries. In this article, we propose a regression framework, based on a novel cumulative link model, to assess the covariate effects of operational factors on the reproducibility of findings from high-throughput experiments. In contrast to existing graphical approaches, our method allows one to succinctly characterize the simultaneous and independent effects of covariates on reproducibility and to compare reproducibility while controlling for potential confounding variables. We also establish a connection between our model and certain Archimedean copula models. This connection not only offers our regression framework an interpretation in copula models, but also provides guidance on choosing the functional forms of the regression. Furthermore, it also opens a new way to interpret and utilize these copulas in the context of reproducibility. Using simulations, we show that our method produces calibrated type I error and is more powerful in detecting difference in reproducibility than existing measures of agreement. We illustrate the usefulness of our method using a ChIP-seq study and a microarray study.
© 2017, The International Biometric Society.

Entities:  

Keywords:  Copula; Correspondence curve regression; Cumulative link model; Genomics; High-throughput experiment; Reproducibility

Mesh:

Substances:

Year:  2017        PMID: 29192968      PMCID: PMC5975112          DOI: 10.1111/biom.12832

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  7 in total

1.  Multiple-laboratory comparison of microarray platforms.

Authors:  Rafael A Irizarry; Daniel Warren; Forrest Spencer; Irene F Kim; Shyam Biswal; Bryan C Frank; Edward Gabrielson; Joe G N Garcia; Joel Geoghegan; Gregory Germino; Constance Griffin; Sara C Hilmer; Eric Hoffman; Anne E Jedlicka; Ernest Kawasaki; Francisco Martínez-Murillo; Laura Morsberger; Hannah Lee; David Petersen; John Quackenbush; Alan Scott; Michael Wilson; Yanqin Yang; Shui Qing Ye; Wayne Yu
Journal:  Nat Methods       Date:  2005-04-21       Impact factor: 28.547

2.  Screening for partial conjunction hypotheses.

Authors:  Yoav Benjamini; Ruth Heller
Journal:  Biometrics       Date:  2008-02-06       Impact factor: 2.571

3.  A concordance correlation coefficient to evaluate reproducibility.

Authors:  L I Lin
Journal:  Biometrics       Date:  1989-03       Impact factor: 2.571

4.  Rat toxicogenomic study reveals analytical consistency across microarray platforms.

Authors:  Lei Guo; Edward K Lobenhofer; Charles Wang; Richard Shippy; Stephen C Harris; Lu Zhang; Nan Mei; Tao Chen; Damir Herman; Federico M Goodsaid; Patrick Hurban; Kenneth L Phillips; Jun Xu; Xutao Deng; Yongming Andrew Sun; Weida Tong; Yvonne P Dragan; Leming Shi
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

5.  AnyExpress: integrated toolkit for analysis of cross-platform gene expression data using a fast interval matching algorithm.

Authors:  Jihoon Kim; Kiltesh Patel; Hyunchul Jung; Winston P Kuo; Lucila Ohno-Machado
Journal:  BMC Bioinformatics       Date:  2011-03-17       Impact factor: 3.307

6.  A bivariate cumulative probit regression model for ordered categorical data.

Authors:  K Kim
Journal:  Stat Med       Date:  1995-06-30       Impact factor: 2.373

7.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.

Authors:  Stephen G Landt; Georgi K Marinov; Anshul Kundaje; Pouya Kheradpour; Florencia Pauli; Serafim Batzoglou; Bradley E Bernstein; Peter Bickel; James B Brown; Philip Cayting; Yiwen Chen; Gilberto DeSalvo; Charles Epstein; Katherine I Fisher-Aylor; Ghia Euskirchen; Mark Gerstein; Jason Gertz; Alexander J Hartemink; Michael M Hoffman; Vishwanath R Iyer; Youngsook L Jung; Subhradip Karmakar; Manolis Kellis; Peter V Kharchenko; Qunhua Li; Tao Liu; X Shirley Liu; Lijia Ma; Aleksandar Milosavljevic; Richard M Myers; Peter J Park; Michael J Pazin; Marc D Perry; Debasish Raha; Timothy E Reddy; Joel Rozowsky; Noam Shoresh; Arend Sidow; Matthew Slattery; John A Stamatoyannopoulos; Michael Y Tolstorukov; Kevin P White; Simon Xi; Peggy J Farnham; Jason D Lieb; Barbara J Wold; Michael Snyder
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

  7 in total
  1 in total

1.  Assessing reproducibility of high-throughput experiments in the case of missing data.

Authors:  Roopali Singh; Feipeng Zhang; Qunhua Li
Journal:  Stat Med       Date:  2022-02-17       Impact factor: 2.497

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.