Literature DB >> 31920211

Robust Score Tests With Missing Data in Genomics Studies.

Kin Yau Wong1, Donglin Zeng2, D Y Lin2.   

Abstract

Analysis of genomic data is often complicated by the presence of missing values, which may arise due to cost or other reasons. The prevailing approach of single imputation is generally invalid if the imputation model is misspecified. In this paper, we propose a robust score statistic based on imputed data for testing the association between a phenotype and a genomic variable with (partially) missing values. We fit a semiparametric regression model for the genomic variable against an arbitrary function of the linear predictor in the phenotype model and impute each missing value by its estimated posterior expectation. We show that the score statistic with such imputed values is asymptotically unbiased under general missing-data mechanisms, even when the imputation model is misspecified. We develop a spline-based method to estimate the semiparametric imputation model and derive the asymptotic distribution of the corresponding score statistic with a consistent variance estimator using sieve approximation theory and empirical process theory. The proposed test is computationally feasible regardless of the number of independent variables in the imputation model. We demonstrate the advantages of the proposed method over existing methods through extensive simulation studies and provide an application to a major cancer genomics study.

Entities:  

Keywords:  Association tests; Imputation; Integrative analysis; Multiple genomics platforms; Semiparametric models; Sieve estimation

Year:  2019        PMID: 31920211      PMCID: PMC6951249          DOI: 10.1080/01621459.2018.1514304

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  18 in total

1.  Missing value estimation for DNA microarray gene expression data: local least squares imputation.

Authors:  Hyunsoo Kim; Gene H Golub; Haesun Park
Journal:  Bioinformatics       Date:  2004-08-27       Impact factor: 6.937

2.  Quantitative trait analysis in sequencing studies under trait-dependent sampling.

Authors:  Dan-Yu Lin; Donglin Zeng; Zheng-Zheng Tang
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-11       Impact factor: 11.205

Review 3.  The Wnt/β-catenin pathway in ovarian cancer: a review.

Authors:  Rebecca C Arend; Angelina I Londoño-Joshi; J Michael Straughn; Donald J Buchsbaum
Journal:  Gynecol Oncol       Date:  2013-10-11       Impact factor: 5.482

4.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.

Authors:  Yun Li; Cristen J Willer; Jun Ding; Paul Scheet; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

5.  Integrative analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: a non-linear model to predict abundance of undetected proteins.

Authors:  Wandaliz Torres-García; Weiwen Zhang; George C Runger; Roger H Johnson; Deirdre R Meldrum
Journal:  Bioinformatics       Date:  2009-05-15       Impact factor: 6.937

6.  Remodeling of the extracellular matrix through overexpression of collagen VI contributes to cisplatin resistance in ovarian cancer cells.

Authors:  Cheryl A Sherman-Baust; Ashani T Weeraratna; Leticia B A Rangel; Ellen S Pizer; Kathleen R Cho; Donald R Schwartz; Teresa Shock; Patrice J Morin
Journal:  Cancer Cell       Date:  2003-04       Impact factor: 31.743

7.  Gene expression profile of ovarian serous papillary carcinomas: identification of metastasis-associated genes.

Authors:  Eliana Bignotti; Renata A Tassi; Stefano Calza; Antonella Ravaggi; Elisabetta Bandiera; Elisa Rossi; Carla Donzelli; Brunella Pasinetti; Sergio Pecorelli; Alessandro D Santin
Journal:  Am J Obstet Gynecol       Date:  2007-03       Impact factor: 8.661

8.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

9.  MASS: meta-analysis of score statistics for sequencing studies.

Authors:  Zheng-Zheng Tang; Dan-Yu Lin
Journal:  Bioinformatics       Date:  2013-05-21       Impact factor: 6.937

10.  Candidate genes and pathways downstream of PAX8 involved in ovarian high-grade serous carcinoma.

Authors:  Tiziana de Cristofaro; Tina Di Palma; Amata Amy Soriano; Antonella Monticelli; Ornella Affinito; Sergio Cocozza; Mariastella Zannini
Journal:  Oncotarget       Date:  2016-07-05
View more
  2 in total

1.  pixy: Unbiased estimation of nucleotide diversity and divergence in the presence of missing data.

Authors:  Katharine L Korunes; Kieran Samuk
Journal:  Mol Ecol Resour       Date:  2021-02-05       Impact factor: 7.090

2.  Two-phase sample selection strategies for design and analysis in post-genome-wide association fine-mapping studies.

Authors:  Osvaldo Espin-Garcia; Radu V Craiu; Shelley B Bull
Journal:  Stat Med       Date:  2021-10-01       Impact factor: 2.497

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.