Literature DB >> 28940296

Empirical null estimation using zero-inflated discrete mixture distributions and its application to protein domain data.

Iris Ivy M Gauran1,2, Junyong Park1, Johan Lim3, DoHwan Park1, John Zylstra1, Thomas Peterson4, Maricel Kann4, John L Spouge5.   

Abstract

In recent mutation studies, analyses based on protein domain positions are gaining popularity over gene-centric approaches since the latter have limitations in considering the functional context that the position of the mutation provides. This presents a large-scale simultaneous inference problem, with hundreds of hypothesis tests to consider at the same time. This article aims to select significant mutation counts while controlling a given level of Type I error via False Discovery Rate (FDR) procedures. One main assumption is that the mutation counts follow a zero-inflated model in order to account for the true zeros in the count model and the excess zeros. The class of models considered is the Zero-inflated Generalized Poisson (ZIGP) distribution. Furthermore, we assumed that there exists a cut-off value such that smaller counts than this value are generated from the null distribution. We present several data-dependent methods to determine the cut-off value. We also consider a two-stage procedure based on screening process so that the number of mutations exceeding a certain value should be considered as significant mutations. Simulated and protein domain data sets are used to illustrate this procedure in estimation of the empirical null using a mixture of discrete distributions. Overall, while maintaining control of the FDR, the proposed two-stage testing procedure has superior empirical power.
© 2017 The Authors. Biometrics published by Wiley Periodicals, Inc. on behalf of International Biometric Society.

Entities:  

Keywords:  Local false discovery rate; Protein domain; Zero-in ated generalized poisson

Mesh:

Year:  2017        PMID: 28940296      PMCID: PMC5862774          DOI: 10.1111/biom.12779

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  13 in total

1.  Objective method for estimating asymptotic parameters, with an application to sequence alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-09-13

2.  DMDM: domain mapping of disease mutations.

Authors:  Thomas A Peterson; Asa Adadey; Ivette Santana-Cruz; Yanan Sun; Andrew Winder; Maricel G Kann
Journal:  Bioinformatics       Date:  2010-08-04       Impact factor: 6.937

Review 3.  Signal transduction in cancer.

Authors:  Richard Sever; Joan S Brugge
Journal:  Cold Spring Harb Perspect Med       Date:  2015-04-01       Impact factor: 6.915

4.  Generalized Poisson distribution: the property of mixture of Poisson and comparison with negative binomial distribution.

Authors:  Harry Joe; Rong Zhu
Journal:  Biom J       Date:  2005-04       Impact factor: 2.207

5.  Fitting mixture models to grouped and truncated data via the EM algorithm.

Authors:  G J McLachlan; P N Jones
Journal:  Biometrics       Date:  1988-06       Impact factor: 2.571

Review 6.  The role of oncogenic kinases in human cancer (Review).

Authors:  C Tsatsanis; D A Spandidos
Journal:  Int J Mol Med       Date:  2000-06       Impact factor: 4.101

7.  Overexpression of NOTCH-regulated ankyrin repeat protein is associated with breast cancer cell proliferation.

Authors:  Tatsuhiko Imaoka; Tomomi Okutani; Kazuhiro Daino; Daisuke Iizuka; Mayumi Nishimura; Yoshiya Shimada
Journal:  Anticancer Res       Date:  2014-05       Impact factor: 2.480

8.  Activation of an olfactory receptor inhibits proliferation of prostate cancer cells.

Authors:  Eva M Neuhaus; Weiyi Zhang; Lian Gelis; Ying Deng; Joachim Noldus; Hanns Hatt
Journal:  J Biol Chem       Date:  2009-04-23       Impact factor: 5.157

Review 9.  Cadherins and cancer: how does cadherin dysfunction promote tumor progression?

Authors:  A Jeanes; C J Gottardi; A S Yap
Journal:  Oncogene       Date:  2008-11-24       Impact factor: 9.867

10.  A protein domain-centric approach for the comparative analysis of human and yeast phenotypically relevant mutations.

Authors:  Thomas A Peterson; DoHwan Park; Maricel G Kann
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

View more
  1 in total

1.  Oncodomains: A protein domain-centric framework for analyzing rare variants in tumor samples.

Authors:  Thomas A Peterson; Iris Ivy M Gauran; Junyong Park; DoHwan Park; Maricel G Kann
Journal:  PLoS Comput Biol       Date:  2017-04-20       Impact factor: 4.475

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.