Literature DB >> 24855328

Two-Step Hypothesis Testing When the Number of Variables Exceeds the Sample Size.

Yueh-Yun Chi1, Keith E Muller2.   

Abstract

Medical images and genetic assays typically generate data with more variables than subjects. Scientists may use a two-step approach for testing hypotheses about Gaussian mean vectors. In the first step, principal components analysis (PCA) selects a set of sample components fewer in number than the sample size. In the second step, applying classical multivariate analysis of variance (MANOVA) methods to the reduced set of variables provides the desired hypothesis tests. Simulation results presented here indicate that success of the PCA in the first step requires nearly all variation to occur in population components far fewer in number than the number of subjects. In the second step, multivariate tests fail to attain reasonable power except in restrictive, favorable cases. The results encourage using other approaches discussed in the article to provide dependable hypothesis testing with high dimension, low sample size data (HDLSS).

Entities:  

Keywords:  Eigenvalues estimation; HDLSS; MANOVA; Principal component analysis

Year:  2013        PMID: 24855328      PMCID: PMC4028141          DOI: 10.1080/03610918.2012.659819

Source DB:  PubMed          Journal:  Commun Stat Simul Comput        ISSN: 0361-0918            Impact factor:   1.118


  3 in total

1.  Exploratory factor analysis in behavior genetics research: factor recovery with small sample sizes.

Authors:  Kristopher J Preacher; Robert C MacCallum
Journal:  Behav Genet       Date:  2002-03       Impact factor: 2.805

2.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions.

Authors:  Iain M Johnstone; Arthur Yu Lu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

3.  Comparison of human and automatic segmentations of kidneys from CT images.

Authors:  Manjori Rao; Joshua Stough; Yueh-Yun Chi; Keith Muller; Gregg Tracton; Stephen M Pizer; Edward L Chaney
Journal:  Int J Radiat Oncol Biol Phys       Date:  2005-03-01       Impact factor: 7.038

  3 in total
  1 in total

1.  Power calculation for overall hypothesis testing with high-dimensional commensurate outcomes.

Authors:  Yueh-Yun Chi; Matthew J Gribbin; Jacqueline L Johnson; Keith E Muller
Journal:  Stat Med       Date:  2013-09-30       Impact factor: 2.373

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.