Literature DB >> 22285993

Sample size determination for classifiers based on single-nucleotide polymorphisms.

Xinyu Liu1, Yupeng Wang, Romdhane Rekaya, T N Sriram.   

Abstract

Single-nucleotide polymorphisms (SNPs), believed to determine human differences, are widely used to predict risk of diseases. Typically, clinical samples are limited and/or the sampling cost is high. Thus, it is essential to determine an adequate sample size needed to build a classifier based on SNPs. Such a classifier would facilitate correct classifications, while keeping the sample size to a minimum, thereby making the studies cost-effective. For coded SNP data from 2 classes, an optimal classifier and an approximation to its probability of correct classification (PCC) are derived. A linear classifier is constructed and an approximation to its PCC is also derived. These approximations are validated through a variety of Monte Carlo simulations. A sample size determination algorithm based on the criterion, which ensures that the difference between the 2 approximate PCCs is below a threshold, is given and its effectiveness is illustrated via simulations. For the HapMap data on Chinese and Japanese populations, a linear classifier is built using 51 independent SNPs, and the required total sample sizes are determined using our algorithm, as the threshold varies. For example, when the threshold value is 0.05, our algorithm determines a total sample size of 166 (83 for Chinese and 83 for Japanese) that satisfies the criterion.

Entities:  

Mesh:

Year:  2012        PMID: 22285993     DOI: 10.1093/biostatistics/kxr053

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  3 in total

1.  Study design in high-dimensional classification analysis.

Authors:  Brisa N Sánchez; Meihua Wu; Peter X K Song; Wen Wang
Journal:  Biostatistics       Date:  2016-05-05       Impact factor: 5.899

2.  Sample size and statistical power calculation in genetic association studies.

Authors:  Eun Pyo Hong; Ji Wan Park
Journal:  Genomics Inform       Date:  2012-06-30

3.  Determination of sample size for a multi-class classifier based on single-nucleotide polymorphisms: a volume under the surface approach.

Authors:  Xinyu Liu; Yupeng Wang; T N Sriram
Journal:  BMC Bioinformatics       Date:  2014-06-14       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.