| Literature DB >> 17895998 |
Alexander Statnikov1, Chun Li, Constantin F Aliferis.
Abstract
BACKGROUND: The development of new high-throughput genotyping technologies has allowed fast evaluation of single nucleotide polymorphisms (SNPs) on a genome-wide scale. Several recent genome-wide association studies employing these technologies suggest that panels of SNPs can be a useful tool for predicting cancer susceptibility and discovery of potentially important new disease loci. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2007 PMID: 17895998 PMCID: PMC1978529 DOI: 10.1371/journal.pone.0000958
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1First two principal components extracted from SNPs that were selected by the method GLM1.
The first principal component provides a nearly perfect separation of cases from controls.
Figure 2Distribution of p-values computed by GLM1 and GLM2 SNP selection methods.
The figure is shown in logarithmic scale for convenience. The vertical line is the Bonferroni adjusted α-level (0.05/10,009). While there are SNPs that are significant according to GLM1 method, no SNP is significant by GLM2. The distribution of p-values for GLM2 is uniform, however the distribution for GLM1 is not.
Estimates of classification performance obtained by repeated 10-fold cross-validation procedure.
| Data used for the classifier | Classification performance (AUC) |
| {SNPs} | 0.51 |
| {Alc, Smk, Age, Pck} | 0.60 |
| {Fh} | 0.66 |
| {Fh, Alc, Smk, Age, Pck} | 0.73 |
| {SNPs}+{Alc, Smk, Age, Pck} | 0.62 |
| {SNPs}+{Fh} | 0.64 |
| {SNPs}+{Fh, Alc, Smk, Age, Pck} | 0.73 |
The classification algorithm is Support Vector Machines (SVM). Only SNPs selected by Recursive Feature Elimination (RFE) are used. The following abbreviations are used for variable names: Age (age at interview), Smk (tobacco use), Alc (alcohol consumption), Fh (family history of esophageal cancer), and Pck (consumption of pickled vegetables). The “+” symbol in the Data column denotes that the analysis was performed by ensembling approach.