| Literature DB >> 23418752 |
Jinseog Kim1, Insuk Sohn, Dae-Soon Son, Dong Hwan Kim, Taejin Ahn, Sin-Ho Jung.
Abstract
BACKGROUND: A popular objective of many high-throughput genome projects is to discover various genomic markers associated with traits and develop statistical models to predict traits of future patients based on marker values.Entities:
Mesh:
Year: 2013 PMID: 23418752 PMCID: PMC3651372 DOI: 10.1186/1471-2105-14-58
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Mean numbers of SNPs and prognostic SNPs included in fitted prediction models, recovery rate and means/standard deviations of the log-rank p-value from test sets for the proposed method and methods assuming recessive, dominant, or multiplicative models for all SNPs
| | | |||||
|---|---|---|---|---|---|---|
| 30% | 0 | Proposed | 6.72 | 5.05 | 0.75 | <0.0001 (<0.0001) |
| | | Recessive | 8.03 | 4.01 | 0.50 | 0.0052 (0.0018) |
| | | Dominant | 6.66 | 3.85 | 0.58 | <0.0001 (<0.0001) |
| | | Multiplicative | 7.72 | 4.95 | 0.64 | 0.0004 (0.0003) |
| | 0.3 | Proposed | 6.51 | 4.83 | 0.74 | 0.0001 (0.0001) |
| | | Recessive | 7.73 | 3.83 | 0.50 | 0.0045 (0.0016) |
| | | Dominant | 6.58 | 3.66 | 0.56 | 0.0011 (0.0007) |
| | | Multiplicative | 7.52 | 4.72 | 0.63 | 0.0006 (0.0004) |
| 15% | 0 | Proposed | 6.65 | 5.18 | 0.78 | <0.0001 (<0.0001) |
| | | Recessive | 8.59 | 4.19 | 0.49 | 0.0028 (0.0011) |
| | | Dominant | 6.69 | 3.88 | 0.58 | <0.0001 (<0.0001) |
| | | Multiplicative | 7.96 | 4.98 | 0.63 | 0.0005 (0.0005) |
| | 0.3 | Proposed | 6.37 | 4.98 | 0.78 | <0.0001 (<0.0001) |
| | | Recessive | 7.88 | 3.94 | 0.50 | 0.0048 (0.0028) |
| | | Dominant | 6.38 | 3.74 | 0.59 | 0.0011 (0.0011) |
| Multiplicative | 7.55 | 4.89 | 0.65 | 0.0001 (<0.0001) |
Mean number of SNPs and prognostic SNPs included in the fitted prediction models, recovery rate and means/standard deviations of the log-rank p-values from the test set for the proposed method at = 0 and censoring = 30%
| 200 | 0.8 | 6.72 | 5.05 | 0.75 | <0.0001 (<0.0001) |
| | 1 | 6.13 | 5.18 | 0.85 | <0.0001 (<0.0001) |
| | 2 | 5.60 | 5.17 | 0.92 | <0.0001 (<0.0001) |
| 300 | 0.8 | 6.18 | 5.53 | 0.89 | <0.0001 (<0.0001) |
| 400 | 0.8 | 5.89 | 5.72 | 0.97 | <0.0001 (<0.0001) |
Figure 1Kaplan-Meier curves for high- and low-risk groups classified by the LOOCV procedure.
List of 24 SNPs selected by the proposed method from the whole data set of 119 samples, their MaxTest p-values, genetic models, the number of times selected by prediction models fitted during the LOOCV procedure
| rs1030254 | 16 | 60696651 | LOC644649, CDH8, LOC729159 | 3 | 0.00009 | 119 |
| rs1030252 | 16 | 60696869 | LOC644649, CDH8, LOC729159 | 2 | 0.00010 | 119 |
| rs10798122 | 1 | 187584699 | PLA2G4A, FAM5C | 1 | 0.00048 | 119 |
| rs10026207 | 4 | 186039201 | HELT, SLC25A4 | 3 | 0.00233 | 119 |
| rs13333329 | 16 | 1695776 | CRAMP1L | 3 | 0.00015 | 117 |
| rs2132183 | 3 | 84966867 | LOC440970,CADM2 | 3 | 0.00149 | 117 |
| rs1950400 | 14 | 27105035 | MIR4307,NOVA1 | 2 | 0.00040 | 115 |
| rs2155777 | 11 | 133290007 | OPCML | 3 | 0.00142 | 113 |
| rs1677914 | 12 | 78274425 | NAV3 | 2 | 0.00283 | 106 |
| rs1476847 | 18 | 9834599 | RAB31 | 1 | 0.00029 | 102 |
| rs7614596 | 3 | 84986027 | LOC440970,CADM2 | 2 | 0.00020 | 100 |
| rs2648117 | 4 | 186787096 | SORBS2 | 3 | 0.00856 | 90 |
| rs1851317 | 15 | 35077786 | GJD2,ACTC1 | 1 | 0.00999 | 88 |
| rs3790217 | 20 | 19441650 | SLC24A3 | 2 | 0.00728 | 85 |
| rs4902990 | 14 | 72618432 | RGS6 | 2 | 0.00004 | 81 |
| rs9482583 | 6 | 125318379 | RNF217 | 3 | 0.00847 | 79 |
| rs3020444 | 14 | 64791013 | ESR2 | 3 | 0.00288 | 77 |
| rs10851869 | 15 | 74331083 | PML | 2 | 0.00036 | 65 |
| rs11986200 | 8 | 22698209 | PEBP4 | 1 | 0.00222 | 63 |
| rs11260756 | 1 | 16759616 | SPATA21 | 1 | 0.00827 | 63 |
| rs4968415 | 17 | 60264240 | MED13,TBC1D3P2 | 1 | 0.00075 | 62 |
| rs12416722 | 11 | 133300460 | OPCML | 1 | 0.00067 | 59 |
| rs626266 | 12 | 72888187 | TRHDE | 2 | 0.00070 | 52 |
| rs16852300 | 2 | 167414424 | SCN7A,XIRP2 | 3 | 0.00513 | 33 |