| Literature DB >> 16451595 |
Abstract
A supervised learning method, support vector machine, was used to analyze the microsatellite marker dataset of the Collaborative Study on the Genetics of Alcoholism Problem 1 for the Genetic Analysis Workshop 14. Twelve binary-valued phenotype variables were chosen for analyses using the markers from all autosomal chromosomes. Using various polynomial kernel functions of the support vector machine and randomly divided genome regions, we were able to observe the association of some marker sets with the chosen phenotypes and thus reduce the size of the dataset. The successful classifications established with the chosen support vector machine kernel function had high levels of correctness for each prediction, e.g., 96% in the fourfold cross-validations. However, owing to the limited sample data, we were not able to test the predictions of the classifiers in the new sample data.Entities:
Mesh:
Year: 2005 PMID: 16451595 PMCID: PMC1866800 DOI: 10.1186/1471-2156-6-S1-S136
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Twelve phenotype variables
| No. | Label name of phenotypic traits | Description |
| 1 | Deceased | Individuals who are deceased |
| 2 | ALDX | A combination of ALDX1 and ALDX2 |
| 3 | Binge | Ever binge drink |
| 4 | Blackouts | Blackouts (3 or more) |
| 5 | Morning | Morning drinking |
| 6 | Craving | Craving |
| 7 | Pers | Persistent desire to stop drinking |
| 8 | Narrow | Narrowing of drinking repertoire |
| 9 | GUATD | Stands for "give up activities to drink" |
| 10 | WDSX | Stands for "withdrawal symptoms (2 or more together)" |
| 11 | Phy | Physical health problems from drinking |
| 12 | Emo | Emotional/psychological problems from drinking |
Note: categorical values are assigned as – 1 for No and 1 for Yes In case of ALDX, -1 for ≤ 3 and 1 otherwise
Twelve genome datasets with phenotype variables
| No. | Label name of phenotypic traits | Proportion of classes | Total records | ||
| No. of "-1" | No. of "-1" | "-1" vs. "1" | |||
| 1 | Deceased | 1202 | 2 | 601.0 | 1204 |
| 2 | ALDX | 273 | 927 | 0.3 | 1200 |
| 3 | Binge | 711 | 312 | 2.3 | 1023 |
| 4 | Blackouts | 624 | 399 | 1.6 | 1023 |
| 5 | Morning | 627 | 395 | 1.6 | 1022 |
| 6 | Craving | 826 | 197 | 4.2 | 1023 |
| 7 | Pers | 495 | 528 | 0.9 | 1023 |
| 8 | Narrow | 766 | 218 | 3.5 | 984 |
| 9 | GUATD | 767 | 256 | 3.0 | 1023 |
| 10 | WDSX | 776 | 235 | 3.3 | 1011 |
| 11 | Phy | 833 | 190 | 4.4 | 1023 |
| 12 | Emo | 746 | 277 | 2.7 | 1023 |
Figure 1Results of microsatellite markers pattern/association with ALDX with maximum (4) repeating positives from SVM analysis using 4 different kernels.