| Literature DB >> 18513389 |
Silvana Penco1, Massimo Buscema, Maria Cristina Patrosso, Alessandro Marocchi, Enzo Grossi.
Abstract
BACKGROUND: Few genetic factors predisposing to the sporadic form of amyotrophic lateral sclerosis (ALS) have been identified, but the pathology itself seems to be a true multifactorial disease in which complex interactions between environmental and genetic susceptibility factors take place. The purpose of this study was to approach genetic data with an innovative statistical method such as artificial neural networks to identify a possible genetic background predisposing to the disease. A DNA multiarray panel was applied to genotype more than 60 polymorphisms within 35 genes selected from pathways of lipid and homocysteine metabolism, regulation of blood pressure, coagulation, inflammation, cellular adhesion and matrix integrity, in 54 sporadic ALS patients and 208 controls. Advanced intelligent systems based on novel coupling of artificial neural networks and evolutionary algorithms have been applied. The results obtained have been compared with those derived from the use of standard neural networks and classical statistical analysisEntities:
Mesh:
Year: 2008 PMID: 18513389 PMCID: PMC2443147 DOI: 10.1186/1471-2105-9-254
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Genotype distribution at each marker locus: wild type (WT); heterozygous type (Hetero); homozygous type (Homo).
| LPA 93 C/T | rs1853021 | 45 | 0 | 9 | 161 | 6 | 41 | 0,872596 | 0,127404 |
| LPA 121 G/A | rs1800769 | 33 | 1 | 20 | 166 | 3 | 39 | 0,891827 | 0,108173 |
| APOA4 thr347ser | rs675 | 32 | 1 | 21 | 135 | 8 | 65 | 0,805288 | 0,194712 |
| APOA4 gln360his | rs5110 | 44 | 0 | 10 | 183 | 1 | 24 | 0,9375 | 0,0625 |
| APOB thr71ile | rs1367117 | 35 | 5 | 14 | 100 | 20 | 88 | 0,692308 | 0,307692 |
| APOC3 -641C/A | rs2542052 | 17 | 6 | 31 | 74 | 21 | 113 | 0,627404 | 0,372596 |
| APOC3 -482 C/T | rs2854117 | 23 | 2 | 29 | 104 | 13 | 91 | 0,71875 | 0,28125 |
| APOC3 -455 T/C | rs2854116 | 16 | 7 | 31 | 74 | 24 | 110 | 0,620192 | 0,379808 |
| APOC3 1100 C/T | rs4520 | 30 | 3 | 21 | 104 | 24 | 80 | 0,692308 | 0,307692 |
| APOC3 3175 C/G | rs5128 | 47 | 1 | 6 | 170 | 2 | 36 | 0,903846 | 0,096154 |
| APOC3 3206 T/G | rs4225 | 21 | 13 | 20 | 72 | 37 | 99 | 0,584135 | 0,415865 |
| APOE cys112arg | rs429358 | 49 | 0 | 5 | 175 | 1 | 32 | 0,918269 | 0,081731 |
| APOE arg158cys | rs7412 | 45 | 1 | 8 | 186 | 1 | 21 | 0,944712 | 0,055288 |
| ADRB3 trp64arg | rs4994 | 45 | 0 | 9 | 185 | 0 | 23 | 0,944712 | 0,055288 |
| PPAR pro12ala | rs1801282 | 48 | 0 | 6 | 170 | 1 | 37 | 0,90625 | 0,09375 |
| LIPC -480 C/T | rs1800588 | 29 | 9 | 16 | 146 | 11 | 51 | 0,824519 | 0,175481 |
| LPL -93 T/G | rs1800590 | 54 | 0 | 0 | 201 | 0 | 7 | 0,983173 | 0,016827 |
| LPL asp9asn | rs1801177 | 54 | 0 | 0 | 201 | 0 | 7 | 0,983173 | 0,016827 |
| LPL asn291ser | rs268 | 53 | 0 | 1 | 202 | 0 | 6 | 0,985577 | 0,014423 |
| LPL ser447term | rs328 | 47 | 0 | 7 | 164 | 3 | 41 | 0,887019 | 0,112981 |
| PON1 met55leu | rs854560 | 2 | 19 | 33 | 31 | 84 | 93 | 0,372596 | 0,627404 |
| PON1 gln192arg | rs662 | 26 | 10 | 18 | 102 | 26 | 80 | 0,682692 | 0,317308 |
| PON2 ser311cys | rs6954345 | 33 | 2 | 19 | 131 | 6 | 71 | 0,800481 | 0,199519 |
| LDLR NcoI +/- | rs5742911 | 28 | 5 | 21 | 98 | 19 | 91 | 0,689904 | 0,310096 |
| CETP -631 C/A | rs1800776 | 45 | 0 | 9 | 168 | 2 | 38 | 0,899038 | 0,100962 |
| CETP -629 C/A | rs1800775 | 6 | 16 | 32 | 49 | 60 | 99 | 0,473558 | 0,526442 |
| CETP ile405val | rs5882 | 20 | 9 | 25 | 86 | 30 | 92 | 0,634615 | 0,365385 |
| TNFbeta thr26asn | rs1041981 | 31 | 7 | 16 | 113 | 12 | 83 | 0,742788 | 0,257212 |
| MTHFR 677 C/T | rs1801133 | 19 | 12 | 23 | 60 | 35 | 113 | 0,560096 | 0,439904 |
| NOS3 -922 A/G | rs1800779 | 9 | 11 | 34 | 57 | 52 | 99 | 0,512019 | 0,487981 |
| NOS3 -690 C/T | rs3918226 | 40 | 2 | 12 | 164 | 3 | 41 | 0,887019 | 0,112981 |
| NOS3 glu298asp | rs1799983 | 18 | 11 | 25 | 75 | 36 | 97 | 0,59375 | 0,40625 |
| DCP1 IVS16 ins/del | rs1799752 | 24 | 8 | 22 | 38 | 70 | 100 | 0,423077 | 0,576923 |
| AGTR1 1166 A/C | rs5186 | 31 | 5 | 18 | 99 | 10 | 99 | 0,713942 | 0,286058 |
| AGT met235thr | rs699 | 14 | 4 | 36 | 67 | 46 | 95 | 0,550481 | 0,449519 |
| NPPA 664 G/A | rs5063 | 52 | 0 | 2 | 194 | 0 | 14 | 0,966346 | 0,033654 |
| NPPA 2238 T/C | rs5065 | 31 | 3 | 20 | 149 | 3 | 56 | 0,850962 | 0,149038 |
| ADD1 gly460trp | rs4961 | 36 | 3 | 15 | 145 | 4 | 59 | 0,838942 | 0,161058 |
| SCNN1A trp493arg | rs5742912 | 54 | 0 | 0 | 186 | 0 | 22 | 0,947115 | 0,052885 |
| SCNN1A ala663thr | rs2228576 | 27 | 3 | 24 | 94 | 18 | 96 | 0,682692 | 0,317308 |
| GNB3 825 C/T | rs5443 | 30 | 5 | 19 | 103 | 19 | 86 | 0,701923 | 0,298077 |
| ADRB2 arg16gly | rs1042713 | 11 | 21 | 22 | 31 | 84 | 93 | 0,372596 | 0,627404 |
| ADRB2 gln27glu | rs1042714 | 24 | 7 | 23 | 89 | 22 | 97 | 0,661058 | 0,338942 |
| MMP3(-1171) 5A/6A | rs3025058 | 9 | 14 | 31 | 34 | 59 | 115 | 0,439904 | 0,560096 |
| FII 20210 G/A | rs1799963 | 53 | 0 | 1 | 200 | 0 | 8 | 0,980769 | 0,019231 |
| FV arg506gln | rs6025 | 53 | 0 | 1 | 204 | 0 | 4 | 0,990385 | 0,009615 |
| FVII del/ins | rs5742910 | 41 | 1 | 12 | 144 | 4 | 60 | 0,836538 | 0,163462 |
| FVII arg353gln | rs6046 | 40 | 1 | 13 | 152 | 3 | 53 | 0,858173 | 0,141827 |
| PAI (-675) 5G/4G | rs1799768 | 8 | 21 | 25 | 40 | 52 | 116 | 0,471154 | 0,528846 |
| PAI 11053 G/T | rs7242 | 13 | 12 | 29 | 39 | 66 | 103 | 0,435096 | 0,564904 |
| FGB -455 G/A | rs1800790 | 34 | 2 | 18 | 121 | 8 | 79 | 0,771635 | 0,228365 |
| ITGA2 873 G/A | rs1062535 | 22 | 7 | 25 | 72 | 27 | 109 | 0,608173 | 0,391827 |
| ITGB3 leu33pro | rs5918 | 37 | 1 | 16 | 159 | 6 | 43 | 0,867788 | 0,132212 |
| SELE ser128arg | rs5361 | 45 | 0 | 9 | 168 | 2 | 38 | 0,899038 | 0,100962 |
| SELE leu554phe | rs5355 | 47 | 0 | 7 | 180 | 1 | 27 | 0,930288 | 0,069712 |
| ICAM gly214arg | rs1799969 | 54 | 0 | 0 | 171 | 0 | 37 | 0,911058 | 0,088942 |
| TNF alpha-376 G/A | rs1800750 | 51 | 0 | 3 | 204 | 0 | 4 | 0,990385 | 0,009615 |
| TNF alpha-308 G/A | rs1800629 | 44 | 0 | 10 | 158 | 1 | 49 | 0,877404 | 0,122596 |
| TNF alpha-244 G/A | rs673 | 54 | 0 | 0 | 207 | 0 | 1 | 0,997596 | 0,002404 |
| TNF alpha-238 G/A | rs361525 | 52 | 0 | 2 | 192 | 0 | 16 | 0,961538 | 0,038462 |
Deviations of the genotype frequencies from the Hardy-Weinberg equilibrium were tested in the control group with chi-squared statistics (p values ranging from 0.2 to 1.0). Allele frequencies at each marker locus were calculated from the genotype frequencies of the control group under the null hypothesis of Hardy-Weinberg equilibrium. Allele frequencies at each marker locus are reported.
Results obtained applying the random criterion and classifying with the linear discriminant analysis.
| LDA(1ab) | 78% | 78% | 78% | 78% | 29 |
| LDA(1ba) | 93% | 70% | 81% | 74% | 33 |
| LDA(2ab) | 82% | 78% | 80% | 79% | 28 |
| LDA(2ba) | 82% | 79% | 80% | 79% | 27 |
| LDA(3ab) | 63% | 81% | 72% | 77% | 30 |
| LDA(3ba) | 74% | 74% | 74% | 74% | 34 |
| LDA(4ab) | 78% | 71% | 75% | 73% | 36 |
| LDA(4ba) | 67% | 81% | 74% | 78% | 29 |
| LDA(5ab) | 82% | 67% | 75% | 70% | 39 |
| LDA(5ba) | 59% | 73% | 66% | 70% | 39 |
The second and third columns report the percentage of patients correctly classified as belonging to cases or controls. The fourth and fifth columns report the accuracy obtained by the model as arithmetic mean and weighted mean. The number of errors is reported in the last column. The last row reports the mean values of all the columns.
Results obtained applying the random criterion and classifying with a simple back propagation.
| FF_Bp(1ab) | 74% | 85% | 79% | 82% | 23 |
| FF_Bp(1ba) | 89% | 84% | 86% | 85% | 20 |
| FF_Bp(2ab) | 82% | 83% | 82% | 82% | 23 |
| FF_Bp(2ba) | 78% | 83% | 80% | 82% | 24 |
| FF_Bp(3ab) | 56% | 94% | 75% | 86% | 18 |
| FF_Bp(3ba) | 63% | 88% | 76% | 83% | 22 |
| FF_Bp(4ab) | 67% | 84% | 75% | 80% | 26 |
| FF_Bp(4ba) | 59% | 85% | 72% | 80% | 26 |
| FF_Bp(5ab) | 63% | 80% | 71% | 76% | 31 |
| FF_Bp(5ba) | 52% | 86% | 69% | 79% | 27 |
The second and third columns report the percentage of patients correctly classified as belonging to cases or controls. The fourth and fifth columns report the accuracy obtained by the model as arithmetic mean and weighted mean. The number of errors is reported in the last column. The last row reports the mean values of all the columns.
Results of ten experiment obtained applying TWIST procedure in an independent manner to the whole dataset.
| TWIST 1ab | 96% | 97% | 96.5% | 96% | 6 |
| TWIST 1ba | 98% | 95% | 97.5% | 96% | 4 |
| TWIST 2 ab | 100% | 94% | 97% | 96% | 5 |
| TWIST 2 ba | 100% | 96% | 98% | 96% | 5 |
| TWIST 3 ab | 97% | 99% | 98% | 97% | 4 |
| TWIST 3 ba | 95% | 99% | 97% | 96% | 3 |
| TWIST 4 ab | 98% | 90% | 95% | 93% | 9 |
| TWIST 4 ba | 100% | 94% | 97% | 94% | 7 |
| TWIST 5 ab | 92% | 96% | 94% | 96% | 5 |
| TWIST 5 ab | 92% | 98% | 96% | 96% | 5 |
In the first column ab means training on subset a and testing on subset b; ba means the opposite. The second and third columns report the percentage of patients correctly classified as belonging to cases or controls. The fourth and fifth columns report the accuracy obtained by the model as arithmetic mean and weighted mean. The number of errors is reported in the last column. The last row reports the mean values of all the columns.
Results obtained with ANNs using only the seven genetic variants selected by TWIST procedure.
| FF_Bp(ab) | 93% | 87% | 90% | 88% | 16 |
| FF_Bp(ba) | 89% | 74% | 82% | 78% | 28 |
| 91% | 81% | 86% | 83% | 22 | |
The second and third columns report the percentage of patients correctly classified as belonging to cases or controls. The fourth and fifth columns report the accuracy obtained by the model as arithmetic mean and weighted mean. The number of errors is reported in the last column. The last row reports the mean values of all the columns.