| Literature DB >> 33805090 |
Katarzyna Zaorska1, Tomasz Szczapa2, Maria Borysewicz-Lewicka3, Michał Nowicki1, Karolina Gerreth3.
Abstract
BACKGROUND: Several genes and single nucleotide polymorphisms (SNPs) have been associated with early childhood caries. However, they are highly age- and population-dependent and the majority of existing caries prediction models are based on environmental and behavioral factors only and are scarce in infants.Entities:
Keywords: artificial neural network; complex trait; early childhood caries; early prediction model; single nucleotide polymorphisms
Year: 2021 PMID: 33805090 PMCID: PMC8064067 DOI: 10.3390/genes12040462
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Twenty-two differentiated SNPs analyzed in the current study.
| SNPs Analyzed in This Study | |||||
|---|---|---|---|---|---|
| SNP ID | Gene | Gene ID 1 | Chromosome: Location | Type of Variant | AMINO Acid Change |
| rs4547741 |
| 4057 | 3:46458968 | intron variant | C>T |
| rs7217186 |
| 246 | 17:4636097 | intron variant | C>T |
| rs10429371 |
| 4325 | 8:88981259 | intron variant | C>T |
| rs7096206 |
| 4153 | 10:52771925 | intron variant | C>G |
| rs1884302 |
| 650 | 20:7125642 | intron variant | C>T |
| rs1711437 |
| 9313 | 11:102594495 | intron variant | C>T |
| rs1784418 |
| 9313 | 11:102613665 | intron variant | C>T |
| rs17878486 |
| 265 | X:11295828 | intron variant | C>T |
| rs34538475 |
| 258 | 4:70605459 | intron variant | G>T |
| rs4694075 |
| 258 | 4:70601197 | intron variant | C>T |
| rs3790506 |
| 7286 | 1:151565890 | intron variant | A>G |
| rs4970957 |
| 7286 | 1:151544912 | intron variant | A>G |
| rs2337360 |
| 7286 | 1:151542127 | intron variant | A>G |
| rs134136 |
| 24144 | 22:26503508 | intron variant | C>T |
| rs5997096 |
| 24144 | 22:26499991 | intron variant | C>T |
| rs2235091 |
| 9622 | 19:50907215 | intron variant | A>G |
| rs198969 |
| 9622 | 19:50910072 | intron variant | C>G |
| rs7671281 |
| 10117 | 4:70643369 | missense variant C>T, 2 NM_031889.3:c.1943T>C, 3 NP_114095.2:p.Ile648Thr | C>T |
| rs3796704 |
| 10117 | 4:70643714 | missense variant G>A, NM_031889.3:c.2288G>A, NP_114095.2:p.Arg763Gln | A>G |
| rs12640848 |
| 10117 | 4:70640695 | intron variant | A>G |
| rs144929717 |
| 10117 | 4:70632586 | intron variant | G>A |
| rs139228330 |
| 10117 | 4:70635957 | intron variant | A>G |
Abbreviations: SNP—single nucleotide polymorphism; ID—identification; A—adenine; C—cytosine; G—guanine; T—thymine; Ile—isoleucine; Thr—threonine; Arg—arginine; Gln—glutamine; 1 Gene ID is an accession number from ncbi database (https://www.ncbi.nlm.nih.gov/, accessed on 17 January 2021); 2 NM_—prefix for nucleotide position in the reference sequence accessions for mRNA, c.—coding; 3 NP_—prefix for amino acid position in the reference sequence accessions for protein, p—protein.
Inclusion and exclusion criteria.
| Inclusion Criteria | Exclusion Criteria |
|---|---|
|
| |
| Age between 20 and 40 months | Age under 20 and over 40 months old |
| Group with the same ethnic, regional, cultural, or demographic origin | Other ethnic, regional, cultural, or demographic origin |
| Parental written and informed consent for dental check-up and oral swab collection | No parental written and informed consent for dental check-up and oral swab collection |
| Child’s cooperativeness | Child’s uncooperativeness |
| Set of properly filled in child’s dental chart and sample for molecular analysis | Lack of set of properly filled in child’s dental chart and sample for molecular analysis |
| Individuals from four nursery schools situated in the city of Poznan (Wielkopolska Province, central-west Poland) that constitute one institution | Individuals from other nursery schools than those selected to the research |
| Children present at nursery school on days of examination | Children absent at nursery school on days of examination |
| From 11 to 20 erupted primary teeth present in the oral cavity | Less than 11 primary teeth present in the oral cavity |
| Caucasian origin | Other than Caucasian origin |
|
| |
| Dental caries present in child’s dentition | Lack of dental caries in child’s dentition |
|
| |
| Lack of dental caries in child’s dentition | Dental caries present in child’s dentition |
Genotype and alleles distribution differences for 6 SNPs genotyped in this study between study subgroups (caries vs. caries-free) and between our cohort and CUE data.
| Gene | SNP ID | Genotypes | Caries | Control | Test/Model 1 | OR [95% CI] | CEU data | ||
|---|---|---|---|---|---|---|---|---|---|
| Alleles | n (Freq) | n (Freq) | OR [95% CI] | ||||||
|
| rs4547741 | CC | 35 (0.73) | 32 (0.68) | CC vs. CT+TT | 1.3 [0.5–3.1] | 0.6059 | 0.5 [0.2–0.9] |
|
| CT | 12 (0.25) | 14 (0.3) | CT vs. CC+TT | 0.8 [0.3–1.9] | 0.6012 | 2 [0.9–3.9] | 0.0605 | ||
| TT | 1 (0.02) | 1 (0.02) | TT vs. CC+CT | 1.0 [0.1–16.1] | 0.9880 | 5.3 [0.3–112.3] | 0.2826 | ||
| C | 82 (0.85) | 78 (0.83) | T vs. C | 0.8 [0.4–1.8] | 0.6452 | 2.1 [1.1–4.1] |
| ||
| T | 14 (0.15) | 16 (0.17) | |||||||
|
| rs7217186 | CC | 13 (0.27) | 13 (0.28) | CC vs. CT+TT | 1.0 [0.4–2.4] | 0.9498 | 2.1 [1–4.3] |
|
| CT | 23 (0.48) | 21 (0.44) | CT vs. CC+TT | 1.1 [0.5–2.6] | 0.7519 | 0.7 [0.4–1.3] | 0.2523 | ||
| TT | 12 (0.25) | 13 (0.28) | TT vs. CC+CT | 0.9 [0.4–2.2] | 0.7686 | 0.8 [0.4–1.5] | 0.5382 | ||
| C | 49 (0.51) | 47 (0.5) | T vs. C | 1.0 [0.5–1.7] | 0.8858 | 0.7 [0.5–1.1] | 0.1101 | ||
| T | 47 (0.49) | 47 (0.5) | |||||||
|
| rs10429371 | CC | 18 (0.37) | 17 (0.36) | CC vs. CT+TT | 1.1 [0.5–2.4] | 0.8931 | 18.7 [5.5–63.4] |
|
| CT | 20 (0.42) | 30 (0.64) | CT vs. CC+TT | 0.4 [0.2–0.9] |
| 2.7 [1.5–4.8] |
| ||
| TT | 10 (0.21) | 0 (0.00) | TT vs. CC+CT | 25.9 [1.5–456.4] |
| 0.1 [0.03–0.1] |
| ||
| C | 56 (0.58) | 64 (0.68) | T vs. C | 1.5 [0.8–2.8] | 0.1645 | 0.1 [0.1–0.2] |
| ||
| T | 40 (0.42) | 30 (0.32) | |||||||
|
| rs7096206 | CC | 31 (0.65) | 23 (0.49) | CC vs. CG+GG | 1.9 [0.8–4.3] | 0.1254 | 0.9 [0.5–1.7] | 0.8095 |
| CG | 16 (0.33) | 16 (0.34) | CG vs. CC+GG | 1.0 [0.4–2.3] | 0.9417 | 0.8 [0.5–1.5] | 0.4959 | ||
| GG | 1 (0.02) | 8 (0.17) | GG vs. CC+CG | 0.1 [0.01–0.9] |
| 3.5 [0.9–12.8] | 0.0768 | ||
| C | 78 (0.81) | 62 (0.66) | G vs. C |
| 1.3 [0.8–2] | 0.3473 | |||
| G | 18 (0.19) | 32 (0.34) | |||||||
|
| rs1884302 | CC | 9 (0.19) | 6 (0.13) | CC vs. CT+TT | 1.6 [0.5–4.8] | 0.4263 | 2.5 [1–6.4] | 0.0616 |
| CT | 21 (0.44) | 21 (0.45) | CT vs. CC+TT | 1.0 [0.4–2.2] | 0.9272 | 0.7 0.4–1.3] | 0.3091 | ||
| TT | 18 (0.37) | 20 (0.42) | TT vs. CC+CT | 0.8 [0.4–1.8] | 0.6154 | 0.9 [0.5–1.7] | 0.8412 | ||
| C | 39 (0.41) | 33 (0.35) | T vs. C | 0.8 [0.4–1.4] | 0.4334 | 0.8 [0.5–1.2] | 0.2969 | ||
| T | 57 (0.59) | 61 (0.65) | |||||||
|
| rs1711437 | CC | 19 (0.4) | 20 (0.43) | CC vs. CT+TT | 0.9 [0.4–2.0] | 0.7686 | 1.3 [0.7–2.3] | 0.4143 |
| CT | 23 (0.48) | 23 (0.49) | CT vs. CC+TT | 1.0 [0.4–2.1] | 0.9208 | 1 [0.6–1.8] | 0.9929 | ||
| TT | 6 (0.12) | 4 (0.08) | TT vs. CC+CT | 1.5 [0.4–5.8] | 0.5287 | 0.6 [0.3–1.4] | 0.2526 | ||
| C | 61 (0.64) | 63 (0.67) | T vs. C | 1.2 [0.6–2.1] | 0.6146 | 0.8 [0.5–1.2] | 0.2497 | ||
| T | 35 (0.36) | 31 (0.33) | |||||||
The total number of individuals with caries n = 48, and controls n = 47. Abbreviations: SNP—single nucleotide polymorphism; ID—identification; OR—odds ratio; 95% CI—the 95% Confidence Interval; CEU—Utah residents with Northern and Western European ancestry, from the International HapMap Project; LTF—Lactotransferrin; ALOX15—Arachidonate 15-Lipoxygenase; MMP16—Matrix Metallopeptidase 16; MBL2—Mannose Binding Lectin 2; SMAD6—SMAD Family Member 6; MMP20—Matrix Metallopeptidase 20; C—cytosine; G—guanine; T—thymine.1 The odds ratio (OR) values with corresponding p-values were assessed by the Fisher’s exact test for the inheritance models: dominant AA vs. AB+BB; over-dominant AB vs. AA+BB; recessive BB vs. AA+AB; allelic A vs. B. p-values: p < 0.05 *, p < 0.01 **, p < 0.001 ***; all significant results are shown in bold.
Cochran–Armitage test for trend analysis for 22 differentiated single nucleotide polymorphisms analyzed in this study.
| SNP ID | Cochran–Armitage Test for Trend | |
|---|---|---|
| Chi2 | ||
| rs4547741 | 0.219 | 0.6400 |
| rs7217186 | 0.019 | 0.8898 |
| rs10429371 | 2.234 | 0.1350 |
|
| 5.063 |
|
| rs1884302 | 0.579 | 0.4469 |
| rs1711437 | 0.272 | 0.6019 |
|
| 35.991 |
|
| rs34538475 | 1.779 | 0.1823 |
| rs4694075 | 2.494 | 0.1143 |
| rs3790506 | 1.510 | 0.2191 |
| rs4970957 | 1.885 | 0.1698 |
|
| 23.599 |
|
| rs134136 | 0.230 | 0.6316 |
| rs5997096 | 0.130 | 0.7182 |
|
| 6.782 |
|
|
| 9.586 |
|
| rs1784418 | 0.584 | 0.4447 |
| rs7671281 | 0.0013 | 0.9712 |
| rs3796704 | 0.0492 | 0.8245 |
|
| 11.343 |
|
| rs144929717 | 0.0492 | 0.8245 |
| rs139228330 | 0.0492 | 0.8245 |
Abbreviations: SNP—single nucleotide polymorphism; ID—identification; significant p-value indicates additive model of inheritance, i.e., the possibility of heterozygote advantage. p-values: p < 0.05 *, p < 0.01 **, p < 0.001 ***; all significant results are shown in bold.
Haplotypes distribution in caries and caries-free individuals in this study.
| SNPs (Gene) | Haplotypes | Frequency | OR [95% CI] | ||
|---|---|---|---|---|---|
| Caries | Controls | ||||
| rs34538475_rs4694075 | GC | 0.503 | 0.108 | 8.4 [3.9–18.0] | |
| ( | GT | 0.236 | 0.200 | 1.2 [0.6–2.5] | 0.5501 |
| TC | 0.038 | 0.317 | 0.1 [0.03–0.3] | ||
| TT | 0.222 | 0.374 | 0.5 [0.3–0.9] |
| |
| rs2337360_rs4970957_rs3790506 | A A A | 0.129 | 0.294 | 0.4 [0.2–0.8] |
|
|
| A A G | 0.412 | 0.265 | 2.0 [1.1–3.6] |
|
| A G G | 0.284 | 0.037 | 10.21 [3.2–32.4] | ||
| G A G | 0.074 | 0.197 | 0.3 [3.2–32.4] |
| |
| G G A | 0.000 | 0.015 | - | 0.2322 | |
| G G G | 0.000 | 0.192 | - | ||
| A G A | 0.039 | 0.000 | - | 0.0547 | |
| G A A | 0.061 | 0.000 | - |
| |
| rs134136_rs5997096 | CC | 0.573 | 0.606 | 0.9 [0.5–1.0] | 0.6392 |
|
| TC | 0.302 | 0.287 | 1.1 [0.6–2.0] | 0.8225 |
| TT | 0.125 | 0.106 | 1.2 [0.5–2.9] | 0.6884 | |
| rs2235091_rs198969 | AC | 0.432 | 0.670 | 0.4 [0.2–0.7] |
|
|
| AG | 0.181 | 0.118 | 1.7 [0.7–3.8] | 0.2191 |
| GC | 0.108 | 0.075 | 1.5 [0.6–4.1] | 0.4273 | |
| GG | 0.288 | 0.138 | 2.5 [1.2–5.8] |
| |
| rs7671281_rs3796704_rs12640848_rs144929717_rs139228330 | CAAGA | 0.500 | 0.309 | 2.2 [1.2–4.1] |
|
|
| CAGGA | 0.437 | 0.649 | 0.4 [0.2–0.8] |
|
| TGAAG | 0.041 | 0.043 | 1.0 [0.2–4.0] | 0.9644 | |
| TGGAG | 0.000 | 0.000 | - | - | |
| CAAAG | 0.011 | 0.000 | - | 0.3167 | |
| TGAGA | 0.000 | 0.000 | - | - | |
| TGGGA | 0.010 | 0.000 | - | 0.3234 | |
| rs1711437_rs1784418 | CC | 0.614 | 0.617 | 1.0 [0.6–1.8] | 0.9614 |
|
| CT | 0.022 | 0.053 | 0.4 [0.1–2.0] | 0.2540 |
| TT | 0.322 | 0.330 | 1.0 [0.5–1.8] | 0.9081 | |
| TC | 0.043 | 0.000 | - |
| |
The total number of individuals with caries n=48, and controls n=47. Abbreviations: SNPs—single nucleotide polymorphisms; OR—odds ratio; 95% CI—the 95% Confidence Interval; AMBN—Ameloblastin; TUFT1—Tuftelin 1; TFIP11—Tuftelin Interacting Protein 11; KLK4—Kallikrein Related Peptidase 4; ENAM—enamelin; MMP20—Matrix Metallopeptidase 20; A—adenine, C—cytosine; G—guanine; T—thymine; p-values: p < 0.05 *, p < 0.01 **, p < 0.001 ***; all significant results are shown in bold.
The performance of univariable logistic regression analysis.
| Predictor | Coefficient of Determination R2 | AUC [95% CI] | Sensitivity/Specificity/Total Number of Correct Calls | Statistical Significant Genotypes Distribution 1 | ||
|---|---|---|---|---|---|---|
| rs4547741 | 0.0039 | 0.8703 | 0.525 [0.420–0.628] | 0.617 | 72.9%/31.9%/52.6% | - |
| rs7217186 | 0.0201 | 0.6964 | 0.524 [0.419–0.627] | 0.891 | 48%/55.3%/51.6% | - |
| rs10429371 | 0.2054 |
| 0.560 [0.544–0.744] | 0.268 | 58.3%/63.8%/61.1% | + |
| rs7096206 | 0.0852 |
| 0.599 [0.494–0.699] | 0.0531 | 64.6%/51.1%/57.9% | + |
| rs1884302 | 0.0098 | 0.7051 | 0.539 [0.434–0.642] | 0.4742 | 62.5%/42.6%/52.6% | - |
| rs1711437 | 0.0059 | 0.8115 | 0.525 [0.420–0.628] | 0.6426 | 12.5%/91.5%/51.6% | - |
| rs17878486 | 0.4695 |
| 0.830 [0.739–0.899] |
| 62.5%/93.6%/77.9% | + |
| rs34538475 | 0.0327 | 0.1298 | 0.571 [0.465–0.672] | 0.1253 | 35.4%/78.7%/56.8% | + |
| rs4694075 | 0.0299 | 0.1423 | 0.582 [0.476–0.682] | 0.1404 | 75%/38.3%/56.8% | - |
| rs3790506 | 0.0212 | 0.2177 | 0.559 [0.453–0.660] | 0.2661 | 58.3%/51.1%/54.7% | - |
| rs4970957 | 0.0264 | 0.1683 | 0.580 [0.475–0.681] | 0.1237 | 62.5%/55.3%/59% | - |
| rs2337360 | 0.3813 | <0.0001 *** | 0.781 [0.685–0.860] |
| 75%/89%/77.9% | + |
| rs134136 | 0.0032 | 0.6314 | 0.527 [0.422–0.630] | 0.6251 | 68.8%/36.2%/52.6% | - |
| rs5997096 | 0.0026 | 0.6669 | 0.519 [0.414–0.622] | 0.6699 | 25%/78.7%/51.6% | - |
| rs2235091 | 0.1317 |
| 0.625 [0.520–0.723] |
| 20.8%/97.9%/59% | + |
| rs198969 | 0.1635 |
| 0.658 [0.554–0.752] |
| 70.8%/51.1%/61.1% | + |
| rs1784418 | 0.0050 | 0.5498 | 0.535 0.430–0.638] | 0.5168 | 41.7%/66%/53.7% | - |
| rs7671281 | 0.0014 | 0.7509 | 0.510 [0.405–0.614] | 0.7533 | 100%/0%/50.5% | - |
| rs3796704 | 0.0014 | 0.7509 | 0.510 [0.405–0.614] | 0.7533 | 100%/0%/50.5% | - |
| rs12640848 | 0.2016 |
| 0.672 [0.568–0.765] |
| 93.8%/38.3%/66.3% | + |
| rs144929717 | 0.0014 | 0.7509 | 0.510 [0.405–0.614] | 0.7533 | 100%/0%/50.5% | - |
| rs139228330 | 0.0014 | 0.7509 | 0.510 [0.405–0.614] | 0.7533 | 100%/0%/50.5% | - |
| Age | 0.0050 | 0.4976 | 0.548 [0.443–0.807] | 0.4195 | 39.6%/74.5%/56.8% | - |
| Number of all teeth erupted | 0.0104 | 0.3882 | 0.563 [0.457–0.664] | 0.2454 | 64.6%/51.1%/57.9% | - |
| Nr of incisors | 0.0192 | 0.2408 | 0.510 [0.406–0.614] | 0.3173 | 2.1%/100%/50.5% | - |
| Nr of canines | 0.0019 | 0.7104 | 0.529 [0.424–0.632] | 0.2803 | 10.4%/95.7%/52.6% | - |
| Nr of molars | 0.0235 | 0.1940 | 0.572 [0.467–0.673] | 0.1702 | 64.6%/51.7%/57.9% | - |
Abbreviations: AUC—area under the receiver operating characteristic; 95% CI—the 95% Confidence Interval; 1 the column depicts variables that were statistically significant (+) or insignificant (−) for the differences in genotype/allele distribution in caries vs. caries-free individuals in the previous studies [14,15] and/or in univariable logistic regression and/or ROC curve analysis in this study. p-values: p < 0.05 *, p < 0.01 **, p < 0.001 ***; all significant results are shown in bold.
The overall performance of multivariable logistic regression analysis.
| Test | |
|---|---|
|
| |
| Null model-2 Log Likelihood | 131.69 |
| Full model-2 Log Likelihood | 42.09 |
| Chi2 | 89.59 |
|
| 15 |
| <0.0001 *** | |
| Cox and Snell R2 | 0.6106 |
| Negelkerke R2 | 0.8141 |
|
| |
| Hosmer and Lemeshow test: | |
| Chi2 | 1.58 |
|
| 7 |
| 0.9793 | |
Abbreviations: df—degrees of freedom; Cox and Snell R2 and Negelkerke R2—pseudo R2s that indicate how well the model explains the data; for both coefficients, the closer the value is to “1”, the better the model. Hosmer and Lemeshow test—explains how well the data fit the model; the closer the value is to “1”, the better the data are fitted. p-values: p < 0.001 ***.
The multivariable logistic regression analysis.
| Predictor | β Coefficient | SEβ | Wald’s Chi2 |
| eβ (Odds Ratio) | [95% CI] | |
|---|---|---|---|---|---|---|---|
| Intercept | 0.021 | 0.205 | 0.011 | 1 | 0.918 | 1.021 | - |
| rs10429371_CC | 1.101 | 1.04 | 1.13 | 2 | 0.2882 | 3.01 | [0.39–22.94] |
| rs10429371_TT | 34.4928 | 13,615.64 | 6.42 × 10−6 | 1 | 0.998 | 9.55 × 1014 | - |
| rs7096206_CG | −0.6751 | 1.06 | 0.41 | 1 | 0.523 | 0.51 | [0.06–4.04] |
| rs7096206_GG | −19.4524 | 8976.37 | 4.69 × 10−6 | 1 | 0.9983 | 3.56 × 10−9 | - |
| rs34538475_GT | 0.1413 | 1.09 | 0.02 | 1 | 0.8969 | 1.15 | [0.14–9.77] |
| rs17878486_CT | 1.6945 | 1.09 | 2.41 | 1 | 0.1207 | 5.44 | [0.64–46.28] |
|
| 5.4355 | 1.63 | 11.15 | 1 |
| 229.41 | [9.45–5576.54] |
|
| 3.1515 | 1.1 | 8.29 | 2 |
| 23.37 | [2.73–199.80] |
| rs2337360_GG | 19.7138 | 42,999.26 | 2.10 × 10−7 | 1 | 0.9996 | 3.64 × 108 | - |
| rs2235091_AG | 1.1351 | 1.01 | 1.26 | 1 | 0.2623 | 3.11 | [0.43–22.64] |
| rs2235091_GG | 1.7072 | 1.66 | 1.06 | 1 | 0.3025 | 5.51 | [0.22–141.52] |
| rs198969_CC | −0.2473 | 1.03 | 0.06 | 2 | 0.811 | 0.78 | [0.10–5.93] |
| rs198969_GG | 1.0457 | 1.72 | 0.37 | 1 | 0.542 | 2.85 | [0.10–82.01] |
| rs12640848_AA | 0.2135 | 1.08 | 0.04 | 2 | 0.8432 | 1.24 | [0.15–10.26] |
|
| −3.7776 | 1.84 | 4.21 | 1 |
| 0.02 | [0.001–0.84] |
| Constatnt | −3.9125 | 1.68 | 5.44 | 1 |
| - | - |
Abbreviations: SNPs—single nucleotide polymorphisms; SE—standard error; df—degrees of freedom; 95% CI—the 95% Confidence Interval; A—adenine, C—cytosine, G—guanine, T—thymine. Intercept represents the regression constant in the model without predictors; p-values: p < 0.05 *, p < 0.01 **, p < 0.001 ***; all significant results are shown in bold.
Figure 1The receiver operator characteristic (ROC) curve analysis and area under the receiver operating characteristic (AUC) value. The solid black line indicates the ROC curve based on 8 variables, i.e., rs10429371, rs7096206, rs12640848, rs17878486, rs12640848, rs2337360, rs2235091 and rs198969, and the true positives (sensitivity) and false negatives (100-specificity) for caries prediction when compared to the actual data. The dashed lines indicate the ROC curves for 95% Confidence Interval.
The overall performance for 6 neural network models for caries prediction.
| Neural Network ID | Train | Test | Validation | Learning Algorithm 1 | Error Rate Function 2 | Activation Function (Hidden Layers) | Activation Function (Output Layer) | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | Error | Accuracy | Error | Accuracy | Error | |||||
| NN1 | 0.970 | 0.007 | 0.984 | 0.004 | 0.736 | 0.084 | BFGS 28 | SOS | Tanh | Tanh |
| NN2 | 0.805 | 0.044 | 0.909 | 0.021 | 0.872 | 0.037 | BFGS 5 | SOS | Exponential | Exponential |
| NN3 | 0.840 | 0.037 | 0.944 | 0.016 | 0.854 | 0.038 | BFGS 4 | SOS | Linear | Logistic |
| NN4 | 0.842 | 0.036 | 0.935 | 0.017 | 0.853 | 0.038 | BFGS 4 | SOS | Tanh | Logistic |
| NN5 | 0.824 | 0.040 | 0.962 | 0.012 | 0.859 | 0.041 | BFGS 7 | SOS | Logistic | Logistic |
| NN6 | 0.867 | 0.031 | 0.912 | 0.022 | 0.862 | 0.038 | BFGS 10 | SOS | Tanh | Exponential |
Abbreviations: ID—identification; NN—neural network; 1 The Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm; 2 symbiotic organisms search (SOS) error rate.
Figure 2Prediction sensitivity for 8 SNPs used as predictors in neural network modeling. The higher the error rate value, the more sensitive the model is to the lack of a variable. The arrangement of the predictors in the figure is shown on the basis of the mean error value for the 6 neural network models (from NN1to NN6).