| Literature DB >> 35797364 |
Yu Han1, Donna M Rizzo2, John P Hanley3, Emily L Coderre1, Patricia A Prelock1.
Abstract
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that can cause significant social, communication, and behavioral challenges. Diagnosis of ASD is complicated and there is an urgent need to identify ASD-associated biomarkers and features to help automate diagnostics and develop predictive ASD models. The present study adopts a novel evolutionary algorithm, the conjunctive clause evolutionary algorithm (CCEA), to select features most significant for distinguishing individuals with and without ASD, and is able to accommodate datasets having a small number of samples with a large number of feature measurements. The dataset is unique and comprises both behavioral and neuroimaging measurements from a total of 28 children from 7 to 14 years old. Potential biomarker candidates identified include brain volume, area, cortical thickness, and mean curvature in specific regions around the cingulate cortex, frontal cortex, and temporal-parietal junction, as well as behavioral features associated with theory of mind. A separate machine learning classifier (i.e., k-nearest neighbors algorithm) was used to validate the CCEA feature selection and for ASD prediction. Study findings demonstrate how machine learning tools might help move the needle on improving diagnostic and predictive models of ASD.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35797364 PMCID: PMC9262216 DOI: 10.1371/journal.pone.0269773
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Participant behavioral assessments scores: NT vs. ASD.
| NT Group (n = 19) | ASD Group (n = 9) | Group Difference | ||
|---|---|---|---|---|
|
| 10.2 (7–14) | 11 (8–13) | ||
|
| 113 (89–132) | 78.8 (40–121) | ||
|
| 111.9 (100–135) | 96.2 (56–127) | ||
|
| 110.9 (91–124) | 102 (67–121) | ||
|
| 13.3 (11–15) | 11.1 (5–14) | ||
|
|
| 17.2 (11.5–19.3) | 12.4 (9.1–17.6) | |
|
| 17.7 (13.8–20) | 14.6 (11.6–17.9) | ||
|
| 18 (13.2–19.7) | 14 (8.5–19.2) | ||
|
| 16 (8.3–18.9) | 9.4 (6–16.4) | ||
|
| 18.5 (14.1–20) | 17.3 (12.5–20) | ||
|
| 17.5 (7.1–20) | 16.9 (13.1–20) | ||
|
| 17.8 (12.2–20) | 15.3 (10.3–20) | ||
|
| 15.3 (5.8–20) | 10.3 (4.8–20) | ||
|
| 18.9 (15–20) | 16.7 (14.6–20) | ||
Subject inclusion and distribution.
| Models | Subjects | Include Subjects Used In CCEA? N = 21 (14 NT & 7 ASD) | Include Subjects from the Later Cohort That Was Not Used In CCEA? N = 7 (5 NT & 2 ASD) | |
|---|---|---|---|---|
| CCEA | NT = 14 ASD = 7 | All | None | |
| KNN Validation Model | NT = 19 ASD = 9 | All | All | |
| KNN Prediction Model | Training | NT = 6 | NT = 6 | NT = None |
| ASD = 6 | ASD = 4 | ASD = 2 | ||
| Testing | NT = 13 ASD = 3 | NT = 8 | NT = 5 | |
| ASD = 3 | ASD = None | |||
Second-order CC model features and range of values.
| CC # | Feature One | Value | Feature Two | Value |
|---|---|---|---|---|
| 113 | lh posteriorcingulate volume | [3500,4600] | lh rostralmiddlefrontal volume | [20000,25000] |
| 679 | lh posteriorcingulate volume | [3500,4600] | rh rostralmiddlefrontal volume | [21000,26000] |
| 1449 | lh posteriorcingulate volume | [3500,4600] | lh medialorbitofrontal thickness | [2.6,2.8] |
| 2199 | lh posteriorcingulate volume | [3500,4600] | rh rostralmiddlefrontal area | [6400,8500] |
| 887 | rh isthmuscingulate volume | [3300,4100] | rh posteriorcingulate volume | [4100,6200] |
| 1488 | rh isthmuscingulate volume | [3300,4100] | lh medialorbitofrontal thickness | [2.6,2.8] |
| 2200 | rh isthmuscingulate volume | [3300,4100] | rh rostralmiddlefrontal area | [6400,8500] |
| 2269 | rh isthmuscingulate volume | [3300,4100] | rh posteriorcingulate area | [1400,2900] |
Fig 12D visualization of second-order CC models.
Green dots represent ASD subjects and group together within the rectangle defining the range of values in Table 3.
Third-order CC model features and range of values.
| CC # | Feature One | Value | Feature Two | Value | Feature Three | Value |
|---|---|---|---|---|---|---|
| 46 | ToMTB total | [5,13] | ToMI early subscale mean | [12,18] | lh parsorbitalis meancurv | [0.17,0.2] |
| 191 | ToMTB total | [5,13] | ToMI early subscale mean | [12,18] | rh superiorparietal thickness | [2.4,2.7] |
| 264 | ToMTB total | [5,13] | ToMI early subscale mean | [12,18] | rh parsorbitalis meancurv | [0.18,0.2] |
| 317 | ToMTB total | [5,13] | ToMI early subscale mean | [12,18] | rh inferiortemporal meancurv | [0.15,0.18] |
| 1163 | ToMTB total | [5,13] | ToMI total composite mean | [9,18] | lh postcentral thickness | [2.2,2.5] |
| 1446 | ToMTB total | [5,13] | ToMI early subscale mean | [12,18] | lh medialorbitofrontal thickness | [2.6,2.8] |
Fig 23D visualization of third-order CC models.
Green dots represent ASD subjects and group together within the pink cube defining the range of values in Table 4.
Cross validation confusion matrices.
| N = 28 (9 ASD & 19 NT) | Second-order Model Features | Third-order Model Features | Second-order Model Features Plus Three Behavioral Features | |||
|---|---|---|---|---|---|---|
| Predicted ASD | Predicted NT | Predicted ASD | Predicted NT | Predicted ASD | Predicted NT | |
|
| 8 | 1 | 7 | 2 | 8 | 1 |
|
| 2 | 17 | 4 | 15 | 3 | 16 |
Classification confusion matrices.
| N = 16 (3 ASD & 13 NT) | Second-order Model Features | Third-order Model Features | Second-order Model Features Plus Three Behavioral Features | |||
|---|---|---|---|---|---|---|
| Predicted ASD | Predicted NT | Predicted ASD | Predicted NT | Predicted ASD | Predicted NT | |
|
| 3 | 0 | 3 | 0 | 3 | 0 |
|
| 2 | 11 | 3 | 10 | 1 | 12 |