| Literature DB >> 34054599 |
Ping-I Lin1,2, Mohammad Ali Moni1, Susan Shur-Fen Gau3, Valsamma Eapen1,2.
Abstract
Objectives: The identification of subgroups of autism spectrum disorder (ASD) may partially remedy the problems of clinical heterogeneity to facilitate the improvement of clinical management. The current study aims to use machine learning algorithms to analyze microarray data to identify clusters with relatively homogeneous clinical features.Entities:
Keywords: autism spectrum disorder; genomics; language; machine learning; social cognition
Year: 2021 PMID: 34054599 PMCID: PMC8149626 DOI: 10.3389/fpsyt.2021.637022
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 4.157
Figure 1The workflow of the study scheme.
Clinical features of the patients in the current study.
| Age | 9.00 ( | 8.91 ( | |
| ADIR-BV | 17.83 ( | 8.55 ( | |
| ADIR-BN | 8.92 ( | 3.64 ( | |
| SCQ | 22.19 ( | 11.47 ( | |
| VIQ | 82.08 ( | 111.91 ( | |
| PIQ | 90.83 ( | 101.36 ( | |
| SRS | 89.61 ( | 79.55 ( |
ADIR-BV, Autism Diagnostic Interview–Revised, Qualitative Abnormalities in Communication, Total Verbal score. ADIR-BN, Autism Diagnostic Interview–Revised, Qualitative Abnormalities in Communication, Total Non-Verbal score. SCQ, Social Communication Questionnaire score; VIQ, verbal IQ; PIQ, performance IQ; SRS, Social Responsiveness Scale score.
The student's t-test was performed to evaluate whether the the two subgroups classified by the presence of language impairment had different values in each continuous variable.
Figure 2Differentially expressed 54 genes with fold changes and –logarithmic 10 adjusted p-values. The red circle represents logarithmic fold change and the blue color circle represents –logarithmic 10 adjusted p-value for each significant gene.
Figure 3Gene network analysis. The relationship among pathways enriched with candidate genes with expression levels associated with SCQ scores is shown.
Figure 4ASD subgroups identified using RF and PAM clustering algorithms. Dim1 and Dim2 correspond to principal components 1 and 2, respectively. (A,B) The results based on the top 10 principal components (PCs) and the 191 probes, respectively. We used the first two predictors to make the plots to demonstrate how different approaches classified the sample.
Figure 5SVM clustering results based on the top PCs. (A) Shows the color gradient that indicates how confidently a new point would be classified based on its features. PC1 and PC2 represent the first and second principal components, respectively. (B) Shows the color gradient that indicates how confidently a new point would be classified based on its features when predictors were based on all SCQ-associated probes. Probe 1 and probe 2 represent the first and second probes, respectively. The solid symbols indicate the support vectors and the hollow circles indicates other subjects. The circles and triangles represent the first and second subgroups, respectively.
Predicting performance of two machine learning algorithms.
| RF-PAM | 191 probes | 96.90% |
| RF-PAM | 10 PC | 67.70% |
| SVM | 191 probes | 99.90% |
| SVM | 10 PC | 93.30% |
Principal component.