| Literature DB >> 33256061 |
Jung Hyuk Lee1, Geon Woo Lee1, Guiyoung Bong2, Hee Jeong Yoo2,3, Hong Kook Kim1.
Abstract
Autism spectrum disorder (ASD) is a developmental disorder with a life-span disability. While diagnostic instruments have been developed and qualified based on the accuracy of the discrimination of children with ASD from typical development (TD) children, the stability of such procedures can be disrupted by limitations pertaining to time expenses and the subjectivity of clinicians. Consequently, automated diagnostic methods have been developed for acquiring objective measures of autism, and in various fields of research, vocal characteristics have not only been reported as distinctive characteristics by clinicians, but have also shown promising performance in several studies utilizing deep learning models based on the automated discrimination of children with ASD from children with TD. However, difficulties still exist in terms of the characteristics of the data, the complexity of the analysis, and the lack of arranged data caused by the low accessibility for diagnosis and the need to secure anonymity. In order to address these issues, we introduce a pre-trained feature extraction auto-encoder model and a joint optimization scheme, which can achieve robustness for widely distributed and unrefined data using a deep-learning-based method for the detection of autism that utilizes various models. By adopting this auto-encoder-based feature extraction and joint optimization in the extended version of the Geneva minimalistic acoustic parameter set (eGeMAPS) speech feature data set, we acquire improved performance in the detection of ASD in infants compared to the raw data set.Entities:
Keywords: acoustic feature extraction; autism spectrum disorder; auto-encoder; bidirectional long short-term memory (BLSTM); joint optimization
Year: 2020 PMID: 33256061 PMCID: PMC7731374 DOI: 10.3390/s20236762
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Distribution of age and gender (male/female).
| Ages (Month) | No. of Subjects Diagnosed as ASD | No. of Subjects Diagnosed as TD | No. of Infant Subjects |
|---|---|---|---|
| 6–12 months | 0 | 5 M/1 F | 5 M/1 F |
| 12–18 months | 1 M/3 F | 14 M/9 F | 15 M/12 F |
| 18–24 months | 3 M/3 F | 0 | 3 M/3 F |
| Age (average ± SD) | 19.20 ± 2.52 | 14.72 ± 2.45 | 15.92 ± 3.17 |
Detailed information on the age, gender, and initial and definite diagnosis dates of each infant in Table 1.
| Infant ID | Age (Months) on Initial Diagnosis Date | Gender | Initial Diagnosis Date | Definite Final Diagnosis Date (Year/Month/Day) | ASD/TD |
|---|---|---|---|---|---|
| 1 | 18 | Male | 2018/07/28 | 2018/08/28 | TD |
| 2 | 18 | Male | 2017/07/27 | 2017/08/27 | TD |
| 3 | 10 | Male | 2018/08/10 | 2018/09/10 | TD |
| 4 | 13 | Male | 2017/06/10 | 2017/07/10 | TD |
| 5 | 22 | Female | 2018/01/31 | 2018/02/28 | ASD |
| 6 | 16 | Male | 2018/03/17 | 2018/04/17 | TD |
| 7 | 17 | Female | 2018/06/30 | 2018/07/30 | TD |
| 8 | 14 | Female | 2018/01/06 | 2018/02/06 | TD |
| 9 | 18 | Male | 2018/07/17 | 2018/08/17 | TD |
| 10 | 14 | Male | 2017/11/04 | 2017/12/04 | TD |
| 11 | 17 | Female | 2017/06/29 | 2017/07/29 | ASD |
| 12 | 12 | Female | 2018/01/20 | 2018/02/20 | TD |
| 13 | 9 | Male | 2017/02/18 | 2017/03/18 | TD |
| 14 | 18 | Female | 2017/03/04 | 2017/04/04 | ASD |
| 15 | 18 | Male | 2018/05/19 | 2018/06/19 | TD |
| 16 | 24 | Female | 2018/08/08 | 2018/09/08 | ASD |
| 17 | 19 | Male | 2018/02/24 | 2018/03/24 | ASD |
| 18 | 19 | Male | 2017/04/18 | 2017/05/18 | ASD |
| 19 | 18 | Female | 2017/03/04 | 2017/04/04 | TD |
| 20 | 12 | Male | 2016/12/31 | 2017/01/31 | TD |
| 21 | 16 | Female | 2018/03/16 | 2018/04/16 | TD |
| 22 | 20 | Male | 2017/10/14 | 2017/11/14 | ASD |
| 23 | 15 | Male | 2018/05/09 | 2018/06/09 | ASD |
| 24 | 17 | Female | 2017/02/04 | 2017/03/04 | TD |
| 25 | 16 | Male | 2018/03/17 | 2018/04/17 | TD |
| 26 | 12 | Male | 2018/03/29 | 2018/04/29 | TD |
| 27 | 17 | Female | 2017/01/25 | 2017/02/25 | TD |
| 28 | 17 | Male | 2018/02/08 | 2018/03/08 | ASD |
| 29 | 14 | Male | 2018/01/13 | 2018/02/13 | TD |
| 30 | 16 | Male | 2016/11/30 | 2016/12/30 | TD |
| 31 | 12 | Male | 2017/03/22 | 2017/04/22 | TD |
| 32 | 15 | Male | 2017/03/11 | 2017/04/11 | TD |
| 33 | 16 | Male | 2017/12/05 | 2018/01/05 | TD |
| 34 | 13 | Female | 2017/12/13 | 2018/01/13 | TD |
| 35 | 15 | Female | 2017/03/25 | 2018/04/25 | TD |
| 36 | 13 | Male | 2018/08/25 | 2018/09/25 | TD |
| 37 | 21 | Male | 2017/06/24 | 2017/07/24 | ASD |
| 38 | 14 | Male | 2017/02/22 | 2017/03/22 | TD |
| 39 | 14 | Male | 2018/01/27 | 2018/02/27 | TD |
Amount (ratio) of each type of vocalization in seconds.
| Vocal Label | ASD | TD |
|---|---|---|
| 0 | 80.134 (0.104) | 267.897 (0.250) |
| 1 | 314.405 (0.409) | 443.498 (0.414) |
| 2 | 33.241 (0.043) | 34.766 (0.032) |
| 3 | 8.311 (0.011) | 57.286 (0.054) |
| 4 | 333.400 (0.433) | 266.794 (0.249) |
| Total | 769.491 | 1070.241 |
Figure 1Structure of a semi-supervised auto-encoder (AE) model. eGeMAPS, extended version of the Geneva minimalistic acoustic parameter set; ASD, autism spectrum disorder; TD, typical development.
Figure 2Structure of a joint optimization model of an auto-encoder (AE) and bidirectional long short-term memory (BLSTM).
Classification results from the support vector machine (SVM), BLSTM with 88 or 54 eGeMAPS features, 54 selected eGeMAPS features, and BLSTM with AE-encoded features.
| Models | SVM | BLSTM (eGeMAPS-54) | BLSTM (eGeMAPS-88) | BLSTM (AE-Encoded) | ||||
|---|---|---|---|---|---|---|---|---|
| Predicted To | ASD | TD | ASD | TD | ASD | TD | ASD | TD |
| ASD | 62 | 18 | 170 | 103 | 196 | 99 | 215 | 98 |
| TD | 413 | 632 | 305 | 547 | 279 | 551 | 260 | 552 |
| Accuracy | 0.6178 | 0.6373 | 0.6640 | 0.6818 | ||||
| Precision | 0.1305 | 0.3579 | 0.4126 | 0.4526 | ||||
| Recall | 0.7750 | 0.6227 | 0.6644 | 0.6869 | ||||
| F1 score | 0.2234 | 0.4545 | 0.5091 | 0.5457 | ||||
| UAR | 0.5514 | 0.5997 | 0.6302 | 0.6509 | ||||
UAR, unweighted average recall.
Figure 3Two-dimensional scatter plot for (a) eGeMAPS-88, (b) eGeMAPS-54, and (c) the AE processed by t-stochastic neighbor embedding (t-SNE).