Sheng-Yang Tsui1, Yu Tsao2, Chii-Wann Lin3, Shih-Hau Fang4, Feng-Chuan Lin5,6, Chi-Te Wang5,6,7. 1. Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan. 2. Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan. 3. Biomedical Technology and Device Research Laboratories, Industrial Technology Research Institute, Hsinchu, Taiwan. 4. Department of Electrical Engineering, Yuan Ze University, Taoyuan, Taiwan. 5. Department of Otolaryngology Head and Neck Surgery, Far Eastern Memorial Hospital, Taipei, Taiwan. 6. Department of Special Education, University of Taipei, Taipei, Taiwan. 7. Department of Otolaryngology Head and Neck Surgery, National Taiwan University College of Medicine, Taipei, Taiwan.
Abstract
BACKGROUND: Studies have used questionnaires of dysphonic symptoms to screen voice disorders. This study investigated whether the differential presentation of demographic and symptomatic features can be applied to computerized classification. METHODS: We recruited 100 patients with glottic neoplasm, 508 with phonotraumatic lesions, and 153 with unilateral vocal palsy. Statistical analyses revealed significantly different distributions of demographic and symptomatic variables. Machine learning algorithms, including decision tree, linear discriminant analysis, K-nearest neighbors, support vector machine, and artificial neural network, were applied to classify voice disorders. RESULTS: The results showed that demographic features were more effective for detecting neoplastic and phonotraumatic lesions, whereas symptoms were useful for detecting vocal palsy. When combining demographic and symptomatic variables, the artificial neural network achieved the highest accuracy of 83 ± 1.58%, whereas the accuracy achieved by other algorithms ranged from 74 to 82.6%. Decision tree analyses revealed that sex, age, smoking status, sudden onset of dysphonia, and 10-item voice handicap index scores were significant characteristics for classification. CONCLUSION: This study demonstrated a significant difference in demographic and symptomatic features between glottic neoplasm, phonotraumatic lesions, and vocal palsy. These features may facilitate automatic classification of voice disorders through machine learning algorithms.
BACKGROUND: Studies have used questionnaires of dysphonic symptoms to screen voice disorders. This study investigated whether the differential presentation of demographic and symptomatic features can be applied to computerized classification. METHODS: We recruited 100 patients with glottic neoplasm, 508 with phonotraumatic lesions, and 153 with unilateral vocal palsy. Statistical analyses revealed significantly different distributions of demographic and symptomatic variables. Machine learning algorithms, including decision tree, linear discriminant analysis, K-nearest neighbors, support vector machine, and artificial neural network, were applied to classify voice disorders. RESULTS: The results showed that demographic features were more effective for detecting neoplastic and phonotraumatic lesions, whereas symptoms were useful for detecting vocal palsy. When combining demographic and symptomatic variables, the artificial neural network achieved the highest accuracy of 83 ± 1.58%, whereas the accuracy achieved by other algorithms ranged from 74 to 82.6%. Decision tree analyses revealed that sex, age, smoking status, sudden onset of dysphonia, and 10-item voice handicap index scores were significant characteristics for classification. CONCLUSION: This study demonstrated a significant difference in demographic and symptomatic features between glottic neoplasm, phonotraumatic lesions, and vocal palsy. These features may facilitate automatic classification of voice disorders through machine learning algorithms.