| Literature DB >> 33573429 |
Michiel Siebelt1, Dirk Das1, Amber Van Den Moosdijk1, Tristan Warren1, Peter Van Der Putten2, Walter Van Der Weegen1.
Abstract
Background and purpose - Machine learning (ML) techniques are a form of artificial intelligence able to analyze big data. Analyzing the outcome of (digital) questionnaires, ML might recognize different patterns in answers that might relate to different types of pathology. With this study, we investigated the proof-of-principle of ML-based diagnosis in patients with hip complaints using a digital questionnaire and the Kellgren and Lawrence (KL) osteoarthritis score.Patients and methods - 548 patients (> 55 years old) scheduled for consultation of hip complaints were asked to participate in this study and fill in an online questionnaire. Our questionnaire consists of 27 questions related to general history-taking and validated patient-related outcome measures (Oxford Hip Score and a Numeric Rating Scale for pain). 336 fully completed questionnaires were related to their classified diagnosis (either hip osteoarthritis, bursitis or tendinitis, or other pathology). Different AI techniques were used to relate questionnaire outcome and hip diagnoses. Resulting area under the curve (AUC) and classification accuracy (CA) are reported to identify the best scoring AI model. The accuracy of different ML models was compared using questionnaire outcome with and without radiologic KL scores for degree of osteoarthritis.Results - The most accurate ML model for diagnosis of patients with hip complaints was the Random Forest model (AUC 82%, 95% CI 0.78-0.86; CA 69%, CI 0.64-0.74) and most accurate analysis with addition of KL scores was with a Support Vector Machine model (AUC 89%, CI 0.86-0.92; CA 83%, CI 0.79-0.87).Interpretation - Analysis of self-reported online questionnaires related to hip complaints can differentiate between basic hip pathologies. The addition of radiological scores for osteoarthritis further improves these outcomes.Entities:
Year: 2021 PMID: 33573429 PMCID: PMC8231380 DOI: 10.1080/17453674.2021.1884408
Source DB: PubMed Journal: Acta Orthop ISSN: 1745-3674 Impact factor: 3.717
Selected hyperparameters for each evaluated algorithm
| Algorithm | Hyperparameter | Value |
| SVM | Epsilon | 0.1 |
| Cost (C) | 1 | |
| Kernel | RBF | |
| Decision tree | Min. number of | |
| instances in leaves | 2 | |
| Do not split subsets smaller than | 5 | |
| Max. tree depth | 100 | |
| Logistic regression | Regularization | L2 |
| Cost (C) | 1 | |
| Neural network | Hidden neurons | 100 |
| Activation | ReLu | |
| Solver | Adam | |
| Alpha | 0.0001 | |
| KNN | Number of neighbors | 5 |
| Metric | Euclidean | |
| Weight | Uniform | |
| Random Forest | Number of trees | 10 |
| Number of attributes | ||
| considered at each split | 5 | |
| Max. tree depth | 3 | |
| Do not split subsets smaller than | 5 |
Artificial intelligence analysis using machine learning (ML) algorithms on pre-hospital-acquired patient history-taking form for patients aged > 55 years with hip complaints. Values are ML algorithm accuracy in percent
| History-taking only | KL score added | |||||
|---|---|---|---|---|---|---|
| Prediction | Prediction | |||||
| Dataset | AUC | CA | model | AUC | CA | model |
| All questions | 82 | 69 | RF | 89 | 83 | SVM |
| Top 5 questions only | 82 | 73 | SVM | 85 | 79 | SVM |
| Top 10 questions only | 78 | 70 | SVM | 79 | 79 | SVM |
RF = Random Forest.
SVM = Standard Vector Machine.
Distribution (%) of the KL scores accross the 3 diagnosis groups
| KL score | |||||
| Diagnosis | 0 | 1 | 2 | 3 | 4 |
| Bursitis/tendinitis | 24 | 55 | 20 | 1 | 0 |
| Osteoarthritis | 1 | 7 | 26 | 48 | 18 |
| Other | 11 | 52 | 11 | 15 | 11 |