| Literature DB >> 32624476 |
Abstract
OBJECTIVES: We investigated the usefulness of machine learning artificial intelligence (AI) in classifying the severity of ophthalmic emergency for timely hospital visits. STUDYEntities:
Keywords: accident & emergency medicine; biotechnology & bioinformatics; ophthalmology
Mesh:
Year: 2020 PMID: 32624476 PMCID: PMC7337880 DOI: 10.1136/bmjopen-2020-037161
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Emergency severity classification
| Red (immediate) | Orange (within 24 hours) | Yellow (within 7 days) | Green (not emergency) | |
| Diagnosis | Acute glaucoma | Corneal abrasion | AMD, CSCR, or other retinal disorders | Allergy conjunctivitis |
| Symptom | Sudden continuous visual disturbance | Marked one side redness | Diplopia or other visual disturbance | Squint (long-lasting) |
| Event | Pain in postop eye | Arc eye | ||
| Treatment | Urgent operation | Continuous monitoring for 24–72 hours |
AMD, age-related macular degeneration; CN, cranial nerve; CRAO, central retinal artery occlusion; CRVO, central retinal vein occlusion; CSCR, central serous chorioretinopathy; DR, diabetic retinopathy; FB, foreign body; PVD, posterior vitreous detachment.
The clinical variables for training set and validation set
| Category (number of variables) | |
| General patient informations | Others |
| Symptoms | |
| Events | Characteristics (in each symptom category) |
NVC, near vision card; VAS, Visual Analogue Scale.
Figure 1Data set, sampling and the model architecture. Prior to any data processing sequence, patient data were randomly divided into two data sets, that was, training set and validation set (ratio 7:1). The number of each class validation samples were extracted similarly. In training set, unbalanced data set was verified using the synthetic minority oversampling technique (SMOTE) algorithm. The model included the K-fold cross-validation method (K=10) for increasing the model performance. In machine learning algorithm, vision and pain variables were processed by each multilayer perceptron and concatenated with general patient information and other symptom variables. Overfitting was controlled by 50% dropout method, L2 regulation and the early stopping technique. The prediction performance was measured by accuracy of whole data set and precision, recall and F1 score of each class.
The proportion of ECode I and ECode II
| Total 1681 (cases) | Red | Orange | Yellow | Green |
| ECode I | ||||
| Training set (1456 to 100%) | 49 (3.39%) | 63 (4.30%) | 115 (7.92%) | 1229 (84.39%) |
| Validation set (225 to 100%) | 61 (27.11%) | 61 (27.11%) | 52 (23.11%) | 51 (22.67%) |
| Total (100%) | 110 (6.54%) | 124 (7.38%) | 167 (9.93%) | 1280 (76.15%) |
| ECode II | ||||
| Training set (1471 to 100%) | 77 (5.20%) | 110 (7.47%) | 120 (8.14%) | 1164 (24.76%) |
| Validation set (210 to 100%) | 54 (25.72%) | 52 (24.76%) | 52 (24.76%) | 52 (24.76%) |
| Total (100%) | 131 (7.79%) | 162 (9.64%) | 172 (10.23%) | 1216 (72.34%) |
Diagnoses and treatments, except symptoms such as the red flag sign and events, were only used for the first classification (ECode I). Another classification consisted of symptoms, events, diagnoses and treatment (ECode II).
The relationships of ECode I and ECode II
| ECode II | |||||
| Red | Orange (%) | Yellow (%) | Green (%) | ||
| ECode II | Red | 65.2 | 13.0 | – | 21.8 |
| Orange | – | 48.5 | 21.2 | 30.3 | |
| Yellow | – | – | 77.8 | 22.2 | |
| Green | – | – | – | 100 | |
The correlations coefficient between ECode I and ECode II was 0.837 (p=0.0000).
The model performance of ECcSMOTE II
| Total (%) | Red (%) | Orange (%) | Yellow (%) | Green (%) | |
| Accuracy | 99.05 | – | – | – | – |
| Precision | – | 100 | 98.10 | 92.73 | 100 |
| Recall | – | 100 | 100 | 98.08 | 95.33 |
| F1 score | – | 100 | 99.04 | 95.33 | 96.00 |
ECcSMOTE II was the model labelled by ECode II with SMOTE algorithm and their input variables included general patient information.
SMOTE, synthetic minority oversampling technique.