| Literature DB >> 35462698 |
Bo Deng1, Wenwen Zhu2, Xiaochuan Sun1, Yanfeng Xie1, Wei Dan1, Yan Zhan1, Yulong Xia1, Xinyi Liang1, Jie Li2, Quanhong Shi1, Li Jiang1.
Abstract
The main purpose of the study was to explore a reliable way to automatically handle emergency cases, such as intracerebral hemorrhage (ICH). Therefore, an artificial intelligence (AI) system, named, H-system, was designed to automatically recognize medical text data of ICH patients and output the treatment plan. Furthermore, the efficiency and reliability of the H-system were tested and analyzed. The H-system, which is mainly based on a pretrained language model Bidirectional Encoder Representations from Transformers (BERT) and an expert module for logical judgment of extracted entities, was designed and founded by the neurosurgeon and AI experts together. All emergency medical text data were from the neurosurgery emergency electronic medical record database (N-eEMRD) of the First Affiliated Hospital of Chongqing Medical University, Chongqing Emergency Medical Center, and Chongqing First People's Hospital, and the treatment plans of these ICH cases were divided into two types. A total of 1,000 simulated ICH cases were randomly selected as training and validation sets. After training and validating on simulated cases, real cases from three medical centers were provided to test the efficiency of the H-system. Doctors with 1 and 5 years of working experience in neurosurgery (Doctor-1Y and Doctor-5Y) were included to compare with H-system. Furthermore, the data of the H-system, for instance, sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), and the area under the receiver operating characteristics curve (AUC), were calculated and compared with Doctor-1Y and Doctor-5Y. In the testing set, the time H-system spent on ICH cases was significantly shorter than that of doctors with Doctor-1Y and Doctor-5Y. In the testing set, the accuracy of the H-system's treatment plan was 88.55 (88.16-88.94)%, the specificity was 85.71 (84.99-86.43)%, and the sensitivity was 91.83 (91.01-92.65)%. The AUC value of the H-system in the testing set was 0.887 (0.884-0.891). Furthermore, the time H-system spent on ICH cases was significantly shorter than that of doctors with Doctor-1Y and Doctor-5Y. The accuracy and AUC of the H-system were significantly higher than that of Doctor-1Y. In addition, the accuracy of the H-system was more closed to that of Doctor-5Y. The H-system designed in the study can automatically recognize and analyze medical text data of patients with ICH and rapidly output accurate treatment plans with high efficiency. It may provide a reliable and novel way to automatically and rapidly handle emergency cases, such as ICH.Entities:
Keywords: artificial intelligence (AI); intracerebral hemorrhage (ICH); natural language processing (NLP); neurosurgery emergency electrical medical record database (N-eEMRD); stroke
Year: 2022 PMID: 35462698 PMCID: PMC9028758 DOI: 10.3389/fnagi.2022.798132
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.702
Baseline characteristics of the total study population.
| Characteristic | Training set ( | Validation set ( | Testing set ( | |
| Age, years | 66 ± 15.1 | 65 ± 14.9 | 64 ± 15.0 | 0.763 |
| Female, % | 396 (56.6) | 176 (58.7) | 551 (56.0) | 0.758 |
| Male, % | 304 (43.4) | 124 (41.3) | 433 (44.0) | 0.736 |
| Weight, kg | 61.5 ± 11.7 | 60.9 ± 12.1 | 62.1 ± 12.4 | 0.695 |
| Height, cm | 161 ± 14.5 | 159 ± 15.9 | 160 ± 15.3 | 0.786 |
| Hypertension, % | 476 (68.0) | 219 (73.0) | 699 (71.0) | 0.758 |
| Diabetes, % | 189 (27.0) | 84 (28.0) | 267 (27.1) | 0.774 |
FIGURE 1(A) Flowchart of H-system. Medical texts were input into H-system and then analyzed. After that, the treatment plan was automatically output. BERT, Bidirectional Encoder Representations from Transformers. (B) Found of neurosurgery emergency electronic medical record database (N-eEMRD). Simulated intracerebral hemorrhage (ICH) cases were used for the development of H-system and internal validation with help of an experienced neurosurgeon and reference of the guidelines, and real ICH cases were used for external validation and efficiency testing.
FIGURE 2Testing of H-system’s efficiency and reliability. (A) ROC of H-system in validation set. (B) ROC of H-system in testing set. (A and B meant that the treatment output by H-system was accurate and reliable.) (C) The total time H-system, Doctor-1Y, and Doctor-5Y spent on 60 cases. (D) The mean time H-system, Doctor-1Y, and Doctor-5Y spent on single case. (C and D indicated that the time of Doctor-1Y and Doctor-5Y spent on the fixed quantity cases was significantly longer than H-system.) (E) Comparison of ROC for handling 60 cases among H-system, Doctor-1Y, and Doctor-5Y. The AUC of Doctor-1Y was significantly lower than that of H-system. However, a high degree of agreement on treatment plan was found between Doctor-5Y and H-system. (F) Comparison of number of cases handled by H-system, Doctor-1Y, and Doctor-5Y in 30 min. The figure means that the number of cases handled by H-system was significantly greater than that of Doctor-1Y and Doctor-5Y in a fixed time. **** means there was a significant statistical difference (p < 0.001).
Accuracy and AUC of H-system in the testing set.
| Group | Accuracy (%) (95% CI) | AUC |
| H-system | 88.55 (88.16–88.94) | 0.887 (0.884–0.891) |
| Plan I | 91.83 (91.01–92.65) | |
| Plan II | 85.71 (84.99–86.43) | |
| Plan IIA | 78.03 (76.14–79.92) | |
| Plan IIB | 89.15 (84.40–93.89) |
Comparison of time for handling with 60 cases and single case among H-system and doctors.
| Group | Time (60 cases) | Time (single) | |
| H-system | 280.80 ± 3.82 s | 4.68 ± 0.89 s | … |
|
| |||
| Doctor-1Y | 19,494.67 ± 121.29 s | 324.32 ± 38.44 s | <0.001 |
| Doctor-5Y | 10,757.33 ± 94.54 s | 181.14 ± 19.63 s | <0.001 |
Comparison of efficiency for handling 60 cases among H-system and doctors.
| Group | Sensitivity (%) (95% CI) | Specificity (%) (95% CI) | Accuracy (%) (95% CI) | AUC (95% CI) | κ Value (95% CI) |
| H-system | 92.86 (83.99–100.00) | 84.38 (76.61–92.14) | 88.33 (84.20–92.47) | 0.886 (0.844–0.928) | … |
|
| |||||
| Doctor–1Y | 84.52 (79.40–89.64) | 75.00 (67.24–82.77) | 79.44 (77.05–81.84) | 0.798 (0.777–0.819) | 0.827 (0.785–0.868) |
| Doctor-5Y | 94.05 (88.93–99.17) | 86.46 (81.99–90.93) | 91.51 (87.94–95.08) | 0.895 (0.835–0.955) | 0.963 (0.938–0.988) |
Comparison of cases, sensitivity, specificity, and accuracy in a fixed time among H-system and doctors.
| Group | Sensitivity (%) (95% CI) | Specificity (%) (95% CI) | Accuracy (%) (95% CI) | Cases | |
| H-system | 90.43 (89.58–91.28) | 84.91 (82.81–87.01) | 87.47 (86.63–88.31) | 766 (763–769) | … |
|
| |||||
| Doctor-1Y | 84.92 (81.50–88.34) | 74.29 (62.00–86.58) | 79.61 (71.34–87.87) | 13 (11–15) | <0.01 |
| Doctor-5Y | 93.94 (80.90–100.00) | 85.05 (74.24–95.86) | 89.15 (84.40–93.89) | 21 (18–25) | <0.01 |