| Literature DB >> 36150781 |
Jonathan Y Lam1, Chisato Shimizu2, Adriana H Tremoulet2, Emelia Bainto2, Samantha C Roberts2, Nipha Sivilay2, Michael A Gardiner2, John T Kanegaye2, Alexander H Hogan3, Juan C Salazar3, Sindhu Mohandas4, Jacqueline R Szmuszkovicz4, Simran Mahanta5, Audrey Dionne5, Jane W Newburger5, Emily Ansusinha6, Roberta L DeBiasi6, Shiying Hao7, Xuefeng B Ling7, Harvey J Cohen8, Shamim Nemati9, Jane C Burns2.
Abstract
BACKGROUND: Multisystem inflammatory syndrome in children (MIS-C) is a novel disease that was identified during the COVID-19 pandemic and is characterised by systemic inflammation following SARS-CoV-2 infection. Early detection of MIS-C is a challenge given its clinical similarities to Kawasaki disease and other acute febrile childhood illnesses. We aimed to develop and validate an artificial intelligence algorithm that can distinguish among MIS-C, Kawasaki disease, and other similar febrile illnesses and aid in the diagnosis of patients in the emergency department and acute care setting.Entities:
Mesh:
Year: 2022 PMID: 36150781 PMCID: PMC9507344 DOI: 10.1016/S2589-7500(22)00149-2
Source DB: PubMed Journal: Lancet Digit Health ISSN: 2589-7500
Figure 1Model architecture
A patient could be classified as having MIS-C, other febrile illnesses, or Kawasaki disease if the input data were not rejected by the conformal prediction framework. MIS-C=multisystem inflammatory syndrome in children.
Demographic and clinical characteristics of patients used in internal validation from Rady Children's Hospital San Diego (CA, USA; n=1517), Connecticut Children's Medical Center (Hartford, CT, USA; n=16), and Children's Hospital Los Angeles (CA, USA; n=50)
| Age | 8·4 (4·3–11·3) | 2·9 (1·5–4·8) | 3·4 (1·5–5·9) | <0·0001 | |
| Sex | .. | .. | .. | NS | |
| Male | 77 (57%) | 463 (60%) | 394 (59%) | .. | |
| Female | 58 (43%) | 312 (40%) | 279 (41%) | .. | |
| Ethnicity | .. | .. | .. | <0·0001 | |
| Asian | 6 (4%) | 124 (16%) | 47 (7%) | .. | |
| African American | 14 (10%) | 23 (3%) | 10 (1%) | .. | |
| White | 9 (7%) | 167 (22%) | 167 (25%) | .. | |
| Hispanic | 94 (70%) | 275 (35%) | 272 (40%) | .. | |
| More than two races or other | 12 (9%) | 186 (24%) | 114 (17%) | .. | |
| No information | 0 | 0 | 63 (9%) | .. | |
| Maximum Z score | 1·6 (0·9–2·4) | 1·7 (1·2–2·4) | NA | NS | |
| Lowest left ventricle ejection fraction | 57% (48–61) | 66% (63–70) | NA | <0·0001 | |
| Illness day of sample collection | 4 (3–6) | 5 (4–7) | 5 (4–7) | NS | |
| Automated differential | 32 (24%) | 33 (4%) | 91 (14%) | <0·0001 | |
| Clinical signs | |||||
| Rash | 76 (56%) | 715 (92%) | 442 (66%) | <0·0001 | |
| Conjunctival injection | 81 (60%) | 725 (94%) | 343 (51%) | <0·0001 | |
| Changes in lips or oropharyngeal mucosa | 50 (37%) | 723 (93%) | 304 (45%) | <0·0001 | |
| Cervical lymphadenopathy | 22 (16%) | 274 (35%) | 151 (22%) | <0·0001 | |
| Peripheral extremity changes | 18 (13%) | 625 (81%) | 131 (19%) | <0·0001 | |
| Laboratory data | |||||
| White blood cell count | 9·8 (6·9–13·0) | 13·2 (10·4–17·0) | 9·5 (6·4–13·3) | <0·0001 | |
| Neutrophils | 70% (58–80) | 58% (47–69) | 47% (31–62) | <0·0001 | |
| Bands | 12% (2–24) | 7% (2–15) | 5% (2–11) | <0·0001 | |
| Lymphocytes | 11% (6–20) | 20% (12–31) | 32% (18–47) | <0·0001 | |
| Atypical lymphocytes | 0% (0–1) | 0% (0–1) | 1% (0–3) | <0·0001 | |
| Monocytes | 3% (1–5) | 6% (3–8) | 8% (5–11) | <0·0001 | |
| Eosinophils | 1% (0–3) | 2% (1–4) | 0% (0–1) | <0·0001 | |
| Absolute neutrophil count | 7980 (5476–10 416) | 8892 (6305–11 768) | 4800 (2599–8208) | <0·0001 | |
| Absolute band count | 1160 (236–2167) | 891 (299–2095) | 438 (148–1205) | <0·0001 | |
| Absolute lymphocyte count, cells per μL | 962 (585–1797) | 2565 (1412–3963) | 2730 (1633–4356) | <0·0001 | |
| Haemoglobin concentration normalised for age | −1·6 (−2·9 to −0·7) | −1·3 (−2·3 to −0·5) | −0·3 (−1·3 to 0·5) | <0·0001 | |
| Platelet count | 160 (111–222) | 339 (267–426) | 247 (184–322) | <0·0001 | |
| Erythrocyte sedimentation rate | 49 (31–75) | 60 (39–75) | 29 (15–45) | <0·0001 | |
| C-reactive protein | 18·7 (8·2–25·7) | 7·0 (4·3–16·9) | 2·9 (1·2–6·0) | <0·0001 | |
| Alanine aminotransferase | 39 (21–63) | 46 (26–117) | 27 (19–38) | <0·0001 | |
| γ-glutamyl transferase | 36 (24–85) | 46 (18–128) | 15 (12–20) | <0·0001 | |
| Albumin | 3·6 (3·1–4·0) | 3·8 (3·5–4·2) | 4·1 (3·8–4·4) | <0·0001 | |
| Sodium | 133 (130–135) | 137 (134–139) | 138 (136–139) | <0·0001 | |
Data are n (%) or median (IQR). Percentages might not sum to 100 as a result of rounding. MIS-C=multisystem inflammatory syndrome in children. NA=not applicable. NS=not significant.
p values were calculated with the Mann-Whitney U test for continuous variables between two groups, the Kruskal-Wallis test for three groups, and the χ2 test for categorical variables.
Model feature.
Maximum Z score (internal diameter normalised for body surface area) for the right and left anterior descending coronary arteries.
Illness day 1 was the first day of fever.
Tenfold stratified cross-validation performance metrics for the training and validation cohorts in stage 1 and stage 2
| Training | |||||||
| Logistic regression | 95·4% (95·0–95·5) | 98·6% (98·5–98·7) | 92·4% (91·6–93·3) | 95·6% (95·4–96·0) | 70·9% (69·4–72·2) | 99·1% (99·0–99·2) | |
| Neural network | 96·4% (96·1–97·2) | 99·5% (99·4–99·6) | 100·0% (99·2–100·0) | 96·3% (96·0–96·9) | 74·7% (73·9–78·8) | 100·0% (99·9–100·0) | |
| Validation | |||||||
| Logistic regression | 96·4% (96·1–96·8) | 98·5% (98·0–98·8) | 93·8% (87·5–100·0) | 97·0% (95·8–97·3) | 65·0% (59·3–66·7) | 99·6% (99·2–100·0) | |
| Neural network | 96·4% (96·1–97·8) | 98·8% (98·0–99·3) | 93·8% (93·8–100·0) | 97·0% (95·8–98·1) | 63·6% (59·3–75·0) | 99·6% (99·6–100·0) | |
| Training | |||||||
| Logistic regression | 91·4% (91·2–91·5) | 97·2% (97·0–97·3) | 95·1% (95·0–95·1) | 86·7% (86·1–86·9) | 89·8% (89·6–90·0) | 93·4% (93·3–93·5) | |
| Neural network | 91·7% (91·4–91·8) | 97·4% (97·3–97·4) | 95·0% (95·0–95·1) | 87·6% (86·9–88·0) | 90·4% (90·0–90·7) | 93·4% (93·4–93·5) | |
| Validation | |||||||
| Logistic regression | 88·6% (88·2–90·1) | 96·1% (95·5–96·7) | 94·6% (94·6–95·3) | 80·3% (78·9–84·3) | 86·1% (85·2–88·1) | 92·4% (92·0–92·9) | |
| Neural network | 90·1% (89·4–90·9) | 96·0% (95·6–97·2) | 94·6% (94·6–94·6) | 84·3% (82·5–86·1) | 88·6% (87·6–89·7) | 92·4% (92·2–92·5) | |
Data are median (IQR). AUC=area under the receiver operating characteristic curve.
Figure 2ROCs for stage 1 (A) and stage 2 (B) in the final model
Thresholds for each stage were set based on the red circle on the ROC for the neural network. AUC=area under the receiver operating characteristic curve. MIS-C=multisystem inflammatory syndrome in children. ROC=receiver operating characteristic curve.
Figure 3SHAP summary plot for stage 1 (A) and stage 2 (B) with raw feature values
A feature for a patient with a SHAP value below 0 decreases the risk score. In stage 1, a higher risk score indicates a higher probability of MIS-C. In stage 2, a higher risk score indicates a higher probability of Kawasaki disease. Features are ranked in order of importance from top to bottom. MIS-C=multisystem inflammatory syndrome in children. SHAP=Shapley Additive Explanations.
Predicted classifications of patients with MIS-C from external sites
| Rejected | 2 (2%) | 1 (2%) | 2 (5%) |
| Other febrile illnesses | 3/81 (4%) | 2/49 (4%) | 3/40 (8%) |
| Kawasaki disease | 2/81 (2%) | 0 | 1/40 (3%) |
| MIS-C | 76/81 (94%) | 47/49 (96%) | 36/40 (90%) |
Data are n (%) or n/N (%). Percentages might not sum to 100 as a result of rounding. Percentages are based on the total number of patients from each site who were not rejected by conformal prediction. MIS-C=multisystem inflammatory syndrome in children.
Consisted of patients from 14 US hospitals.
Classifications were other febrile illnesses and Kawasaki disease.
Classification was other febrile illness.