| Literature DB >> 31143882 |
Haotian Lin1, Ruiyang Li1, Zhenzhen Liu1, Jingjing Chen1, Yahan Yang1, Hui Chen1, Zhuoling Lin1, Weiyi Lai1, Erping Long1, Xiaohang Wu1, Duoru Lin1, Yi Zhu1,2, Chuan Chen1,2, Dongxuan Wu3, Tongyong Yu3, Qianzhong Cao1, Xiaoyan Li1, Jing Li1, Wangting Li1, Jinghui Wang1, Mingmin Yang4, Huiling Hu4, Li Zhang5, Yang Yu6, Xuelan Chen6, Jianmin Hu6, Ke Zhu7, Shuhong Jiang8, Yalin Huang9, Gang Tan10, Jialing Huang11, Xiaoming Lin1, Xinyu Zhang1, Lixia Luo1, Yuhua Liu1, Xialin Liu1, Bing Cheng1, Danying Zheng1, Mingxing Wu1, Weirong Chen1, Yizhi Liu1.
Abstract
BACKGROUND: CC-Cruiser is an artificial intelligence (AI) platform developed for diagnosing childhood cataracts and providing risk stratification and treatment recommendations. The high accuracy of CC-Cruiser was previously validated using specific datasets. The objective of this study was to compare the diagnostic efficacy and treatment decision-making capacity between CC-Cruiser and ophthalmologists in real-world clinical settings.Entities:
Keywords: Artificial intelligence; Childhood cataracts; Multicentre randomized controlled trial; Ophthalmology
Year: 2019 PMID: 31143882 PMCID: PMC6510889 DOI: 10.1016/j.eclinm.2019.03.001
Source DB: PubMed Journal: EClinicalMedicine ISSN: 2589-5370
Fig. 1Trial profile.
AI = artificial intelligence.
Baseline demographics and disease characteristics.
| AI group (N = 175 P/350 E) | SC group (N = 175 P/350 E) | ||
|---|---|---|---|
| Sex | |||
| Male | 77 (44.0%) | 82 (46.9%) | |
| Female | 98 (56.0%) | 93 (53.1%) | |
| Age (years) | 6.58 (0.45) | 5.89 (0.45) | |
| Family history of cataracts | |||
| No | 165 (94.3%) | 163 (93.1%) | |
| Yes | 10 (5.7%) | 12 (6.9%) | |
| Eye symptoms | |||
| No | 96 (54.9%) | 93 (53.1%) | |
| Yes | 79 (45.1%) | 82 (46.9%) | |
| Patients with cataracts | |||
| Normal | 106 (60.6%) | 100 (57.1%) | |
| Monocular cataracts | 31 (17.7%) | 28 (16.0%) | |
| Bilateral cataract | 38 (21.7%) | 47 (26.9%) | |
| Eyes with cataracts | |||
| No | 243 (69.4%) | 228 (65.1%) | |
| Yes | 107 (30.6%) | 122 (34.9%) | |
| Opacity area | |||
| Extensive | 71 (66.4%) | 85 (69.7%) | |
| Limited | 36 (33.6%) | 37 (30.3%) | |
| Density | |||
| Dense | 69 (64.5%) | 77 (63.1%) | |
| Non-dense | 38 (35.5%) | 45 (36.9%) | |
| Location | |||
| Central | 77 (72.0%) | 83 (68.0%) | |
| Peripheral | 30 (28.0%) | 39 (32.0%) | |
| Treatment recommendations | |||
| Surgery | 63 (58.9%) | 76 (62.3%) | |
| Follow-up | 44 (41.1%) | 46 (37.7%) |
Data are presented as the number n (%) or mean (standard deviations). Percentages do not add up to 100% in some cases because of rounding. The χ2 test was performed to compare the characteristics of sex, family history of cataracts, eye symptoms and patients with cataracts between the AI group and the senior consultant group. An independent samples t-test was performed to compare age between the two groups. The generalized estimating equation was performed to compare the eyes with cataracts, disease severity, and treatment recommendations. None of the baseline characteristics differed significantly at the 0.05 level between groups. AI = artificial intelligence. SC = senior consultant. P = participants. E = eyes.
χ2 test.
t-Test.
Generalized estimating equation.
Diagnostic performance regarding childhood cataract.
| Sensitivity | Specificity | Accuracy | Positive predictive value | Negative predictive value | Accuracy difference | TPF difference | FPF difference | |
|---|---|---|---|---|---|---|---|---|
| CC-Cruiser | 89.7% | 86.4% | 87.4% | 74.4% | 95.0% | |||
| Senior consultants | 98.4% | 99.6% | 99.1% | 99.2% | 99.1% |
Eyes were the units of analysis (N = 700). There were 350 eyes in the CC-Cruiser group and 350 eyes in the senior consultant group. OR = odd ratio. CI = confidence interval. TPF = true positive fraction. TPF is equivalent to sensitivity. FPF = false positive fraction. FPF is equivalent to 1-specificity. We performed a diagnostic accuracy analysis with reference to the cataract specialists' standards. The TPF and FPF of diagnosis (normal lens versus cataract) were 89.7%, and 13.6%, respectively, for CC-Cruiser and 98.4%, and 0.4%, respectively, for the senior consultants. The generalized estimating equation (GEE) was performed to identify significant differences in accuracy, TPF, and FPF between CC-Cruiser and the senior consultants. GEE results (adjusted results) and logistic regression results (unadjusted results) for cataract diagnosis in the supplementary table were presented to show the impact of the cluster at the level of participants (Supplementary table 1 for adjusted and unadjusted results).
Eyes were the units of analysis (N = 700). There were 350 eyes in the CC-Cruiser group and 350 eyes in the senior consultant group. OR = odd ratio. CI = confidence interval. TPF = true positive fraction. TPF is equivalent to sensitivity. FPF = false positive fraction. FPF is equivalent to 1-specificity. We performed a diagnostic accuracy analysis with reference to the cataract specialists' standards. The TPF and FPF of diagnosis (normal lens versus cataract) were 89.7%, and 13.6%, respectively, for CC-Cruiser and 98.4%, and 0.4%, respectively, for the senior consultants. The generalized estimating equation (GEE) was performed to identify significant differences in accuracy, TPF, and FPF between CC-Cruiser and the senior consultants. GEE results (adjusted results) and logistic regression results (unadjusted results) for cataract diagnosis in the supplementary table were presented to show the impact of the cluster at the level of participants (Supplementary table 1 for adjusted and unadjusted results).
Comprehensive evaluations of childhood cataract and treatment recommendations.
| Sensitivity | Specificity | Accuracy | Accuracy difference | TPF difference ( | FPF difference ( | |
|---|---|---|---|---|---|---|
| Opacity area | ||||||
| CC-Cruiser | 91.3% | 88.9% | 90.6% | − 2.7 ( | − 2.8 ( | 2.5 ( |
| Senior consultants | 94.1% | 91.4% | 93.3% | |||
| Density | ||||||
| CC-Cruiser | 85.3% | 67.9% | 80.2% | − 4.8 ( | 3.5 ( | 22.8 ( |
| Senior consultants | 81.8% | 90.7% | 85.0% | |||
| Location | ||||||
| CC-Cruiser | 84.2% | 50% | 77.1% | − 10.4 ( | − 7.4 ( | 28.4 ( |
| Senior consultants | 91.6% | 78.4% | 87.5% | |||
| Treatment | ||||||
| CC-Cruiser | 86.7% | 44.4% | 70.8% | − 25.9 ( | − 8.0 ( | 55.6 |
| Senior consultants | 94.7% | 100.0% | 96.7% |
Eyes were the units of analysis. A total of 216 eyes (correctly diagnosed as cataracts in both groups, 96 eyes in the CC-Cruiser group and 120 eyes in the senior consultant group) were further analyzed by comprehensive evaluation of lens opacity, including the opacity area (extensive versus limited), density (dense versus non-dense), and location (central versus peripheral), and the recommended treatment (surgery versus follow-up) with reference to the cataract specialists' standards. OR = odd ratio. CI = confidence interval. TPF = true positive fraction. TPF is equivalent to sensitivity. FPF = false positive fraction. FPF is equivalent to 1-specificity. The generalized estimating equation was performed to identify significant differences in the accuracy, TPF, and FPF of the opacity area, density, and location and the treatment recommendations between CC-Cruiser and the senior consultants. The p-value and OR of the difference in FPF of treatment between two groups couldn't be calculated because of the 100% specificity for senior consultants. GEE results (adjusted results) and logistic regression results (unadjusted results) for evaluation of cataract and treatment in the supplementary table were presented to show the impact of the cluster at the level of participants (Supplementary table 1 for adjusted and unadjusted results).
Eyes were the units of analysis. A total of 216 eyes (correctly diagnosed as cataracts in both groups, 96 eyes in the CC-Cruiser group and 120 eyes in the senior consultant group) were further analyzed by comprehensive evaluation of lens opacity, including the opacity area (extensive versus limited), density (dense versus non-dense), and location (central versus peripheral), and the recommended treatment (surgery versus follow-up) with reference to the cataract specialists' standards. OR = odd ratio. CI = confidence interval. TPF = true positive fraction. TPF is equivalent to sensitivity. FPF = false positive fraction. FPF is equivalent to 1-specificity. The generalized estimating equation was performed to identify significant differences in the accuracy, TPF, and FPF of the opacity area, density, and location and the treatment recommendations between CC-Cruiser and the senior consultants. The p-value and OR of the difference in FPF of treatment between two groups couldn't be calculated because of the 100% specificity for senior consultants. GEE results (adjusted results) and logistic regression results (unadjusted results) for evaluation of cataract and treatment in the supplementary table were presented to show the impact of the cluster at the level of participants (Supplementary table 1 for adjusted and unadjusted results).
Time required for the diagnostic process of CC-Cruiser and senior consultants.
| Mean time (minutes) | Standard deviation | 95% CI | Mean difference ( | ||
|---|---|---|---|---|---|
| Lower | Upper | ||||
| CC-Cruiser | 2.79 | 1.11 | 2.64 | 2.96 | 5.74 ( |
| Senior consultants | 8.53 | 1.75 | 8.27 | 8.78 | |
Three hundred patients were included in the analysis (175 participants in the CC-Cruiser group and 175 participants in the senior consultant group). The Mann–Whitney U test was performed to compare the time required. Significant differences in time required were observed between the CC-Cruiser and senior consultant groups (p < 0.001). CI = confidence interval.
Questionnaire provided to the participants with their responses to the clinical service.
| Question | Response in the AI group (N = 172) | Mean rating (SD) | Response in the SC group | Mean rating (SD) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | ||||
| The initial diagnosis of the eye clinic was credible. | 5.2% (9) | 15.9% (27) | 32.0% (55) | 47.1% (81) | 3.21 (0.893) | 0 | 3.5% (6) | 58.4% (101) | 38.2% (66) | 3.35 (0.546) | |
| The initial therapeutic decision of the eye clinic was credible. | 4.7% (8) | 18.0% (31) | 28.5% (49) | 48.8% (84) | 3.22 (0.902) | 0 | 5.8% (10) | 57.0% (98) | 37.6% (65) | 3.32 (0.578) | |
| The initial diagnosis of the eye clinic was consistent with that of the experts. | 3.5% (6) | 18.6% (32) | 21.5% (37) | 56.4% (97) | 3.31 (0.896) | 0 | 2.9% (5) | 37.6% (65) | 59.5% (103) | 3.57 (0.552) | |
| The initial therapeutic decision of the eye clinic was consistent with that of the experts. | 3.5% (6) | 23.3% (40) | 17.4% (30) | 55.8% (96) | 3.26 (0.918) | 0 | 4.0% (7) | 37.6% (65) | 58.4% (101) | 3.54 (0.575) | |
| I was satisfied with the time required to wait for CC-Cruiser/senior consultants in this eye clinic. | 0 | 0.6% (1) | 41.9% (72) | 57.6% (99) | 3.57 (0.508) | 0 | 5.2% (9) | 50.9% (88) | 43.9% (76) | 3.39 (0.586) | |
| I was satisfied with the time required to make the diagnosis and provide treatment recommendations by CC-Cruiser/senior consultants. | 0 | 0 | 43.6% (75) | 56.4% (97) | 3.56 (0.497) | 0 | 0.5% (1) | 59.5% (103) | 39.9% (69) | 3.38 (0.554) | |
| Overall, I was satisfied with this medical service provided in this eye clinic. | 0 | 0 | 52.9% (91) | 47.1% (81) | 3.47 (0.501) | 0 | 4.0% (7) | 61.3% (106) | 34.7% (60) | 3.31 (0.543) | |
Data are presented as the number (%) or mean (standard deviation). Percentages do not add up to 100% in some cases because of rounding. Three hundred and forty-five patients were included in the analysis (172 in the CC-Cruiser group and 173 in the senior consultant group). Pediatric participants and at least one of their guardians were asked to complete the questionnaire together. Five participants' guardians were unwilling to complete the questionnaires because of personal reasons. The survey questions used a 4-point scale (1, disagree; 2, neutral; 3, agree; and 4, strongly agree). The Mann–Whitney U test was performed to identify significant differences in responses to each question between the two groups. AI = artificial intelligence. SC = senior consultant. SD = standard deviation.