| Literature DB >> 36249461 |
Ryan G Gomes1, Bellington Vwalika2,3, Chace Lee1, Angelica Willis1, Marcin Sieniek1, Joan T Price3,4, Christina Chen1, Margaret P Kasaro2,4, James A Taylor1, Elizabeth M Stringer3, Scott Mayer McKinney1, Ntazana Sindano4, George E Dahl5, William Goodnight2, Justin Gilmer5, Benjamin H Chi3,4, Charles Lau1, Terry Spitz1, T Saensuksopa1, Kris Liu1, Tiya Tiyasirichokchai1, Jonny Wong1, Rory Pilgrim1, Akib Uddin1, Greg Corrado1, Lily Peng1, Katherine Chou1, Daniel Tse1, Jeffrey S A Stringer3,4, Shravya Shetty1.
Abstract
Background: Fetal ultrasound is an important component of antenatal care, but shortage of adequately trained healthcare workers has limited its adoption in low-to-middle-income countries. This study investigated the use of artificial intelligence for fetal ultrasound in under-resourced settings.Entities:
Keywords: Health care; Medical research
Year: 2022 PMID: 36249461 PMCID: PMC9553916 DOI: 10.1038/s43856-022-00194-5
Source DB: PubMed Journal: Commun Med (Lond) ISSN: 2730-664X
Fig. 1Development of an artificial intelligence system to acquire and interpret blind-sweep ultrasound for antenatal diagnostics.
a Datasets were curated from sites in Zambia and the USA and include ultrasound acquired by sonographers and midwives. Ground truth for gestational age was derived from the initial exam as part of clinical practice. An artificial intelligence (AI) system was trained to identify gestational age and fetal malpresentation and was evaluated by comparing the accuracy of AI predictions with the accuracy of clinical standard procedures. The AI system was developed using only sonographer blind-sweep data, and its generalization to novice users was tested on midwife data. Design of the AI system considered suitability for deployment in low-to-middle-income countries in three ways: first, the system interpreted ultrasound from low-cost portable ultrasound devices; second, near real-time interpretation is available offline on mobile phone devices; and finally, the AI system produces feedback scores that can be used to provide feedback to users. b Blind-sweep ultrasound acquisition procedure. The procedure can be performed by novices with a few hours of ultrasound training. While the complete protocol involves six sweeps, a set of two sweeps (M and R) were found to be sufficient for maintaining the accuracy of gestational age estimation.
Gestational age estimation.
| Sweeps collected by sonographers | Sweeps collected by novices | |||
|---|---|---|---|---|
| Standard ultrasound device | Low-cost handheld device | Standard ultrasound device | Low-cost handheld device | |
| Number | 406 | 104 | 112 | 56 |
| Blind-sweep MAE ± sd (days) | 3.8 ± 3.6 | 3.3 ± 2.8 | 4.4 ± 3.5 | 5.0 ± 4.0 |
| Standard fetal biometry estimates MAE ± sd (days) | 5.2 ± 4.6 | 3.8 ± 3.6 | 4.8 ± 3.7 | 4.7 ± 4.0 |
| Blind sweep—standard fetal biometry mean difference ± sd (days) | −1.4 ± 4.5 | −0.6 ± 3.8 | −0.4 ± 4.8 | 0.4 ± 5.1 |
| MAE difference 95% CI (days) | −1.8, −0.9 | −1.3, 0.1 | −1.3, 0.5 | −1.0, 1.7 |
| Blind sweep ME ± sd (days) | −0.9 ± 5.3 | 0.4 ± 4.4 | −1.5 ± 5.5 | −3.8 ± 5.4 |
| Standard fetal biometry estimates ME ± sd (days) | −1.4 ± 7.0 | −0.25 ± 5.4 | −2.6 ± 5.3 | −3.4 ± 5.2 |
| Reduced blind-sweep protocol MAE ± sd (days) | 4.0 ± 3.7 | 3.5 ± 3.0 | 4.5 ± 3.5 | 5.1 ± 4.2 |
Mean absolute error (MAE) and mean error (ME) between gestational age (GA) estimated using the blind-sweep procedure and ground truth, and the MAE and ME between the GA estimated using the standard fetal biometry ultrasound procedure and ground truth. One visit by each participant eligible for each subgroup was selected at random. The reduced blind-sweep protocol (last row) included only two blind sweeps. All other blind-sweep results used a set of six blind sweeps per patient visit. All fetal biometry GA estimates were collected by expert sonographers using standard ultrasound devices.
Fetal malpresentation estimation.
| Subset | Number of participants | Number of malpresentations | AUC-ROC (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) |
|---|---|---|---|---|---|
| All | 613 | 65 | 0.977 (0.949, 1.0) | 0.938 (0.848, 0.983) | 0.973 (0.955, 0.985) |
| Low-cost device only | 213 | 29 | 0.970 (0.944, 0.997) | 0.931 (0.772, 0.992) | 0.940 (0.896, 0.970) |
| Standard device only | 598 | 65 | 0.980 (0.953, 1.000) | 0.954 (0.871, 0.990) | 0.977 (0.961, 0.988) |
| Novice only | 189 | 21 | 0.992 (0.983, 1.000) | 1.000 (0.839, 1.000) | 0.952 (0.908, 0.979) |
| Sonographer only | 424 | 43 | 0.972 (0.933, 989) | 0.907 (0.779, 0.974) | 0.987 (0.970, 0.996) |
The fetal malpresentation model was assessed by comparing predictions to the determination of a sonographer. In each subset of the data, we selected only the latest eligible visit from each patient. For sensitivity and specificity computations, model predictions were binarized according to a predefined threshold. Confidence intervals on the area under the receiver operating characteristic (AUC-ROC) were computed using the DeLong method. Confidence intervals on sensitivity and specificity were computed with the Clopper–Pearson method.
Mobile-device model run-time benchmarks.
| Processor type | |||
|---|---|---|---|
| Mobile phone | GPU mean ± standard deviation | CPU w/ XNNPACK library (4 threads) | CPU (4 threads) |
| Pixel 3 | 0.9 ± 0.1 s | 2.1 ± 1.0 s | 13.2 ± 2.9 s |
| Pixel 4 | 0.2 ± 0.1 s | 1.5 ± 0.8 s | 9.8 ± 2.5 s |
| Samsung Galaxy S10 | 0.5 ± 0.1 s | 1.7 ± 1.1 s | 10.3 ± 2.3 s |
| Xiaomi Mi 9 | 1.0 ± 0.2 s | 1.8 ± 1.3 s | 13.7 ± 3.4 s |
Time to model inference results (mean and standard deviation in seconds) measured from the end of a 10-s-long blind-sweep video. Both gestational age and fetal malpresentation models run simultaneously on the same video sequence and image preprocessing operations are included. Near real-time inference is achievable on smartphones with graphics processing units or compute libraries optimized for neural network operations. This enables a simple and fast examination procedure in clinical environments.