| Literature DB >> 33899504 |
John S Chorba1,2, Avi M Shapiro3, Le Le3, John Maidens3, John Prince3, Steve Pham3, Mia M Kanzawa3, Daniel N Barbosa3, Caroline Currie3, Catherine Brooks3, Brent E White4, Anna Huskin4, Jason Paek4, Jack Geocaris4, Dinatu Elnathan4, Ria Ronquillo5, Roy Kim5, Zenith H Alam6, Vaikom S Mahadevan1, Sophie G Fuller1, Grant W Stalker1, Sara A Bravo1, Dina Jean1, John J Lee6, Medeona Gjergjindreaj6, Christos G Mihos6, Steven T Forman5, Subramaniam Venkatraman3, Patrick M McCarthy4, James D Thomas4.
Abstract
Background Clinicians vary markedly in their ability to detect murmurs during cardiac auscultation and identify the underlying pathological features. Deep learning approaches have shown promise in medicine by transforming collected data into clinically significant information. The objective of this research is to assess the performance of a deep learning algorithm to detect murmurs and clinically significant valvular heart disease using recordings from a commercial digital stethoscope platform. Methods and Results Using >34 hours of previously acquired and annotated heart sound recordings, we trained a deep neural network to detect murmurs. To test the algorithm, we enrolled 962 patients in a clinical study and collected recordings at the 4 primary auscultation locations. Ground truth was established using patient echocardiograms and annotations by 3 expert cardiologists. Algorithm performance for detecting murmurs has sensitivity and specificity of 76.3% and 91.4%, respectively. By omitting softer murmurs, those with grade 1 intensity, sensitivity increased to 90.0%. Application of the algorithm at the appropriate anatomic auscultation location detected moderate-to-severe or greater aortic stenosis, with sensitivity of 93.2% and specificity of 86.0%, and moderate-to-severe or greater mitral regurgitation, with sensitivity of 66.2% and specificity of 94.6%. Conclusions The deep learning algorithm's ability to detect murmurs and clinically significant aortic stenosis and mitral regurgitation is comparable to expert cardiologists based on the annotated subset of our database. The findings suggest that such algorithms would have utility as front-line clinical support tools to aid clinicians in screening for cardiac murmurs caused by valvular heart disease. Registration URL: https://clinicaltrials.gov; Unique Identifier: NCT03458806.Entities:
Keywords: auscultation; machine learning; neural networks; physical examination; valvular heart disease
Mesh:
Year: 2021 PMID: 33899504 PMCID: PMC8200722 DOI: 10.1161/JAHA.120.019905
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
Characteristics of Study Subjects
| Characteristics | All Subjects | AS Cases | AS Controls | MR Cases | MR Controls |
|---|---|---|---|---|---|
| Total subjects | 962 | 73 | 172 | 68 | 130 |
| Age, mean±SD, y | 65±15 | 73±11 | 56±15 | 64±12 | 55±14 |
| <18, y | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| 18–29 y | 27 (2.8) | 0 (0.0) | 11 (6.4) | 0 (0.0) | 7 (5.4) |
| 30–39 y | 45 (4.7) | 0 (0.0) | 19 (11.0) | 3 (4.4) | 15 (11.5) |
| 40–49 y | 77 (8.0) | 2 (2.7) | 26 (15.1) | 5 (7.4) | 23 (17.7) |
| 50–59 y | 128 (13.3) | 8 (11.0) | 32 (18.6) | 11 (16.2) | 27 (20.8) |
| 60–69 y | 237 (24.6) | 15 (20.5) | 46 (26.7) | 26 (38.2) | 35 (26.9) |
| 70–79 y | 276 (28.7) | 26 (35.6) | 30 (17.4) | 15 (22.1) | 21 (16.2) |
| 80–90 y | 143 (14.9) | 16 (21.9) | 8 (4.7) | 7 (10.3) | 2 (1.5) |
| >90 y | 28 (2.9) | 6 (8.2) | 0 (0.0) | 1 (1.5) | 0 (0.0) |
| Sex | |||||
| Women | 450 (46.8) | 33 (45.2) | 93 (54.1) | 21 (30.9) | 67 (51.5) |
| Men | 512 (53.2) | 40 (54.8) | 79 (45.9) | 47 (69.1) | 63 (48.5) |
| Race/ethnicity | |||||
| White | 748 (77.8) | 54 (74.0) | 129 (75.0) | 57 (83.8) | 99 (76.2) |
| Black/AA | 57 (5.9) | 5 (6.8) | 10 (5.8) | 2 (2.9) | 8 (6.2) |
| Asian | 69 (7.2) | 5 (6.8) | 14 (8.1) | 4 (5.9) | 11 (8.5) |
| Hispanic/Latino | 57 (5.9) | 7 (9.6) | 12 (7.0) | 4 (5.9) | 7 (5.4) |
| Other/unknown | 31 (3.2) | 2 (2.7) | 7 (4.1) | 1 (1.5) | 5 (3.8) |
| Valvular disease | |||||
| Prosthesis | 111 (11.5) | 3 (4.1) | 0 (0.0) | 5 (7.4) | 0 (0.0) |
| Aortic valve | |||||
| Regurgitation | |||||
| Mild | 156 (16.2) | 19 (26.0) | 0 (0.0) | 15 (22.1) | 0 (0.0) |
| Moderate | 48 (5.0) | 6 (8.2) | 0 (0.0) | 3 (4.4) | 0 (0.0) |
| Severe | 9 (0.9) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Stenosis | |||||
| Mild | 30 (3.1) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Moderate | 30 (3.1) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Severe | 82 (8.5) | 73 (100.0) | 0 (0.0) | 1 (1.5) | 0 (0.0) |
| Mitral valve | |||||
| Regurgitation | |||||
| Mild | 225 (23.4) | 19 (26.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Moderate | 98 (10.2) | 10 (13.7) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Severe | 85 (8.8) | 2 (2.7) | 0 (0.0) | 68 (100.0) | 0 (0.0) |
| Stenosis | |||||
| Mild | 21 (2.2) | 3 (4.1) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Moderate | 10 (1.0) | 2 (2.7) | 0 (0.0) | 2 (2.9) | 0 (0.0) |
| Severe | 8 (0.8) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Pulmonic valve | |||||
| Regurgitation | |||||
| Mild | 141 (14.7) | 13 (17.8) | 0 (0.0) | 18 (26.5) | 0 (0.0) |
| Moderate | 13 (1.4) | 0 (0.0) | 0 (0.0) | 1 (1.5) | 0 (0.0) |
| Severe | 2 (0.2) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Stenosis | |||||
| Mild | 2 (0.2) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Moderate | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Severe | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Tricuspid valve | |||||
| Regurgitation | |||||
| Mild | 305 (31.7) | 19 (26.0) | 0 (0.0) | 25 (36.8) | 0 (0.0) |
| Moderate | 88 (9.1) | 6 (8.2) | 0 (0.0) | 16 (23.5) | 0 (0.0) |
| Severe | 27 (2.8) | 1 (1.4) | 0 (0.0) | 2 (2.9) | 0 (0.0) |
| Stenosis | |||||
| Mild | 2 (0.2) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Moderate | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Severe | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
Data are given as number (percentage), unless otherwise noted. For brevity, intermediate grades of valvular disease severity are categorized as the higher grade (ie, moderate to severe included as severe). AA indicates African American; AS, aortic stenosis; and MR, mitral regurgitation.
Figure 1Predicted and annotated signal quality.
The plot on the right shows that the recordings predicted as “inadequate signal” by the algorithm have low signal quality, as assessed by the cardiologist annotators.
Characteristics of Murmur Detection Algorithm
| Confusion Matrix | |||
|---|---|---|---|
| Annotation (Ground Truth) | Total | ||
| Murmur | No Murmur | ||
| Algorithm result | |||
| Murmur detected | 499 | 77 | 576 |
| No murmur detected | 155 | 817 | 972 |
| Inadequate signal | 28 | 198 | 226 |
| Total | 682 | 1092 |
|
Confusion matrix listed at top, with test characteristics stratified by annotated murmur grade, auscultation position, and auscultation device listed below. Under the heading of recordings, “murmur” indicates algorithm‐identified murmurs, “total” indicates algorithm‐analyzed recordings after removing inadequate signals, and “inadequate signal” indicates recordings labeled as inadequate signal by signal quality classifier. Test characteristics are computed after excluding inadequate signals from analysis. LR indicates likelihood ratio.
*represent inadequate signal recordings.
Figure 2Performance of murmur detection algorithm.
Receiver operating characteristic curves for all recordings (blue) and minimal intensity–filtered murmurs (green) are shown. Eko software operates with parameters yielding the orange marker. Error bars indicate 95% CIs. AUC indicates area under the curve.
Figure 3Flow of study participants.
We defined valvular heart disease cases as those graded moderate to severe or worse to encompass all levels of disease that could require timely intervention beyond serial monitoring. We defined controls as subjects free of valvular, structural, or congenital heart disease, with no valvular regurgitation or stenosis beyond trivial or physiologic severity. Potential participants included all enrolled subjects (ie, those with recordings). Eligible participants included only those with the appropriate data for analysis. Aortic stenosis (AS) was assessed by a single recording at either the aortic (preferred) or the pulmonic position, and mitral regurgitation (MR) was assessed by a single recording at the mitral position. Actual cases and controls were further filtered from potential cases and controls by removing subjects with “inadequate signal” at the corresponding anatomic locations by the signal quality classifier. Numbers listed in italics represent the subset of annotated recordings.
Characteristics of Algorithm for VHD Screening
| Variables | Subjects (Cases/Controls) | Sensitivity (95% CI), % | Specificity (95% CI), % | LR Positive (95% CI) | LR Negative (95% CI) |
|---|---|---|---|---|---|
| Aortic stenosis | |||||
| Algorithm | 245 (73/172) | 93.2 (86.9–98.5) | 86.0 (80.9–91.0) | 6.68 (4.82–10.37) | 0.08 (0.018–0.155) |
| Annotator A | 122 (40/82) | 97.5 (91.7–100) | 45.1 (34.1–55.8) | 1.78 (1.46–2.2) | 0.055 (0.0–0.2) |
| Annotator B | 122 (40/82) | 90.0 (79.5–97.7) | 78.0 (68.2–86.8) | 4.1 (2.78–6.84) | 0.128 (0.029–0.27) |
| Annotator C | 122 (40/82) | 82.5 (69.6–93.6) | 90.2 (83.1–96.3) | 8.46 (4.77–23.01) | 0.194 (0.069–0.338) |
| Mitral regurgitation | |||||
| Algorithm | 198 (68/130) | 66.2 (54.7–77.4) | 94.6 (90.4–98.4) | 12.3 (6.7–39.9) | 0.357 (0.239–0.479) |
| Annotator A | 91 (29/62) | 82.8 (68.0–95.5) | 64.5 (52.5–76.1) | 2.33 (1.65–3.51) | 0.267 (0.076–0.514) |
| Annotator B | 91 (29/62) | 69.0 (52.0–85.7) | 82.3 (72.7–90.8) | 3.89 (2.31–7.85) | 0.377 (0.173–0.598) |
| Annotator C | 91 (29/62) | 58.6 (40.0–76.7) | 87.1 (78.5–95.1) | 4.54 (2.42–12.11) | 0.475 (0.263–0.691) |
LR indicates likelihood ratio; and VHD, valvular heart disease.
Figure 4Performance of aortic stenosis screening by murmur detection algorithm.
The algorithm receiver operating characteristic (ROC) curve is shown in blue. Eko software operates at the orange marker. The performance of the individual cardiologists on the annotated subset of the overall data set is shown by the green, red, and purple markers. Error bars indicate 95% CIs. AUC indicates area under the curve.
Figure 5Performance of mitral regurgitation screening by murmur detection algorithm.
The algorithm receiver operating characteristic (ROC) curve is shown in blue. Eko software operates at the orange marker. The performance of the individual cardiologists on the annotated subset of the overall data set is shown by the green, red, and purple markers. Error bars indicate 95% CIs. AUC indicates area under the curve.