| Literature DB >> 35265904 |
Matthew D Solomon1,2, Grace Tabada1, Amanda Allen1, Sue Hee Sung1, Alan S Go1,3,4,5.
Abstract
Background: Systematic case identification is critical to improving population health, but widely used diagnosis code-based approaches for conditions like valvular heart disease are inaccurate and lack specificity. Objective: To develop and validate natural language processing (NLP) algorithms to identify aortic stenosis (AS) cases and associated parameters from semi-structured echocardiogram reports and compare their accuracy to administrative diagnosis codes.Entities:
Keywords: Aortic stenosis; Echocardiography; Machine learning; Population health; Quality and outcomes; Valvular heart disease
Year: 2021 PMID: 35265904 PMCID: PMC8890044 DOI: 10.1016/j.cvdhj.2021.03.003
Source DB: PubMed Journal: Cardiovasc Digit Health J ISSN: 2666-6936
Performance of aortic stenosis natural language processing algorithms
| Variable | Development dataset results | Final validation dataset results | ||
|---|---|---|---|---|
| PPV | NPV | PPV | NPV | |
| Global aortic stenosis | 100% | 99% | 99% | 99% |
| Aortic stenosis severity | 100% | 98% | 99% | 96% |
| Aortic valve max velocity | 100% | 100% | 98% | 100% |
| Left ventricular ejection fraction | 100% | 100% | 99% | 100% |
| Peak aortic valve gradient | 100% | 100% | 99% | 98% |
| Mean aortic valve gradient | 99% | 99% | 100% | 98% |
| Left ventricular hypertrophy | 100% | 100% | 100% | 95% |
| Aortic valve velocity time integral | 100% | 100% | 100% | 100% |
| Left ventricular outflow tract velocity time integral | 100% | 100% | 100% | 100% |
| Left ventricular outflow tract diameter | 100% | 100% | 100% | 99% |
| Aortic valve area | 100% | 100% | 100% | 100% |
| End diastolic volume | 100% | 100% | 100% | 100% |
| End systolic volume | 100% | 100% | 100% | 100% |
| End diastolic diameter | 100% | 96% | 100% | 96% |
| End systolic diameter | 100% | 97% | 100% | 96% |
| Bicuspid aortic valve | 100% | 97% | 97% | 100% |
The bicuspid aortic valve natural language processing algorithm used separate development and validation datasets enriched for patients with this condition.
These algorithms met specified performance criteria in the first tested validation set. They were additionally tested in a second validation dataset for further confirmation of accuracy.
Required iteration and testing in 2 validation datasets to meet specified performance criteria.
Required iteration and testing in 3 validation datasets to meet specified performance criteria.
Positive and negative predictive values of natural language processing algorithm and ICD 9/10 diagnostic codes to identify aortic stenosis compared with physician manual adjudication of medical records in the development and validation datasets
| Total echocardiograms in aortic stenosis development and validation datasets | ||
|---|---|---|
| (N = 1003) | PPV | NPV |
| Identification by ICD 9/10 codes | 59% | 96% |
| Identification by NLP algorithm | 99% | 99% |
ICD 9/10 = International Classification of Diseases, Versions 9 and 10; NLP = natural language processing; NPV = negative predictive value; PPV = positive predictive value.
Figure 1Cohort assembly of echocardiograms for adults with aortic stenosis.
Application of validated natural language processing algorithm vs administrative diagnosis codes to identify aortic stenosis among all adult echocardiograms, 2008–2018
| Validated NLP algorithm | Total echocardiograms, n (column %) | |||
|---|---|---|---|---|
| Positive for AS | Negative for AS | |||
| AS ICD 9/10 codes | Positive for AS | 67,297 | 34,514 | 101,811 (11.0) |
| Negative for AS | 36,793 | 789,280 | 826,073 (89.0) | |
| Total echocardiograms, n (row %) | 104,090 (11.2) | 823,794 (88.8) | 927,884 | |
AS = aortic stenosis.
ICD-9 codes included 395.0, 746.3, 396.2, and 424.1; ICD-10 codes included I06.0, I06.2, I35.0, and Q23.0.
Distribution of severity of aortic stenosis based on natural language processing algorithm applied to echocardiogram reports between 2008 and 2018, overall and stratified by the presence or absence of administrative diagnosis codes
| Severity of aortic stenosis | Overall | Diagnostic code for aortic stenosis | No diagnostic code for aortic stenosis |
|---|---|---|---|
| Mild | 44,767 (43.0) | 17,290 (25.7) | 27,477 (74.7) |
| Mild-to-moderate | 7130 (6.9) | 4958 (7.4) | 2172 (5.9) |
| Moderate | 22,888 (22.0) | 19,049 (28.3) | 3839 (10.4) |
| Moderate-to-severe | 6987 (6.7) | 6624 (9.8) | 363 (1.0) |
| Severe | 17,916 (17.2) | 17,457 (25.9) | 459 (1.3) |
| No severity found | 4402 (4.2) | 1919 (2.9) | 2483 (6.7) |
Severity of aortic stenosis was based upon interpreting physician assessment of the echocardiogram identified through validated natural language processing algorithm.