| Literature DB >> 25276338 |
Quinn S Wells1, Eric Farber-Eger2, Dana C Crawford3.
Abstract
BACKGROUND: Measures of cardiac structure and function are important human phenotypes that are associated with a range of clinical outcomes. Studying these traits in large populations can be time consuming and costly. Utilizing data from large electronic medical records (EMRs) is one possible solution to this problem. We describe the extraction and filtering of quantitative transthoracic echocardiographic data from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study, a large, racially diverse, EMR-based cohort (n = 15,863).Entities:
Keywords: Echocardiography; Electronic health records; Natural language processing
Year: 2014 PMID: 25276338 PMCID: PMC4177384 DOI: 10.1186/2043-9113-4-12
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Figure 1Representative graphical representations of raw quantitative echocardiographic parameters. (A) Identity plot of end diastolic diameters highlighting extreme outlier (arrow). (B) Histogram of end diastolic diameter showing two distributions related to measurement units. (C) Representative pairwise scatterplot (in this case end diastolic diameter vs aortic root diameter) shows two primary clusters related to measurement units, but also outliers with discordant units (red boxes).
Figure 2Representative graphical representation of data points in reports with inconsistent use of measurement units (red boxes).
Figure 3End systolic diameter plotted against end diastolic diameter. Data points above the line have anatomically impossible diastolic to systolic diameter ratios.
Figure 4Data before (Panel A) and after (Panel B) harmonization of measurement units. Records using centimeters have been converted to millimeters.
Figure 5Representative graphical representations of cleaned data.
Echocardiograms in EAGLE BioVU
| Total subjects | 15,863 |
| Adults | 13,958 |
| Unique Echocardiograms | 6,076 |
| Unique adult subjects with an echocardiogram | 2,834 |
| Median [IQR] echocardiograms/subjects | 1 [1, 2] |
| Range of echocardiograms/subjects | 1–30 |
Reported values represent number of subjects, median [IQR], or range as appropriate.
Figure 6Frequency of echocardiograms for those subjects with at least 1 echocardiogram performed.
Demographics of entire adult cohort in EAGLE BioVU (N = 13,957)
| Age (years) | 47.5 [33.1–61.1] |
| Gender (female) | 65.4% |
| Race/Ethnicity | |
| Black | 74.4% |
| Hispanic | 9.0% |
| Asian | 7.0% |
| Other | 9.0% |
| Body mass index (kg/m2) | 28.5 [24.1–33.8] |
| HbA1C (%) | 6.1 [5.5–7.0] |
| Serum creatinine (mg/dL) | 0.87 [0.71–1.1] |
| Total cholesterol (mg/dL) | 182 [157–209] |
| LDL (mg/dL) | 105 [83–129] |
Values are reported as median [IQR] or percent (%) as appropriate.
Demographics among individuals with and without echocardiography performed
| N | 2,834 (20.3%) | N | 11,123 (79.7%) | <2.2×10-16 | ||||
| Age (years) | 58.6 [46.2–70.2] | Age (years) | 43.8 [31.3–58.1] | <2.2×10-16 | ||||
| Gender (female) | 1,694 (59.8%) | Gender (female) | 7,429 (66.8%) | 2.9×10-12 | ||||
| Race | N | Within group (%)* | Within race (%)** | Race | | Within group (%)* | Within race (%)** | |
| Black | 2,426 | 85.6 | 23.4 | Black | 7,959 | 71.6 | 76.6 | <2.2×10-16 |
| Hispanic | 119 | 4.2 | 9.5 | Hispanic | 1,130 | 10.2 | 90.5 | |
| Asian | 121 | 4.3 | 12.2 | Asian | 869 | 7.8 | 87.8 | |
| Other | 168 | 5.9 | NA | Other | 1,165 | 10.4 | NA | |
| BMI (kg/m2) | 29.6 [24.8–35.3] | BMI (kg/m2) | 28.2 [23.9–33.3] | 4.1×10-16 | ||||
| HbA1C (%) | 6.2 [5.7–7.3] | HbA1C (%) | 6.0 [5.4–6.9] | 3.7×10-7 | ||||
| Serum creatinine (mg/dL) | 1.0 [0.8–1.5] | Serum creatinine (mg/dL) | 0.8 [0.7–1.03] | <2.2×10-16 | ||||
| Total cholesterol (mg/dL) | 184 [158–213] | Total cholesterol (mg/dL) | 181 [157–208] | 0.02 | ||||
| LDL (mg/dL) | 107 [84–132] | LDL (mg/dL) | 105 [83–128] | 0.06 | ||||
Values are reported as median [IQR] or percent (%) as appropriate.
*Percentage of subjects with or without an echocardiogram performed respectively.
**Percentage of subjects within the specified race.
Demographic and echocardiographic data for African American subjects from population-based cohorts in published GWAS for echocardiographic traits
| | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age (mean ± SD) | 57.9 ± 17.6 | 58.6 ± 15.9 | P = 0.27 | 59 ± 6 | 59 ± 6 | P = 1 | 30 ± 4 | 29 ± 4 | P = 3.4x10-6 | 55 ± 13 | 54 ± 13 | P = 0.04 |
| Echocardiographic traits | ||||||||||||
| N | 1,694 | 1,140 | P-value | 698 | 415 | P-value | 854 | 589 | P-value | 1,884 | 1,115 | P-value |
| LV diastolic dimension, mm | 44.4 ± 7.6 | 48.5 ± 8.8 | <2.2×10-16 | 46 ± 6 | 49 ± 6 | <2.2×10-15 | 48 ± 4.5 | 51 ± 4.7 | <2.2×10-16 | 49 ± 4.1 | 51 ± 4 | <2.2×10-16 |
| Left atrial dimension, mm | 37.4 ± 7.3 | 40.1 ± 8.0 | <2.2×10-16 | 39 ± 6 | 39 ± 6 | 1 | 35 ± 5 | 36 ± 4.8 | 1.4×10-4 | Not Reported | Not Reported | NA |
| Aortic root diameter, mm | 28.4 ± 3.6 | 32.8 ± 4.1 | <2.2×10-16 | 30 ± 4 | 34 ± 4 | <2.2×10-16 | 26 ± 3 | 30 ± 3.5 | <2.2×10-16 | 30 ± 2.8 | 34 ± 3 | <2.2×10-16 |
| Posterior wall thickness, mm | 10.5 ± 2.2 | 11.4 ± 2.5 | <2.2×10-16 | 11 ± 2 | 12 ± 2 | <2.4×10-15 | 8 ± 1 | 9 ± 1.4 | <2.2×10-16 | 8 ± 1 | 9 ± 1 | <2.2×10-16 |
| LV systolic dimension, mm | 29.0 ± 8.4 | 33.3 ± 10.7 | <2.2×10-16 | Not Reported | Not Reported | NA | Not Reported | Not Reported | NA | 29 ± 4 | 32 ± 5 | <2.2×10-16 |
| Interventricular septal wall thickness, mm | 11.0 ± 2.5 | 12.0 ± 2.8 | <2.2×10-16 | 12 ± 2 | 12 ± 3 | 1 | 9 ± 2 | 10 ± 1.6 | <2.2×10-16 | 9 ± 1 | 9 ± 1.5 | 1 |
Values are expressed in mean ± standard deviation or number of subjects as appropriate.
Adapted from reference [17].