| Literature DB >> 35127868 |
Avan Suinesiaputra1,2, Charlène A Mauger1, Bharath Ambale-Venkatesh3, David A Bluemke4, Josefine Dam Gade5, Kathleen Gilbert6, Markus H A Janse7, Line Sofie Hald5, Conrad Werkhoven6, Colin O Wu8, Joao A C Lima3, Alistair A Young9.
Abstract
The Multi-Ethnic Study of Atherosclerosis (MESA), begun in 2000, was the first large cohort study to incorporate cardiovascular magnetic resonance (CMR) to study the mechanisms of cardiovascular disease in over 5,000 initially asymptomatic participants, and there is now a wealth of follow-up data over 20 years. However, the imaging technology used to generate the CMR images is no longer in routine use, and methods trained on modern data fail when applied to such legacy datasets. This study aimed to develop a fully automated CMR analysis pipeline that leverages the ability of machine learning algorithms to enable extraction of additional information from such a large-scale legacy dataset, expanding on the original manual analyses. We combined the original study analyses with new annotations to develop a set of automated methods for customizing 3D left ventricular (LV) shape models to each CMR exam and build a statistical shape atlas. We trained VGGNet convolutional neural networks using a transfer learning sequence between two-chamber, four-chamber, and short-axis MRI views to detect landmarks. A U-Net architecture was used to detect the endocardial and epicardial boundaries in short-axis images. The landmark detection network accurately predicted mitral valve and right ventricular insertion points with average error distance <2.5 mm. The agreement of the network with two observers was excellent (intraclass correlation coefficient >0.9). The segmentation network produced average Dice score of 0.9 for both myocardium and LV cavity. Differences between the manual and automated analyses were small, i.e., <1.0 ± 2.6 mL/m2 for indexed LV volume, 3.0 ± 6.4 g/m2 for indexed LV mass, and 0.6 ± 3.3% for ejection fraction. In an independent atlas validation dataset, the LV atlas built from the fully automated pipeline showed similar statistical relationships to an atlas built from the manual analysis. Hence, the proposed pipeline is not only a promising framework to automatically assess additional measures of ventricular function, but also to study relationships between cardiac morphologies and future cardiac events, in a large-scale population study.Entities:
Keywords: MRI; cardiac anatomy; deep learning; left ventricle; machine learning
Year: 2022 PMID: 35127868 PMCID: PMC8813768 DOI: 10.3389/fcvm.2021.807728
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
Figure 1Fully-automated atlas generation pipeline of cardiac MRI analyses. Three deep learning networks were trained to perform: (1) detection of mitral valve points from long-axis (LAX) images, from both two-chamber or four-chamber views, (2) detection of right ventricular (RV) insert points from short-axis (SAX) images, and (3) segmentation of myocardium mask from SAX images. Landmark points and contours from myocardium mask images were converted into 3D patient coordinates to guide the customization of a left ventricle (LV) model. Breath-hold mis-registration of SAX slices were corrected. The final model was used to construct a statistical shape LV atlas.
Patient demographics from the MESA cohort.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
| 5,003 | 2,372 | 1,545 | 1,052 | |
| Age (years) | 61.5 (10.1) | 61.3 (10.1) | 61.0 (10.2) | 60.1 (9.8) | |
| Gender | Female | 2,622 (52.4) | 1,230 (51.9) | 814 (52.7) | 430 (40.9) |
| Male | 2,381 (47.6) | 1,142 (48.1) | 731 (47.3) | 622 (59.1) | |
| SBP (mmHg) | 125.4(21.3) | 126.2 (21.9) | 126.4 (22.0) | 124.8 (20.2) | |
| DBP (mmHg) | 71.8 (10.30) | 71.6 (10.3) | 71.7 (10.3) | 73.6 (10.1) | |
| Heart Rate (bpm) | 62.8 (9.5) | 62.7 (9.5) | 62.9 (9.5) | 62.1 (9.6) | |
| Diabetes | Yes | 459 (9.2) | 232 (9.8) | 162 (10.5) | 74 (7.0) |
| No | 4,544 (90.8) | 2,140 (90.2) | 1,383 (89.5) | 978 (93.0) | |
| Hypertension | Yes | 1,766 (35.3) | 805 (34.0) | 539 (34.9) | 373 (35.5) |
| No | 3,234 (64.7) | 1,566 (66.0) | 1,005 (65.1) | 677 (64.5) | |
| Smoking status | Never | 2,569 (51.5) | 1,237 (52.3) | 805 (52.4) | 511 (48.6) |
| Former | 1,786 (35.8) | 824 (34.9) | 521 (33.9) | 394 (37.5) | |
| Current | 634 (12.7) | 302 (12.8) | 209 (13.6) | 146 (13.9) | |
| Framingham score | 13.9 (9.5) | 14.1 (9.5) | 14.0 (9.6) | 13.7 (9.2) |
Two sub-cohorts were defined to train and validate deep learning networks for landmark detection and segmentation. Another sub-cohort, disjoint from the two training datasets, was defined for validation of the atlas generated from automated compared with core lab manual analysis. Continuous variables are written as mean (standard deviation), while categorical variables are written as count (percentage). Statistical tests were performed between a sub-cohort against its complement with one-way ANOVA for continuous variables and .
p < 0.05,
p < 0.01,
p < 0.001 for difference between a particular sub-cohort and the rest of the MESA CMR cohort.
Figure 2Division of MESA cases into two independent sets of Atlas Validation and Training sub-cohorts. Within the Training sub-cohort, cases were divided into training, validation and testing sub-groups for the different deep learning networks (Segmentation Network and Landmark Detection Network).
Landmark distance errors from neural networks trained independently compared to networks trained with our training strategy.
|
|
| |
|---|---|---|
| Two-chamber | 2.98 (1.44) | 1.53 (0.74) |
| Four-chamber | 3.24 (1.55) | 1.44 (0.74) |
| Short axis | 2.94 (1.6) | 2.07 (1.11) |
Error values were measured on 232 validation cases and shown as mean (standard deviation). All values are in millimeters.
Figure 3Distributions of distances between landmark points identified by the landmark detection method (Auto) and the two analysts (Obs1 and Obs2). Median (solid line), quartiles (thin lines) outliers (red points).
Differences and intraclass correlation (ICC) values in detecting landmarks on 50 validation cases.
|
|
|
| |
|---|---|---|---|
|
|
|
| |
| Auto vs. Obs1 | 1.86 (1.19) | 2.09 (1.32) | 2.29 (2.15) |
| Auto vs. Obs2 | 1.81 (1.21) | 2.19 (1.28) | 2.27 (1.61) |
| Obs1 vs. Obs2 | 1.78 (1.16) | 2.24 (1.68) | 2.67 (2.29) |
| ICC value | 0.998 | 0.996 | 0.995 |
All difference values are expressed mean (standard deviation) from the Euclidean distance between annotations in millimeters. N is the number of cases.
Figure 4Examples of automated landmark detection (red markers) compared with manually defined placements by two observers (blue and green markers). The top row shows cases with the maximum distance of automated detection to one of the observers while interobserver distances are small. The bottom row shows cases with the largest interobserver distances.
Dice score results of the segmentation network from the test dataset with 2,465 images.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Cavity | ED | 0.92 | 0.95 | 0.97 | 0.93 | 0.07 |
| ES | 0.86 | 0.91 | 0.94 | 0.88 | 0.11 | |
| Myocardium | ED | 0.85 | 0.89 | 0.91 | 0.87 | 0.07 |
| ES | 0.89 | 0.92 | 0.94 | 0.90 | 0.08 |
Frames indicate end-diastole (ED) and end-systole (ES). The 25th quartile (Q1), median, and 75th quartile (Q3) are shown, together with means and standard deviations.
Figure 5Examples of short axis segmentation network results. Top row, base; middle row, mid-ventricle; bottom row, apex. Manual contours are in red while automated contours are in blue. A range of Dice score results are shown.
Comparisons of indexed LV volumes, ejection fraction and mass from the 155 test cases between the predicted segmentation results with manual contours.
|
|
|
|
|---|---|---|
| LVEDVi (mL/m2) | 0.98 ( | −0.02 (2.6) |
| LVESVi (mL/m2) | 0.95 ( | −0.46 (2.3) |
| LVEF (%) | 0.92 ( | 0.69 (3.3) |
| LVMi (g/m2) | 0.92 ( | 3.0 (6.4) |
The differences are written as mean (standard deviation).
Figure 6Differences between automated analysis (Auto) and manually drawn contours (Man). Solid lines are mean differences and dashed lines are the limits of agreement within ±1.96 × standard deviation from the mean. The mean difference values are shown in Table 5.
Area under the ROC curve (AUC) comparisons from the 1,052 LV shape association studies using different contours: manual (Man) and deep learning (Auto).
|
|
| ||
|---|---|---|---|
|
|
| ||
| Hypertension | 0.69 | 0.71 | 0.22 |
| Diabetes | 0.56 | 0.53 | 0.34 |
| Smoking status | 0.59 | 0.61 | 0.33 |
| Cholesterol | 0.50 | 0.54 | 0.02 |
| Calcium score | 0.61 | 0.61 | 0.99 |
Figure 7An example of fully automated CMR pipeline result as a patient-specific LV model. Intermediate predictions of the myocardial contours (in blue) and landmark points (yellow circles) are shown in each corresponding DICOM image. Manual contours are shown in red. The intersection contours between the 3D LV model with the images are shown in green. This particular example demonstrates how failed segmentation contours (in apex and base slices) do not affect the final LV model, which are clearly shown in the LAX intersection contours.