| Literature DB >> 30837638 |
Ahmed E Fetit1, Alexander S Doney2, Stephen Hogg3, Ruixuan Wang3, Tom MacGillivray4, Joanna M Wardlaw5, Fergus N Doubal5, Gareth J McKay6, Stephen McKenna3, Emanuele Trucco3.
Abstract
Cardiovascular diseases are a public health concern; they remain the leading cause of morbidity and mortality in patients with type 2 diabetes. Phenotypic information available from retinal fundus images and clinical measurements, in addition to genomic data, can identify relevant biomarkers of cardiovascular health. In this study, we assessed whether such biomarkers stratified risks of major adverse cardiac events (MACE). A retrospective analysis was carried out on an extract from the Tayside GoDARTS bioresource of participants with type 2 diabetes (n = 3,891). A total of 519 features were incorporated, summarising morphometric properties of the retinal vasculature, various single nucleotide polymorphisms (SNPs), as well as routine clinical measurements. After imputing missing features, a predictive model was developed on a randomly sampled set (n = 2,918) using L1-regularised logistic regression (lasso). The model was evaluated on an independent set (n = 973) and its performance associated with overall hazard rate after censoring (log-rank p < 0.0001), suggesting that multimodal features were able to capture important knowledge for MACE risk assessment. We further showed through a bootstrap analysis that all three sources of information (retinal, genetic, routine clinical) offer robust signal. Particularly robust features included: tortuousity, width gradient, and branching point retinal groupings; SNPs known to be associated with blood pressure and cardiovascular phenotypic traits; age at imaging; clinical measurements such as blood pressure and high density lipoprotein. This novel approach could be used for fast and sensitive determination of future risks associated with MACE.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30837638 PMCID: PMC6401035 DOI: 10.1038/s41598-019-40403-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Participant characteristics (mean values) in the development and clinical validation sets.
| Characteristics | Model development set | Clinical validation set | ||
|---|---|---|---|---|
| MACE | No MACE | MACE | No MACE | |
| Number of patients | 910 | 2,008 | 309 | 664 |
| Age at imaging (years) | 72.45 | 68.13 | 72.38 | 68.32 |
| Sex (% female) | 43 | 48 | 45 | 48 |
| Time to event or censoring (years) | 3.37 | 7.38 | 3.69 | 7.42 |
| OD radius (pixels) | 198.7 | 195.5 | 199.1 | 196.2 |
| CRAE (pixels) | 32.4 | 32.3 | 32.6 | 32.3 |
| CRVE (pixels) | 42.7 | 42.7 | 43.0 | 42.7 |
| Log of tortA | 10.2 × 10−5 | 10.3 × 10−5 | 10.1 × 10−5 | 9.92 × 10−5 |
| Log of tortV | 6.9 × 10−5 | 6.4 × 10−5 | 7.14 × 10−5 | 6.4 × 10-5 |
| AVR | 0.76 | 0.76 | 0.76 | 0.76 |
| CVD gene score | 4.47 | 4.38 | 4.44 | 4.41 |
| Corrected systolic blood pressure (mmHg) | 141.05 | 141.30 | 141.79 | 142.47 |
| Corrected diastolic blood pressure (mmHg) | 76.54 | 78.95 | 77.29 | 79.43 |
| Cholesterol levels (mmol/L) | 4.25 | 4.36 | 4.25 | 4.36 |
| High density lipoproteins (mmol/L) | 1.30 | 1.36 | 1.32 | 1.38 |
| Log Triglycerides (mmol/L) | 2.19 | 2.10 | 2.14 | 2.08 |
| History of CVD (% yes) | 52 | 23 | 51 | 20 |
| History of smoking (%yes) | 81 | 72 | 79 | 73 |
Retinal length measurements are in pixels to avoid the uncertainty introduced by commonly used pixel-micron conversion factors[30]. Differences in image size and resolution are taken into account by VAMPIRE[11–13]. OD: optic disc; CRAE: central retinal arteriolar equivalent; CRVE: central retinal venular equivalent; tortA: tortuosity of arteries, tortV: tortuosity of veins, AVR: retinal arterio-venule-ratio; CVD: cardiovascular disease.
Figure 1The results of repeated 10-fold cross-validation (CV) experiments on the development set, showing how variation in λ affects binomial deviance. The numbers at the top of the figure indicate numbers of features retained within the regularised models. Interval bars represent standard deviation. The vertical line to the left represents λmin, whereas the one to the right represents λ1SE.
51 features assigned non-zero coefficients; these constitute a regularised model using λmin on the entire development set. For the λ1SE -based model, only 7 features were retained.
| Category | Non-zero coefficient features | |
|---|---|---|
| Using λmin | Using λ1se | |
| Retinal | • odradiuspx | • None selected |
| SNPs | • rs34923683 | • None selected |
| Gene scores | • CVD gene score | • CVD gene score |
| Clinical | • Number of blood pressure lowering drugs taken | • Number of blood pressure lowering drugs taken |
CVD: cardiovascular disease. Readers are referred to Supplementary Material for a detailed explanation of retinal features computed by VAMPIRE.
Figure 2Kaplan-Meier curves for λmin-based model predictions and overall time-to-event analysis of the clinical validation set. Cases were stratified into two groups, high-risk and low-risk, using a pre-defined threshold (i.e., the mean predicted probability for the model-development set when λmin was used in a 10 fold cross-validation).
Figure 3Kaplan-Meier curves for λ1SE-based model predictions and overall time-to-event analysis of the clinical validation set. Cases were stratified into two groups, high-risk and low-risk, using a pre-defined threshold (i.e., the mean predicted probability for the model-development set when λ1SE was used in a 10 fold cross-validation).
Summary of bootstrap analyses.
| Clinical features | Genomic features | Retinal features | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Feature | Frequency (%) | Feature | Frequency (%) | SNP | Frequency (%) | SNP | Frequency (%) | Subcategory | Frequency (%) |
| History of CVD | 100 | Duration of diabetes | 54 | rs3752728 | 95 | rs687621 | 80 | Tortuosity features | 100 |
| Diastolic blood pressure | 99 | Sex | 44 | rs12921187 | 89 | rs2291435 | 78 | Width gradient features | 100 |
| History of smoking | 97 | Cholesterol levels | 31 | rs4308 | 84 | rs2014408 | 77 | ||
| High density lipoprotein | 89 | Systolic blood pressure | 14 | rs2048327 | 83 | rs7136259 | 75 | Branching point features | 100 |
| Glycated hemoglobin | 82 | Corrected systolic blood pressure | 9 | rs9549328 | 82 |
|
| OD radius and/or OD-to-fovea | 83 |
| Number of blood pressure lowering drugs taken | 71 | Corrected diastolic blood pressure | 3 | rs34923683 | 81 | CVD gene score | 84 | Fractal analysis features | 82 |
| rs200999181 | 81 | Alzheimer’s gene score | 40 | Zone B width features | 71 | ||||
| Triglycerides | 66 | Blood pressure gene score | 20 | ||||||
All clinical features are listed with their corresponding frequencies. The three composite gene scores evaluated are included with their frequency distribution. Given the large number of SNPs included, only those selected with a frequency >75% have been included. Retinal features were evaluated as sub-categories given features within each sub-category were highly correlated. OD: optic disc; CVD: cardiovascular disease. Readers are referred to Supplementary Material for a detailed explanation of retinal features computed by VAMPIRE.
Mean, median and standard deviation values of feature coefficients across the 500 regularised models. Individual features occurring at high frequencies (>75% threshold) are listed here.
| Feature | Original scale | Mean β | Median β | Std. Dev. β |
|---|---|---|---|---|
| Age at imaging | years | 0.03 | 0.03 | 0.01 |
| History of smoking | [0, 1] | 0.23 | 0.23 | 0.13 |
| History of CVD | [0, 1] | 1.05 | 1.05 | 0.14 |
| Diastolic blood pressure | mmHg | −0.02 | −0.02 | 0.01 |
| High density lipoprotein | mmol/L | −0.23 | −0.23 | 0.16 |
| Glycated haemoglobin | mmol/mol | 0.04 | 0.04 | 0.04 |
| CVD gene score | raw scores | 0.15 | 0.13 | 0.12 |
| rs3752728 | [0, 1, 2] | 0.15 | 0.14 | 0.09 |
| rs12921187 | [0, 1, 2] | 0.12 | 0.12 | 0.08 |
| rs4308 | [0, 1, 2] | −0.09 | −0.08 | 0.07 |
| rs2048327 | [0, 1, 2] | 0.08 | 0.07 | 0.07 |
| rs9549328 | [0, 1, 2] | 0.10 | 0.09 | 0.08 |
| rs34923683 | [0, 1, 2] | 0.27 | 0.24 | 0.23 |
| rs200999181 | [0, 1, 2] | −0.85 | −0.90 | 0.65 |
| rs687621 | [0, 1, 2] | 0.09 | 0.08 | 0.08 |
| rs2291435 | [0, 1, 2] | −0.07 | −0.06 | 0.07 |
| rs2014408 | [0, 1, 2] | 0.09 | 0.08 | 0.08 |
| rs7136259 | [0, 1, 2] | 0.07 | 0.06 | 0.06 |
Note that the reported coefficients are relative to their corresponding features’ original scales e.g. one year increase in age at imaging corresponds to exp(0.03 +/− 0.01) increase in odds of developing a MACE outcome. Original scales are included in the table for reference. Gene variants are coded as 0, 1 or 2 representing the number of alternate alleles for the particular SNP the individual has inherited; coefficients are therefore the average per step going from 0 to 1 and 1 to 2. CVD: Cardiovascular disease. The CVD gene scores were included as raw values; coefficients are interpreted per unit step in the score.