| Literature DB >> 35239643 |
Chris Lauber1,2, Mathias J Gerl1, Christian Klose1, Filip Ottosson3, Olle Melander3, Kai Simons1.
Abstract
Type 2 diabetes (T2D) and cardiovascular disease (CVD) represent significant disease burdens for most societies and susceptibility to these diseases is strongly influenced by diet and lifestyle. Physiological changes associated with T2D or CVD, such has high blood pressure and cholesterol and glucose levels in the blood, are often apparent prior to disease incidence. Here we integrated genetics, lipidomics, and standard clinical diagnostics to assess future T2D and CVD risk for 4,067 participants from a large prospective population-based cohort, the Malmö Diet and Cancer-Cardiovascular Cohort. By training Ridge regression-based machine learning models on the measurements obtained at baseline when the individuals were healthy, we computed several risk scores for T2D and CVD incidence during up to 23 years of follow-up. We used these scores to stratify the participants into risk groups and found that a lipidomics risk score based on the quantification of 184 plasma lipid concentrations resulted in a 168% and 84% increase of the incidence rate in the highest risk group and a 77% and 53% decrease of the incidence rate in lowest risk group for T2D and CVD, respectively, compared to the average case rates of 13.8% and 22.0%. Notably, lipidomic risk correlated only marginally with polygenic risk, indicating that the lipidome and genetic variants may constitute largely independent risk factors for T2D and CVD. Risk stratification was further improved by adding standard clinical variables to the model, resulting in a case rate of 51.0% and 53.3% in the highest risk group for T2D and CVD, respectively. The participants in the highest risk group showed significantly altered lipidome compositions affecting 167 and 157 lipid species for T2D and CVD, respectively. Our results demonstrated that a subset of individuals at high risk for developing T2D or CVD can be identified years before disease incidence. The lipidomic risk, which is derived from only one single mass spectrometric measurement that is cheap and fast, is informative and could extend traditional risk assessment based on clinical assays.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35239643 PMCID: PMC8893343 DOI: 10.1371/journal.pbio.3001561
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 9.593
AUC classification metric for the models underlying the different risk scores for T2D and CVD.
We used class prediction probabilities averaged over the 10 independent cross-validation iterations during the AUC calculation.
| model | T2D | CVD |
|---|---|---|
| N | 0.533 | 0.517 |
| P | 0.609 | 0.507 |
| L | 0.728 | 0.661 |
| N + L + P | 0.745 | 0.634 |
| N + L + P + C | 0.797 | 0.659 |
AUC, area under the curve; CVD, cardiovascular disease; T2D, type 2 diabetes.
Fig 2Lipidomic risk score versus time to first incidence.
The time in years to the first incidence event is compared to the L risk score for T2D (A) and CVD (B). Pearson’s r = −0.107 and −0.169 for T2D and CVD, respectively. The risk scores are average values from 10 independent replications. The curve shows a least squares fit of a linear model to the data. The data underlying this figure may be found in S1 Data. CVD, cardiovascular disease; T2D, type 2 diabetes.
Fig 5Lipidome-based risk subgroups.
Shown is a hierarchical clustering of 3,599 plasma samples that is solely based on 184 lipid concentrations. Pearson correlation has been used as distance measure. The tips of the dendrogram are colored according to the 6 subgroups obtained by splitting the dendrogram into the 6 most basal clusters. (B) Distribution of age (left) and sex (right) in the 6 subgroups. The dashed horizontal line indicates the average female percentage in the full cohort. (C) Distribution of L risk scores for the 6 subgroups for T2D (left) and CVD (right). (D) The 2 risk scores for T2D and CVD are compared for all participants; note that cases are defined here as being either a T2D or CVD incidence. The line indicates a least squares fit of a linear model to the data; Pearson’s r = 0.613. (E) Same as C but only for participants belonging to subgroup 6; Pearson’s r = 0.504. The data underlying this figure may be found in S1 Data. CVD, cardiovascular disease; T2D, type 2 diabetes.
Fig 1Correlation of different risk scores with future disease incidence rate.
Different risk scores are sorted and partitioned into deciles; for each decile, the fraction of future T2D (A) and CVD (C) incidences is shown, with points indicating the mean over 10 independent repetitions and the bars the associated standard errors of the mean. The risk scores differ with respect to the included predictor variables: null model (N), lipidome (L), polygenic score (P), 7 standard clinical or vital measures (C). Different predictor types may be combined to derive composite scores like N + L + P comprising the null model, lipidome and polygenic predictors. The horizontal dashed line shows the average incidence rate across the full cohort. The effect sizes of individual predictor variables from each model are shown for T2D (B) and CVD (D) grouped by lipid class or predictor type. The data underlying this figure may be found in S1 Data. BMI, body mass index, sum of absolute effect sizes across all lipid species/subspecies (total); CE, cholesteryl ester; Cer, ceramide; Chol, cholesterol; CVD, cardiovascular disease; DAG, diacylglyceride; FBG, fasting blood glucose; HbA1c, glycated hemoglobin; HDL, high-density lipoprotein; LDL, low-density lipoprotein; LPC, lysophosphatidylcholine; LPE, lysophosphatidylethanolamine; PC, phosphatidylcholine; PC O-, ether- phosphatidylcholine; PE, phosphatidylethanolamine; PE O-, ether- phosphatidylethanolamine; PI, phosphatidylinositol; PRS, polygenic risk score; SBP, systolic blood pressure; SM, sphingomyelin; TAG, triacylglyceride; TRIGL, triglyceride; T2D, type 2 diabetes.
Correlation test results of L and C risk scores with time to T2D or CVD incidence.
Test statistic (t), p-value (p), Pearson’s product–moment correlation coefficient (r) and its 95% confidence interval (CI95%), and the number of observations (n) are shown.
| disease | model | t | p |
| CI95% | n |
|---|---|---|---|---|---|---|
| T2D | L | −2.41 | 0.016 | −0.107 | [−0.193, −0.020] | 500 |
| T2D | C | −6.98 | 9.3 × 10−12 | −0.299 | [−0.376, −0.217] | 500 |
| CVD | L | −4.98 | 7.8 × 10−07 | −0.169 | [−0.233, −0.102] | 850 |
| CVD | C | −6.88 | 1.2 × 10−11 | −0.230 | [−0.293, −0.165] | 850 |
CVD, cardiovascular disease; T2D, type 2 diabetes.
Comparison of vital, clinical, genetic, and lipidomic variables between participants from subgroup 6 and the other participants.
t Tests were used except for the fraction of males for which a chi-squared test on the absolute counts was applied. p-Values were adjusted for multiple testing; bold font indicates adjusted p-values smaller than 0.05.
| parameter | type | mean subgroups 1 + 2 + 3 + 4 + 5 | mean subgroup 6 | log2 fold-change | adjusted |
|---|---|---|---|---|---|
| age | vital | 57.4 | 56.9 | −0.013 | 0.309 |
| male% | vital | 0.389 | 0.435 | 0.159 | 0.335 |
| BMI | vital | 25.3 | 25.7 | 0.021 | 0.259 |
| SBP | clinical | 140.8 | 142.8 | 0.021 | 0.187 |
| HDL | clinical | 1.418 | 1.346 | −0.075 |
|
| LDL | clinical | 4.158 | 4.251 | 0.032 | 0.260 |
| TRIGL | clinical | 1.253 | 1.349 | 0.107 | 0.064 |
| FBG | clinical | 4.898 | 5.021 | 0.036 |
|
| HbA1c | clinical | 4.798 | 4.793 | −0.002 | 0.900 |
| PRS | genetic | −0.178 | −0.178 | −0.0001 | 0.402 |
| SM | lipid class | 167.3 | 204.5 | 0.290 |
|
| CE | lipid class | 5,774.2 | 4,952.2 | −0.222 |
|
| LPE | lipid class | 5.0 | 4.1 | −0.315 |
|
| PE O- | lipid class | 28.5 | 22.4 | −0.350 |
|
| LPC | lipid class | 142.8 | 170.8 | 0.259 |
|
| PE | lipid class | 13.8 | 11.2 | −0.299 |
|
| Cer | lipid class | 6.5 | 5.7 | −0.170 |
|
| Chol | lipid class | 961.6 | 1,104.8 | 0.200 |
|
| PI | lipid class | 37.6 | 40.7 | 0.113 |
|
| PC | lipid class | 1,672.8 | 1,730.1 | 0.049 | 0.058 |
| DAG | lipid class | 26.9 | 28.0 | 0.055 | 0.336 |
| PC O- | lipid class | 61.8 | 60.6 | −0.029 | 0.336 |
| TAG | lipid class | 1,278.6 | 1,288.3 | 0.011 | 0.836 |
BMI, body mass index; CE, cholesteryl ester; Cer, ceramide; Chol, cholesterol; DAG, diacylglyceride; FBG, fasting blood glucose; HbA1c, glycated hemoglobin; HDL, high-density lipoprotein; LDL, low-density lipoprotein; LPC, lysophosphatidylcholine; LPE, lysophosphatidylethanolamine; PC, phosphatidylcholine; PC O-, ether- phosphatidylcholine; PE, phosphatidylethanolamine; PE O-, ether- phosphatidylethanolamine; PI, phosphatidylinositol; PRS, polygenic risk score; SBP, systolic blood pressure; SM, sphingomyelin; TAG, triacylglyceride; TRIGL, triglyceride.