| Literature DB >> 34145379 |
Thjodbjorg Eiriksdottir1, Steinthor Ardal1, Benedikt A Jonsson1, Sigrun H Lund1, Erna V Ivarsdottir1, Kristjan Norland1, Egil Ferkingstad1, Hreinn Stefansson1, Ingileif Jonsdottir1,2,3, Hilma Holm1, Thorunn Rafnar1, Jona Saemundsdottir1, Gudmundur L Norddahl1, Gudmundur Thorgeirsson1,2,3, Daniel F Gudbjartsson1,2, Patrick Sulem1, Unnur Thorsteinsdottir1,2, Kari Stefansson4,5, Magnus O Ulfarsson6,7.
Abstract
Predicting all-cause mortality risk is challenging and requires extensive medical data. Recently, large-scale proteomics datasets have proven useful for predicting health-related outcomes. Here, we use measurements of levels of 4,684 plasma proteins in 22,913 Icelanders to develop all-cause mortality predictors both for short- and long-term risk. The participants were 18-101 years old with a mean follow up of 13.7 (sd. 4.7) years. During the study period, 7,061 participants died. Our proposed predictor outperformed, in survival prediction, a predictor based on conventional mortality risk factors. We could identify the 5% at highest risk in a group of 60-80 years old, where 88% died within ten years and 5% at the lowest risk where only 1% died. Furthermore, the predicted risk of death correlates with measures of frailty in an independent dataset. Our results show that the plasma proteome can be used to assess general health and estimate the risk of death.Entities:
Year: 2021 PMID: 34145379 PMCID: PMC8213855 DOI: 10.1038/s42003-021-02289-6
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Characteristics of all study participants by age and sample sets.
| Characteristic | ICP + VSP1 | ICP + VSP1 | dHS | dHS | VSP2 | VSP2 |
|---|---|---|---|---|---|---|
| Men | 9991(43.6) | 4816(47.5) | 3876(44.0) | 1652(44.8) | 2657(39.1) | 1109(42.5) |
| Women | 12,922(56.4) | 5320(52.5) | 4938(56.0) | 2032(55.2) | 4141(60.9) | 1502(57.5) |
| Follow up | 13.7(4.7) | 11.1(5.5) | 1.3(0.7) | 1.3(0.7) | 1.6(1.4) | 1.6(1.4) |
| Age | 56.6 (17.4) | 73.0(7.8) | 55.4(14.7) | 69.1(6.4) | 52.9(16.6) | 70.0(6.7) |
| Age-span | 18–101 | 60–101 | 18–96 | 60–96 | 18–98 | 60–98 |
| BMI | 26.5(4.6) | 26.6(4.4) | 28.6(5.3) | 28.9(5.1) | 27.8(5.4) | 27.7(4.9) |
| T2D | 969(4.2) | 809(8.0) | 437(5.0) | 299(8.1) | 258(3.8) | 177(6.8) |
| Statin use estimate | 1897(8.3) | 1617(16.0) | 1623(18.4) | 1256(34.1) | 1489(21.9) | 1140(43.7) |
| HT medication use | 7676(33.5) | 5599(55.2) | 4137(46.9) | 2553(69.3) | 3292(48.4) | 1962(75.1) |
| Smoker estimate | 2998(13.1) | 792(7.8) | 844(9.6) | 310(8.4) | 621(9.1) | 235(9.0) |
| CAD | 1608(7.0) | 1490(14.7) | 645(7.3) | 570(15.5) | 869(12.8) | 723(27.7) |
| History of MI | 1199(5.2) | 1095(10.8) | 249(2.8) | 202(5.5) | 394(5.8) | 314(12.0) |
| History of Stroke | 568(2.5) | 511(5.0) | 168(1.9) | 129(3.5) | 184(2.7) | 128(4.9) |
| Cancer diagnosis | 5484(23.9) | 3880(38.3) | 675(7.7) | 512(13.9) | 526(7.7) | 400(15.3) |
| Deaths | 7061(30.8) | 6222(61.4) | 25(0.3) | 22(0.6) | 83(1.2) | 74(2.8) |
| Age at death | 81.2(10.7) | 84.0(7.5) | 76.8(10.1) | 79.8(6.6) | 75.1(10.4) | 77.3(8.5) |
| Neoplasms | 2687(38.1) | 2098(33.7) | 12(48.0) | 11(50.0) | 49(59.0) | 43(58.1) |
| Nervous system | 596(8.4) | 550(8.8) | 1(4.0) | 1(4.5) | 3(3.6) | 3(4.1) |
| Circulatory system | 2472(35.0) | 2345(37.7) | 9(36.0) | 7(31.8) | 23(27.7) | 20(27.0) |
| Respiratory system | 544(7.7) | 507(8.1) | 0(0.0) | 0(0.0) | 6(7.2) | 6(8.1) |
| Other causes | 762(10.8) | 722(11.6) | 3(12.0) | 3(13.6) | 2(2.4) | 2(2.7) |
The numbers are number (percent of participants), number (percent of total deaths), mean (sd), or range
Fig. 1Discrimination power of different models for death within 1,2,…,15 years.
a AUCs for all participants, N = 6893. b ROC curves for death within 5 years for all participants. c AUCs for participants older than 60, N = 3052. d ROC curves for death within 5 years for participants older than 60.
Fig. 2Survival of 60–80 years old participants.
The Kaplan–Meier curves for 2488 participants are split by quantiles of predicted 10-year risk by each model, demonstrating the different survival rates in the different risk groups. The colored areas represent 95% confidence intervals. The red dots show survival after 5 and 10 years.
Fig. 3Predicted risk of death within 5 years for different causes of death.
Participants who died within 5 years are shown as one group and categorized by cause of death and those who were alive after 5 years are shown as one group. The yellow center line represents the median, the box limits are the upper and lower quartiles, the whiskers represent the 1.5× interquartile range, and the black dots are data points outside the whisker range.
Correlation of 5-year mortality risk predicted by the protein model and corrected for age and sex with frailty related phenotypes in the dHS dataset.
| Phenotype | All participants | Participants older than 60 | ||||
|---|---|---|---|---|---|---|
| Correlation | Correlation | |||||
| Graded cycle ergometer exercise test: VO2 max | 6930 | −0.15 | 3.5E−36 | 2334 | −0.23 | 3.4E−28 |
| Max grip strength corrected for height | 8737 | −0.10 | 1.1E−21 | 3637 | −0.16 | 1.5E−21 |
| FEV1 | 8015 | −0.15 | 1.3E-40 | 3217 | −0.18 | 1.5E−25 |
| Digit coding: number of correct codes | 8562 | −0.09 | 6.1E−17 | 3530 | −0.14 | 1.5E−17 |
| Trail making test B: time to complete | 8485 | 0.09 | 2.7E−16 | 3475 | 0.10 | 1.1E−09 |
| Resting heart rate | 6688 | 0.08 | 2.0E−10 | 2851 | 0.08 | 6.0E−05 |
| Average length from neck to waist over back adjusted for height | 8022 | 0.08 | 7.0E−12 | 3300 | 0.12 | 1.0E−11 |
| Lean appendicular body mass divided by height squared | 8711 | −0.11 | 2.8E−23 | 3643 | −0.16 | 5.9E−22 |
| Non-HDL cholesterol level, not using statins | 6397 | −0.01 | 6.2E−01 | 2181 | 0.00 | 9.0E−01 |
| Non-HDL cholesterol level, using statins | 1525 | −0.03 | 1.9E−01 | 1181 | −0.04 | 1.7E−01 |
| BMI | 8812 | 0.03 | 8.7E−03 | 3683 | 0.01 | 5.5E−01 |