| Literature DB >> 33334933 |
Yasmina Bauer1,2, Simon de Bernard3, Peter Hickey4,5, Karri Ballard6, Jeremy Cruz6, Peter Cornelisse2, Harbajan Chadha-Boreham7, Oliver Distler8, Daniel Rosenberg7, Martin Doelberg7, Sebastien Roux2, Oliver Nayler2, Allan Lawrie9.
Abstract
Pulmonary arterial hypertension (PAH) is a devastating complication of systemic sclerosis (SSc). Screening for PAH in SSc has increased detection, allowed early treatment for PAH and improved patient outcomes. Blood-based biomarkers that reliably identify SSc patients at risk of PAH, or with early disease, would significantly improve screening, potentially leading to improved survival, and provide novel mechanistic insights into early disease. The main objective of this study was to identify a proteomic biomarker signature that could discriminate SSc patients with and without PAH using a machine learning approach and to validate the findings in an external cohort.Serum samples from patients with SSc and PAH (n=77) and SSc without pulmonary hypertension (non-PH) (n=80) were randomly selected from the clinical DETECT study and underwent proteomic screening using the Myriad RBM Discovery platform consisting of 313 proteins. Samples from an independent validation SSc cohort (PAH n=22 and non-PH n=22) were obtained from the University of Sheffield (Sheffield, UK).Random forest analysis identified a novel panel of eight proteins, comprising collagen IV, endostatin, insulin-like growth factor binding protein (IGFBP)-2, IGFBP-7, matrix metallopeptidase-2, neuropilin-1, N-terminal pro-brain natriuretic peptide and RAGE (receptor for advanced glycation end products), that discriminated PAH from non-PH in SSc patients in the DETECT Discovery Cohort (average area under the receiver operating characteristic curve 0.741, 65.1% sensitivity/69.0% specificity), which was reproduced in the Sheffield Confirmatory Cohort (81.1% accuracy, 77.3% sensitivity/86.5% specificity).This novel eight-protein biomarker panel has the potential to improve early detection of PAH in SSc patients and may provide novel insights into the pathogenesis of PAH in the context of SSc.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33334933 PMCID: PMC8276065 DOI: 10.1183/13993003.02591-2020
Source DB: PubMed Journal: Eur Respir J ISSN: 0903-1936 Impact factor: 16.671
Baseline characteristics of pulmonary arterial hypertension (PAH) and non-pulmonary hypertension (non-PH) systemic sclerosis (SSc) patients in the DETECT Discovery Cohort
| 80 | 77 | 157 | |
| Age years | |||
| Patients | 80 | 76 | 156 |
| Mean± | 55.7±11.5 | 61.5±9.5 | 58.5±11.0 |
| Median (IQR) | 57.0 (48.0–61.0) | 62.0 (55.8–68.0) | 59.0 (51.0–66.0) |
| Minimum–maximum | 26–82 | 39–80 | 26–82 |
| p=0.00059 | |||
| Sex | |||
| Patients | 80 | 77 | 157 |
| Female | 71 (88.8) | 58 (75.3) | 129 (82.2) |
| Male | 9 (11.2) | 19 (24.7) | 28 (17.8) |
| p=0.037 | |||
| BMI kg·m−2 | |||
| Patients | 79 | 76 | 155 |
| Mean± | 26.4±7.0 | 26.3±5.7 | 26.3±6.4 |
| Median (IQR) | 24.6 (21.9–29.1) | 25.6 (22.4–28.7) | 25.1 (22.1–28.8) |
| p=0.73 | |||
| Smoking history | 35/80 (43.8) | 38/77 (49.4) | 73/157 (46.5) |
| p=0.52 | |||
| SSc subtype | |||
| Patients | 80 | 76 | 156 |
| Limited | 46 (57.5) | 55 (72.4) | 101 (64.7) |
| Diffuse | 28 (35.0) | 15 (19.7) | 43 (27.6) |
| Mixed | 6 (7.5) | 6 (7.9) | 12 (7.7) |
| p=0.1 | |||
| Current/past telangiectasias | 59/80 (73.8) | 67/77 (87.0) | 126/157 (80.3) |
| p=0.045 | |||
| Disease severity mRSS | |||
| Patients | 80 | 76 | 156 |
| Mean± | 9.4±8.2 | 11.2±8.2 | 10.3±8.3 |
| Median (IQR) | 6.5 (4.0–11.2) | 9.0 (5.8–14.2) | 8.0 (4.8–13.2) |
| p=0.047 | |||
| Qualitative evaluation of right ventricle pump | |||
| Patients | 80 | 77 | 157 |
| Normal | 79 (98.8) | 60 (77.9) | 139 (88.5) |
| Slightly impaired | 1 (1.2) | 8 (10.4) | 9 (5.7) |
| Moderately impaired | 0 (0) | 5 (6.5) | 5 (3.2) |
| Severely impaired | 0 (0) | 4 (5.2) | 4 (2.5) |
| p=4.2e-05 | |||
| Right atrium area (maximum) cm2 | |||
| Patients | 75 | 73 | 148 |
| Mean± | 13.4±4.6 | 16.8±5.9 | 15.1±5.5 |
| Median (IQR) | 13.0 (10.2–15.3) | 15.5 (13.9–19.0) | 14.0 (11.9–16.9) |
| p=2.7e-05 | |||
| Right ventricle area (maximum) cm2 | |||
| Patients | 76 | 73 | 149 |
| Mean± | 15.1±5.2 | 19.4±6.8 | 17.2±6.4 |
| Median (IQR) | 14.2 (12.0–17.1) | 18.6 (13.6–22.7) | 16.0 (13.0–20.8) |
| p=5.8e-05 | |||
| Right ventricle diameter (maximum) mm | |||
| Patients | 77 | 73 | 150 |
| Mean± | 29.0±8.6 | 32.6±9.0 | 30.8±8.9 |
| Median (IQR) | 29.0 (25.0–32.0) | 32.0 (27.0–36.3) | 30.0 (26.0–35.0) |
| p=0.002 | |||
| TAPSE m | |||
| Patients | 74 | 67 | 141 |
| Mean± | 23.1±4.4 | 21.2±4 | 22.2±4.3 |
| Median (IQR) | 23.2 (20.0–26.0) | 21.0 (19.0–24.0) | 22.0 (19.0–25.0) |
| p=0.0076 | |||
| TRV m·s−1 | |||
| Patients | 75 | 74 | 149 |
| Mean± | 2.4±0.5 | 3.0±0.7 | 2.7±0.7 |
| Median (IQR) | 2.4 (2.1–2.7) | 2.9 (2.5–3.5) | 2.6 (2.3–3.1) |
| p=9.8e-09 | |||
| Presence of right axis deviation | 2/74 (2.7) | 10/75 (13.3) | 12/149 (8.1) |
| p=0.031 | |||
| PVR dyn·s·cm−5 | |||
| Patients | 78 | 77 | 155 |
| Mean± | 141.1±60.9 | 366.0±207.1 | 252.8±189.0 |
| Median (IQR) | 144.6 (100.0–177.3) | 300.0 (226.9–410.3) | 208.2 (136.1–296.9) |
| p=2.8e-20 | |||
| mPAP mmHg | |||
| Patients | 80 | 77 | 157 |
| Mean± | 17.9±3.7 | 32.3±7.8 | 25.0±9.4 |
| Median (IQR) | 18.0 (15.8–21.0) | 29.0 (27.0–36.0) | 24.0 (18.0–29.0) |
| p=2.7e-27 | |||
| PAWP mmHg | |||
| Patients | 79 | 77 | 156 |
| Mean± | 9.1±3.4 | 10.1±3.2 | 9.6±3.3 |
| Median (IQR) | 9.0 (7.0–11.5) | 11.0 (8.0–12.0) | 10.0 (7.0–12.0) |
| p=0.034 | |||
| log10(NT-proBNP) pg·mL−1 | |||
| Patients | 80 | 77 | 157 |
| Mean± | 2.5±0.6 | 2.9±0.5 | 2.7±0.6 |
| Median (IQR) | 2.6 (2.2–2.9) | 3.0 (2.6–3.2) | 2.7 (2.3–3.0) |
| p=4.5e-05 | |||
| FVC % pred | |||
| Patients | 80 | 77 | 157 |
| Mean± | 82.2±20.0 | 90.9±18.2 | 86.4±19.6 |
| Median (IQR) | 82.0 (65.7–96.5) | 89.0 (77.4–101.0) | 84.2 (72.0–99.1) |
| p=0.0075 | |||
| | |||
| Patients | 80 | 77 | 157 |
| Mean± | 47.4±8.7 | 43.2±10.7 | 45.4±9.9 |
| Median (IQR) | 49.1 (41.1–54.1) | 44.0 (36.0–53.0) | 47.3 (39.0–53.2) |
| p=0.016 |
Data are presented as n, n (%) or n/N (%), unless otherwise stated. IQR: interquartile range; BMI: body mass index; mRSS: modified Rodnan skin score; TAPSE: tricuspid annular plane systolic excursion; TRV: tricuspid regurgitant velocity; PVR: pulmonary vascular resistance; mPAP: mean pulmonary arterial pressure; PAWP: pulmonary arterial wedge pressure; NT-proBNP: N-terminal pro-brain natriuretic peptide; FVC: forced vital capacity; DLCO: diffusing capacity of the lung for carbon monoxide. Comparison of PAH and non-PH groups performed using the Wilcoxon rank-sum test for continuous variables (and for the “qualitative evaluation of right ventricle pump” after conversion of the ordered factor levels to 1–4 values) and Fisher's exact test for categorical variables.
Patient characteristics of pulmonary arterial hypertension (PAH) and non-pulmonary hypertension (non-PH) systemic sclerosis (SSc) patients in the Sheffield Confirmatory Cohort
| 22 | 22 | 44 | |
| Age years | |||
| Patients | 22 | 22 | 44 |
| Mean± | 62.3±9.7 | 68.1±8.0 | 65.2±9.2 |
| Median (IQR) | 61.5 (56.3–67.8) | 69.0 (63.3–72.0) | 64.5 (58.8–72.0) |
| Minimum–maximum | 45–84 | 53–84 | 45–84 |
| p=0.026 | |||
| Sex | |||
| Patients | 22 | 22 | 44 |
| Female | 20 (90.9) | 15 (68.2) | 35 (79.5) |
| Male | 2 (9.1) | 7 (31.8) | 9 (20.5) |
| p=0.091 | |||
| BMI kg·m−2 | |||
| Patients | 22 | 20 | 42 |
| Mean± | 27.7±5.8 | 27.5±6.9 | 27.6±6.3 |
| Median (IQR) | 28.1 (23.9–32.6) | 28.1 (21.7–31.7) | 28.1 (21.9–32.0) |
| p=0.782 | |||
| Smoking history | 8/19 (42.1) | 13/19 (68.4) | 21/38 (55.3) |
| p=1 | |||
| SSc subtype | |||
| Patients | 22 | 22 | 44 |
| Limited | 21 (95.5) | 22 (100) | 43 (97.7) |
| Diffuse | 1 (4.5) | 0 (0) | 1 (2.3) |
| Mixed | 0 (0) | 0 (0) | 0 (0) |
| p=1 | |||
| Qualitative evaluation of right ventricle pump | |||
| Patients | 22 | 22 | 44 |
| Normal | 18 (81.8) | 11 (50.0) | 29 (65.9) |
| Mild impairment | 4 (18.2) | 2 (9.1) | 6 (13.6) |
| Moderate impairment | 0 (0) | 1 (4.5) | 1 (2.3) |
| Severe impairment | 0 (0) | 8 (36.4) | 8 (18.2) |
| p=0.0083 | |||
| Right atrium area (maximum) cm2 | |||
| Patients | 22 | 22 | 44 |
| Mean± | 16.0±5.4 | 22.5±9.6 | 19.2±8.4 |
| Median (IQR) | 15.4 (11.5–17.1) | 19.8 (13.7–29.9) | 16.9 (13.2–23.7) |
| p=0.018 | |||
| TRV m·s−1 | |||
| Patients | 17 | 22 | 39 |
| Mean± | 2.7±0.3 | 3.7±0.8 | 3.2±0.8 |
| Median (IQR) | 2.7 (2.5–2.8) | 3.4 (3.1–4.2) | 3.1 (2.7–3.5) |
| p=1.98e-05 | |||
| Presence of right axis deviation | 1/20 (5.0) | 9/22 (40.9) | 10/42 (23.8) |
| p=0.45 | |||
| PVR dyn·s·cm−5 | |||
| Patients | 22 | 22 | 44 |
| Mean± | 146.4±56.5 | 548.2±375.4 | 347.3±334.2 |
| Median (IQR) | 139.5 (116.8–170.8) | 396.0 (271.2–850.2) | 219.0 (142.8–395.5) |
| p=1.5e-09 | |||
| mPAP mmHg | |||
| Patients | 22 | 22 | 44 |
| Mean± | 20.7±3.1 | 41.7±15.3 | 31.2±15.2 |
| Median (IQR) | 21.0 (19.0–22.0) | 38.0 (30.0–52.0) | 26.0 (21.0–37.0) |
| p=3.0e-08 | |||
| PAWP mmHg | |||
| Patients | 22 | 22 | 44 |
| Mean± | 9.1±3.8 | 11.7±4.9 | 10.4±4.5 |
| Median (IQR) | 8.5 (7.0–11.8) | 12.0 (8.0–14.0) | 10.0 (7.8–13.2) |
| p=0.071 | |||
| log10(NT-proBNP) pg·mL−1 | |||
| Patients | 22 | 22 | 44 |
| Mean± | 2.6±0.5 | 3.3±0.5 | 3.0±0.6 |
| Median (IQR) | 2.7 (2.4–2.9) | 3.3 (3.0–3.7) | 2.9 (2.6–3.3) |
| p=9.3e-05 | |||
| FVC % pred | |||
| Patients | 22 | 22 | 44 |
| Mean± | 90.8±22.6 | 101.2±15.7 | 96.0±20.0 |
| Median (IQR) | 94.7 (73.4–107.9) | 100.8 (90.9–111.5) | 96.3 (80.9–110.9) |
| p=0.13 | |||
| | |||
| Patients | 21 | 22 | 43 |
| Mean± | 56.2±13.4 | 43.9±12.7 | 49.9±14.3 |
| Median (IQR) | 55.1 (46.7–67.3) | 43.6 (39.6–47.0) | 46.7 (42.0–56.5) |
| p=0.002 |
Data are presented as n, n (%) or n/N (%), unless otherwise stated. IQR: interquartile range; BMI: body mass index; CTPA: computed tomography pulmonary angiography; MRI: magnetic resonance imaging; TRV: tricuspid regurgitant velocity; PVR: pulmonary vascular resistance; mPAP: mean pulmonary arterial pressure; PAWP: pulmonary arterial wedge pressure; NT-proBNP: N-terminal pro-brain natriuretic peptide; FVC: forced vital capacity; DLCO: diffusing capacity of the lung for carbon monoxide. Comparison of PAH and non-PH groups performed using the Wilcoxon rank-sum test for continuous variables (and for the “qualitative evaluation of right ventricle pump” after conversion of the ordered factor levels to 1–4 values) and Fisher's exact test for categorical variables.
FIGURE 1Patients and analytes for a) the DETECT Discovery Cohort and b) the Sheffield Confirmatory Cohort. RHC: right heart catheterisation; PH: pulmonary hypertension; PAH: pulmonary arterial hypertension; STH-ObS: The Sheffield Teaching Hospitals Observational Study of Patients with Pulmonary Hypertension, Cardiovascular and Lung Disease; CTD: connective tissue disease; ILD: interstitial lung disease. #: enrolled 2008–2011; ¶: enrolled 2008–2015; +: 271 protein analytes passed quality control in the DETECT Discovery Cohort and 238 protein analytes passed quality control in the Sheffield Confirmatory Cohort (238 protein analytes were suitable for investigation in both cohorts).
FIGURE 2Variables (proteins) of importance to classify pulmonary arterial hypertension. Variable importance output of random forests applied to a) the DETECT Discovery Cohort, b) the Sheffield Confirmatory Cohort and c) 238 common proteins between the two cohorts, applied on the DETECT Discovery Cohort. The plots show the most important variables (y-axis) as assessed by the mean decrease of the Gini index (x-axis). Proteins are ordered top to bottom as most to least important. The eight common variables in all analyses appear in red. See supplementary table S1 for details of the proteins on the Myriad RBM Discovery platform.
FIGURE 3Serum concentrations of the eight best-performing and common proteins in predicting pulmonary arterial hypertension (PAH) in the DETECT Discovery Cohort and the Sheffield Confirmatory Cohort: a) collagen IV, b) endostatin, c) insulin-like growth factor binding protein (IGFBP)-2, d) IGFBP-7, e) matrix metallopeptidase (MMP)-2, f) neuropilin-1, g) N-terminal pro-brain natriuretic peptide (NT-proBNP) and h) RAGE (receptor for advanced glycation end products). PH: pulmonary hypertension. Boxes indicate median and interquartile range; whiskers indicate the full range of the data. Individual patient samples are represented by dots. p-values from the Wilcoxon rank-sum test between the two patient groups.
FIGURE 4a, b) Performance of the panel of six common protein biomarkers in a) the DETECT Discovery Cohort and b) the Sheffield Confirmatory Cohort: receiver operating characteristic (ROC) curves of the pulmonary arterial hypertension (PAH) versus non-pulmonary hypertension (non-PH) classifier. ROC-AUC: area under the ROC curve; RAGE: receptor for advanced glycation end products; IGFBP: insulin-like growth factor binding protein; MMP: matrix metallopeptidase; SSc: systemic sclerosis; NT-proBNP: N-terminal pro-brain natriuretic peptide. The six selected proteins are the subset from the eight common proteins that produced the best ROC-AUC in the DETECT Discovery Cohort (0.751). c, d) Addition of c) NT-proBNP or d) NT-proBNP plus neuropilin-1 to the six selected proteins.
FIGURE 5Sparse partial least squares association of pulmonary vascular resistance (PVR) to six common biomarker proteins: a) N-terminal pro-brain natriuretic peptide (NT-proBNP), b) RAGE (receptor for advanced glycation end products), c) insulin-like growth factor binding protein (IGFBP)-7, d) pyruvate carboxylase (cFib), e) vascular cell adhesion molecule (VCAM)-1 and f) surfactant protein D (SP-D). Correlation plots for each individual biomarker variable with PVR, showing Pearson's correlation coefficient between the logarithm of the two variables and the corresponding p-value.