| Literature DB >> 31844116 |
Marie Cuvelliez1,2, Vincent Vandewalle3,4, Maxime Brunin4, Olivia Beseme1,2, Audrey Hulot4, Pascal de Groote1,2, Philippe Amouyel1, Christophe Bauters1,2, Guillemette Marot3,4, Florence Pinet5,6.
Abstract
Heart failure (HF) remains a main cause of mortality worldwide. Risk stratification of patients with systolic chronic HF is critical to identify those who may benefit from advanced HF therapies. The aim of this study is to identify plasmatic proteins that could predict the early death (within 3 years) of HF patients with reduced ejection fraction hospitalized in CHRU de Lille. The subproteome targeted by an aptamer-based technology, the Slow Off-rate Modified Aptamer (SOMA) scan assay of 1310 proteins, was profiled in blood samples from 168 HF patients, and 203 proteins were significantly modulated between patients who died of cardiovascular death and patients who were alive after 3 years of HF evaluation (Wilcoxon test, FDR 5%). A molecular network was built using these 203 proteins, and the resulting network contained 2281 molecules assigned to 34 clusters annotated to biological pathways by Gene Ontology. This network model highlighted extracellular matrix organization as the main mechanism involved in early death in HF patients. In parallel, an adaptive Least Absolute Shrinkage and Selection Operator (LASSO) was performed on these 203 proteins, and six proteins were selected as candidates to predict early death in HF patients: complement C3, cathepsin S and F107B were decreased and MAPK5, MMP1 and MMP7 increased in patients who died of cardiovascular causes compared with patients living 3 years after HF evaluation. This proteomic signature of 6 circulating plasma proteins allows the identification of systolic HF patients with a risk of early death.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31844116 PMCID: PMC6914779 DOI: 10.1038/s41598-019-55727-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Baseline characteristics of the patients included in the INCA study.
| Cardiovascular death | No cardiovascular death | ||
|---|---|---|---|
| Age (years) | 58.73 ± 10.67 | 59 ± 10 | 1 |
| Male | 78 | 78 | 1 |
Ischaemic Non-ischaemic | 51 33 | 51 33 | 1 |
| Diabetes mellitus | 53 | 51 | 0.874 |
1 2 3 | 1 52 31 | 8 61 15 | 0.003 |
| LV ejection fraction (%) | 28.11 ± 9.95 | 29.30 ± 9.20 | 0.440 |
| Peak VO2 (mL/min/kg) | 10.5 ± 1.73 | 21 ± 5.19 | 1.933E-07 |
Low Intermediate High | 14 38 29 | 36 25 20 | 4.550E-04 |
| Creatinine (mg/L) | 12.55 ± 3.55 | 11.12 ± 2.63 | 0.003 |
ACE inhibitors β-Blockers Diuretics | 77 76 75 | 77 78 65 | 1 0.577 0.038 |
*BNP was measured by a radio-immuno-assay (Shionoria BNP kit, Shionogi & Co. Ltd., Osaka, Japan) from 1998 to 2003 and by the Triage BNP assay (Biosite diagnostics Inc., San Diego, CA, USA) from 2003 to 2005. The BNP level was categorized as low (deciles 1, 2 and 3), intermediate (deciles 4, 5, 6 and 7) or high (deciles 8, 9 and 10) for each individual patient. Continuous variables are expressed as mean ± standard deviation (SD), and the means of the two groups were compared using Student’s t-test. Categorical variables are presented as absolute number and/or percentages whose distribution between the two groups was compared using the χ2 test or the Fisher test, as appropriate.
Figure 1Overview of the study. Patients with systolic HF evaluation from the INCA study (84 with CV death (cases) and 84 alive (controls) were selected for measurement of 1310 plasma proteins by SOMAscan assay. By the Wilcoxon test, we identified 203 modulated proteins between patients who died of CV causes and the patients who were alive 3 years after HF evaluation. A molecular network was built based on these 203 proteins (see Supplementary Table S1). In parallel, we used an adaptive LASSO on the 203 modulated proteins. Six proteins were selected and linked to the INCA network built. Four proteins were measured by conventional assays (ELISA and Luminex technology) for validation in a subpopulation (33 cases and 33 controls).
Figure 2Subnetwork of the INCA molecular network model containing seed nodes that are directly interacting. This INCA subnetwork includes all significantly changed proteins (seed nodes) and their direct interactions for 8 clusters (1, 2, 3, 4, 7, 13, 15 and 27). Node size is scaled by the betweenness centrality in the full network model, meaning that a higher size corresponds to a higher centrality. The colour of the nodes represents the log2FC (or log2 ratio) of the comparison between the “case” and “control” groups. Proteins, microRNAs (miRNAs) and metabolites are indicated by different forms. Blue colour indicates decreased expression, and red colour indicates increased expression in patients who died of CV causes. Edges are coloured by cluster assignment in the network model (for corresponding pathway annotations, see Supplementary Table S4).
Top 10 nodes with the highest betweenness centrality in the INCA network model.
| Name/UniProt ID* | Full name of proteins | Betweenness Centrality≠ | Cluster≠ | Best pathways‡ |
|---|---|---|---|---|
| RPS27A/P62979 | Ribosomal protein S27a | 0.097 | 9 | Cell cycle |
| STAT3/P40763 | Signal transducer and activator of transcription 3 | 0.071 | 11 | JAK-STAT cascade |
| MK14/Q16539 | Mitogen-activated protein kinase 14 | 0.062 | 5 | DNA methylation |
| RAC1/P63000 | Ras-related C3 botulinum toxin substrate 1 | 0.060 | 1 | Immune response |
| HSP90A/P07900 | Heat shock protein 90 kDa alpha family class A member 1 | 0.050 | 2 | Plasma membrane |
| STAT1/P42224 | Signal transducer and activator of transcription 1 | 0.049 | 11 | JAK-STAT cascade |
| SRC/P12931 | SRC proto-oncogene | 0.047 | 1 | Immune response |
| VAV1/P15498 | Vav guanine nucleotide exchange factor 1 | 0.045 | 10 | G protein signalling |
| 1433Z/P63104 | Tyrosine 3-monooxygenase/tryptophane 5-monooxygenase activation protein, zeta | 0.044 | 2 | Plasma membrane |
| CRK/P46108 | V-crk avian sarcoma virus CT10 oncogene homologue | 0.042 | 1 | Immune response |
*Name and UniProt ID are provided from the UniProtKB database (https://www.uniprot.org); ≠ Information provided from the INCA network and supplementary Table S4; ‡Information provided by GO (Gene Ontology, http://geneontology.org/).
List of the 6 candidate proteins selected by LASSO analysis.
| Protein full name (Protein) | Fold-change | Frequency‡ | Betweenness centrality/Rank≈ | Cluster in INCA network | Best GO pathways≡ | |
|---|---|---|---|---|---|---|
Complement C3 (C3) P01024 | 0.65 (104221 ± 4830 | 3. 18 10−7 | 1 | 0.037/13 | 3 | Protein G |
Mitogen-activated protein kinase-activated protein kinase 5 (MAPK5) Q8IW41 | 1.17 (484.3 ± 15.5 | 0.0066 | 0.97 | 0.017/30 | 6 | Diseases of immune system |
Cathepsin S (CATS) P25774 | 0.78 (752.6 ± 16.1 | 1.38 10−6 | 1 | 0.008/63 | 9 | Cell cycle |
Matrix metallopeptidase 1 (MMP1) P03956 | 1.96 (1499.6 ± 93.7 | 5.49 10−5 | 1 | 0.001/202 | 13 | Extracellular matrix organization |
Matrix metallopeptidase 7 (MMP7) P09237 | 1.44 (913.1 ± 51.6 | 0.0011 | 1 | 0.001/213 | 13 | Extracellular matrix organization |
Family with sequence similarity 107 member B (F107B) Q9H098 | 0.84 (812.4 ± 21 | 5.54 10−4 | 1 | 0.0004/353 | — | — |
*Full name, protein symbol and UniProt ID are provided from the UniProtKB database (https://www.uniprot.org/); **Data are expressed in relative fluorescence units; ≠P value was calculated by the Mann-Whitney-Wilcoxon test; ‡Frequency of selection after the 168 adaptative LASSOs; ≈Information provided from the INCA network (for more details, see Supplementary Fig. S1, and Table S4); ≡Information provided by GO (Gene Ontology, http://geneontology.org/); -, no cluster and information available on the INCA network.
Figure 3Heat map visualization of the 6 identified and selected proteins. Columns represent the patients divided into 2 fixed groups (group 0: control; and group 1: case). Rows above the patients represent the 6 proteins that were gathered based on their expression profile. Cells are coloured based on the protein abundance. Red represents a high abundance, while blue indicates a low abundance. The coloured bars below the patients represent the different clinical parameters detailed in Table 1 with 0 = no and 1 = yes, except for sex (0 = women and 1 = men) or when values are indicated. ace_i: angiotensin-converting enzyme inhibitor.
Figure 4Plasma levels of the 6 proteins quantified by Somalogic and linked to the molecular INCA network. Quantification of complement C3b (a), MAPK5 (b), cathepsin S (c) MMP1 (d), MMP7 (e) and F107B (f) by SOMAscan assay (left panels) and closest edges from the INCA network (right panels). Data are expressed in arbitrary units (AU) corresponding to relative fluorescence units for the SOMAscan assay. Data are presented as box-and-whisker plots showing median (line) and min to max (whisker). Statistical significance was determined by the Wilcoxon-Mann-Whitney test. ****P < 0.0001. Visualization of the INCA molecular subnetworks centralized on these molecules (right panels) for their interactions with other proteins quantified in the plasma of INCA patients. The colour of the nodes represents the log2FC of the comparison between the 2 groups of patients who died of CV causes (case) or alive (control) after 3 years, with red corresponding to high log2FC and blue to low log2FC (for details, see Supplementary Table S5). The size of the nodes is related to the centrality calculated from the INCA network model.