| Literature DB >> 35767073 |
Frank R Heinzel1,2, Sanjiv J Shah3.
Abstract
Heart failure (HF) with preserved ejection fraction (HFpEF) is a multi-organ, systemic syndrome that involves multiple cardiac and extracardiac pathophysiologic abnormalities. Because HFpEF is a heterogeneous syndrome and resistant to a "one-size-fits-all" approach it has proven to be very difficult to treat. For this reason, several research groups have been working on methods for classifying HFpEF and testing targeted therapeutics for the HFpEF subtypes identified. Apart from conventional classification strategies based on comorbidity, etiology, left ventricular remodeling, and hemodynamic subtypes, researchers have been combining deep phenotyping with innovative analytical strategies (e.g., machine learning) to classify HFpEF into therapeutically homogeneous subtypes over the past few years. Despite the growing excitement for such approaches, there are several potential pitfalls to their use, and there is a pressing need to follow up on data-driven HFpEF subtypes in order to determine their underlying mechanisms and molecular basis. Here we provide a framework for understanding the phenotype-based approach to HFpEF by reviewing (1) the historical context of HFpEF; (2) the current HFpEF paradigm of comorbidity-induced inflammation and endothelial dysfunction; (3) various methods of sub-phenotyping HFpEF; (4) comorbidity-based classification and treatment of HFpEF; (5) machine learning approaches to classifying HFpEF; (6) examples from HFpEF clinical trials; and (7) the future of phenomapping (machine learning and other advanced analytics) for the classification of HFpEF.Entities:
Keywords: Cardiovascular disease; Classification; Machine learning; Phenomapping; Phenotype
Mesh:
Year: 2022 PMID: 35767073 PMCID: PMC9244058 DOI: 10.1007/s00059-022-05124-8
Source DB: PubMed Journal: Herz ISSN: 0340-9937 Impact factor: 1.740
Classification schemes for heart failure with preserved ejection fraction (HFpEF). Reproduced with permission from [82]
| Classification scheme | Categories of HFpEF | Description |
|---|---|---|
| Clinical classification | “Garden-variety” HFpEF | HTN, diabetes, obesity, and/or chronic kidney disease |
| CAD-HFpEF | Typically, multivessel CAD with prior coronary revascularization | |
| Right heart failure-HFpEF | Predominant right-sided HF with or without pulmonary HTN | |
| Atrial fibrillation-predominant HFpEF | Atrial arrhythmias dominate the clinical presentation | |
| HCM-like HFpEF | These patients do not have genetic forms of HCM, but their clinical course and echocardiographic features are typical of HCM | |
| High-output HFpEF | Typically, due to liver disease, severe anemia | |
| Valvular HFpEF | Multiple moderate valvular lesions | |
| Rare causes of HFpEF | For example, infiltrative cardiomyopathies, cardiotoxicities, genetic cardiomyopathies | |
| Presentation phenotypes | Exercise-induced increase in LA pressure | These patients typically are very breathless with exertion but do not have overt signs of volume overload and typically do not have a history of HF hospitalization |
| Volume overload | Signs and symptoms of volume overload; typically have a history of HF hospitalization | |
| RV failure + pulmonary HTN | Right heart failure predominates the clinical picture; often pulmonary HTN is present and systemic blood pressure is reduced | |
| Myocardial phenotypes | Type 1: HCM | Typical genetic forms of HCM |
| Type 2: Infiltrative | Cardiac amyloidosis and other forms of infiltrative or restrictive cardiomyopathies | |
| Type 3: Non-HTN, non-LVH | No history of HTN and LV wall thickness < 1.2 cm | |
| Type 4: HTN | Typical, “garden-variety” form of HFpEF with history of HTN | |
| Latent class analysis | A: Younger males with CAD, lower LVEF | Based on latent class analysis of the I‑PRESERVE AND CHARM-Preserved trials. The authors used latent class analysis of 11 clinical features (age, gender, BMI, atrial fibrillation, CAD, diabetes, hyperlipidemia, valvular disease, alcohol use, eGFR, and hematocrit) to find 6 distinct groups of HFpEF in I‑PRESERVE and validated these findings in CHARM-Preserved |
| B: Younger females with lowest NT-proBNP | ||
| C: Obesity, hyperlipidemia, diabetes mellitus, anemia, and renal insufficiency | ||
| D: Obese females | ||
| E: Older males with CAD, lowest LVEF | ||
| F: female predominance, advanced age, lower BMI, atrial fibrillation, CKD, highest NT-proBNP | ||
| Phenomapping | Pheno-group 1: BNP deficiency syndrome | Model-based clustering of 67 continuous variables (phenotypes): physical characteristics, vital signs, ECG data, laboratory data, and echocardiographic parameters |
| Pheno-group 2: Cardiometabolic phenotype | ||
| Pheno-group 3: RV failure + cardiorenal phenotype |
CAD coronary artery disease, HCM hypertrophic cardiomyopathy, HF heart failure, RV right ventricular, HTN hypertension, LVH left ventricular hypertrophy, LVEF left ventricular ejection fraction, BMI body mass index, CKD chronic kidney disease, NT-proBNP N-terminal pro-B-type natriuretic peptide
Fig. 1Left atrial myopathy phenotype of heart failure with preserved ejection fraction (HFpEF). Top panel: Scatterplot of left atrial (LA) vs. left ventricular (LV) longitudinal strain showing a correlation between the two phenotypes and deviation from the correlation representing different LA/LV phenotypes of HFpEF, including the “LA predominant” (i.e., LA myopathy) phenotype, defined as LA reservoir strain lower than expected for any given value of LV longitudinal strain. Middle panel: Volcano plot of proteins associated with LA myopathy. Proteins in red on the right represent those in which higher levels are significantly associated with increased LA myopathy, and on the left represent those in which higher levels are significantly associated with reduced LA myopathy. Bottom panel: Venn diagram showing that while three proteins overlap between LA myopathy and atrial fibrillation, several others appear to be specific for the disproportionate LA myopathy phenotype (proteins identified using a proteomic analysis in the PROMIS-HFpEF study with validation in the Northwestern University HFpEF cohort). Reproduced with permission from Patel RB et al. [44]
Fig. 2Potential role of plasminogen activator inhibitor‑1 (PAI‑1) as a molecular mechanism underlying heart failure with preserved ejection fraction (HFpEF). Both comorbidity-induced inflammation (via reactive oxygen species [ROS]-induced cellular stress) and visceral adipocytes (common in HFpEF) induce upregulation of PAI‑1 in the circulation. PAI‑1 and related proteins (e.g., insulin-like growth factor binding protein‑3 [IGFBP3]) result in cellular senescence and accelerated aging, which may be important risk factors for the development of HFpEF
Fig. 3Phenomapping of heart failure with preserved ejection fraction (HFpEF). Left panel: hierarchical clustering heatmap of 397 patients with HFpEF (columns) and 67 continuous variables (features, rows) demonstrating the heterogeneity of HFpEF. Red indicates increased and blue indicates decreased values. Middle panel: principal components analysis showing the clear differentiation of the three identified pheno-groups. Right panel: Kaplan–Meier curves for survival free of cardiovascular hospitalization or death among the three pheno-groups. PC principal component, CV cardiovascular. Reproduced with permission from Shah SJ et al. [7]
Fig. 4Protein clusters identified by weighted coexpression network analyses in the PROMIS-HFpEF derivation cohort and the Northwestern University validation cohort. The inflammation cluster (turquoise) mediated the association between comorbidity burden and markers of elevated left atrial pressure in heart failure with preserved ejection fraction (HFpEF) and differentiated HFpEF from controls in an external cohort. a Adjacency network map of circulating proteins color-coded by cluster assignment by hierarchical clustering-based nearness or coexpression of proteins. For clarity of presentation, only nodes (proteins) that were assigned to a cluster are shown (n = 159/248); the remaining proteins lie on the outer edges of the network map. b Overrepresented, nonredundant pathways in each cluster with false discovery rate corrected p values. c Detailed network maps of proteins in the three clusters that were representative of inflammation (i.e., overrepresentation of ≥2 inflammatory pathways). Node size reflects intracluster connectivity (i.e., the sum of weighted edges [correlations] with all other proteins in the cluster). Node color density reflects the strength of cluster membership. Edge thickness and transparency reflect the adjacency of proteins according to weighted coexpressions. d Adjacency network map of circulating proteins in the Northwestern patients with HFpEF in the validation cohort. Clusters with most significant overlap were assigned the same color as the corresponding cluster in the PROMIS-HFpEF cohort. e Adjacency network map of circulating proteins in the Northwestern control patients in the validation cohort. Cluster preservation was tested against the Northwestern patients with HFpEF; clusters with significant overlap were assigned the same color as the corresponding cluster in the Northwestern patients with HFpEF. FDR false discovery rate, PROMIS-HFpEF Prevalence of Microvascular Dysfunction in Heart Failure With Preserved Ejection Fraction, WCNA weight coexpression network analysis. Reproduced with permission from Sanders-van Wijk S et al. [9]
Examples of going beyond initial HFpEF phenomapping studies: ongoing follow-up studies of the original three HFpEF pheno-groups identified by Shah et al. [7]
| Pheno-group | Follow-up studies |
|---|---|
| 1: Morbid obesity with the BNP deficiency syndrome | STEP-HFpEF: RCT of a GLP1 receptor agonist in obese HFpEF patients |
| 2: Extreme cardiometabolic syndrome | PROMIS-HFpEF: evaluation of impaired coronary flow reserved in HFpEF, and proteomic analysis of comorbidity-inflammation paradigm in HFpEF |
| 3: RV cardio-abdomino-renal syndrome | VICTORY: observational study of exfoliated colonocytes (extracted from stool samples) in hospitalized HFpEF vs. non-HF hospitalized controls to determine whether colonocyte NHE3 (by flow cytometry) is upregulated in HFpEF and whether magnitude of NHE3 upregulation correlates with RV enlargement/dysfunction |
BNP B-type natriuretic peptide, RCT randomized controlled trial, GLP1 glucagon-like peptide 1, HFpEF heart failure with preserved ejection fraction, NHE3 sodium-hydrogen exchanger‑3, RV right ventricular
Fig. 5Main heart failure with preserved ejection fraction sub-phenotypes identified across phenomapping studies. NP natriuretic peptide, CV cardiovascular, T2DM type 2 diabetes mellitus, LV left ventricular, CVP central venous pressure, PAH pulmonary arterial hypertension. Reproduced with permission from Galli E et al. [77]
Key steps in the derivation and validation of HFpEF subtypes using unsupervised machine learning (phenomapping) analyses
| Step | Details |
|---|---|
| 1 | Identify a rich dataset for training (derivation) and a separate, similar dataset for external validation |
| 2 | Determine which variables (features) to include in the ML models, evaluate for missingness, perform imputation |
| 3 | Data reduction techniques (e.g., PCA) for high-dimensionality data and to account for redundancy |
| 4 | Split training dataset into training and test subsets |
| 5 | Evaluate a variety of unsupervised ML techniques (or ensemble methods) on the training dataset |
| 6 | Cross-validate in the internal test dataset, perform regularization to prevent overfitting |
| 7 | Determine optimal (most parsimonious) number of clusters (subtypes) using multiple methods |
| 8 | Deploy the model to assign the most probable clusters in the external validation dataset |
| 9 | Compare clinical characteristics of pheno-groups (clusters): derivation vs. validation datasets |
| 10 | Create a simplified regression model to assign pheno-groups in further validation datasets |
| 11 | Follow-up studies to probe disease mechanisms in identified pheno-groups (disease subtypes) |
HFpEF heart failure with preserved ejection fraction, ML machine learning, PCA principal components analysis
HFpEF phenomapping analyses: possible problems and potential solutions
| Problem | Explanation | Solution |
|---|---|---|
| Lack of external validation | ML models often perform well in the derivation dataset because they are designed to perform well when given a lot of data (features) that are not correlated. However, they may not perform well in an external validation dataset | Always include a validation dataset (preferably external to the derivation dataset) to validate ML models, and plan for this from the design phase of the study |
| Publication bias | Many ML and -omics studies cannot be explained biologically, or may not validate, both of which create publication biases (results that do not fit known paradigms tend not to be published), limiting clinical applications and future studies | Cloud platforms for data sharing should be implemented which will reduce publication bias and allow future studies to validate or refute our analyses and find meaning in unexplained results. In addition, independent investigators would be able to reanalyze data as new statistical and bioinformatics approaches are developed |
| Single timepoint measurements | The development of HFpEF likely requires multiple consecutive triggers, creating a dynamically evolving phenotype. Even after clinically overt HFpEF emerges, the underlying molecular phenotype(s) are further evolving with time with disease progression, which will change the circulating proteome/metabolome. Even in healthy individuals, changes in multi-omics over a relatively short time can be strikingly variable. Single timepoint omics data and ML analyses alone will not be able to determine which signals are reactive or causal | When possible, investigators should leverage -omics and other high-dimensional (e.g., imaging) data from serial timepoints, and they should validate any identified signals in multiple cohorts and in orthogonal study types (e.g., transcriptomic analyses, animal studies). Mechanistic experiments on tissues or patient-derived cell lines may also address these challenges |
| Cohort-driven and feature-driven biases of ML analyses | It is well known that ML studies often suffer from lack of external validation. However, less well known is that the features included in the ML model often drive the identified clusters (subtypes) | Compare unbiased vs. biased selection of features for incorporation in ML models |
| True pathobiological HFpEF subtypes vs. disease progression HFpEF subtypes | In previous unsupervised ML analyses of HFpEF that sought to identify different HFpEF pathobiological subtypes, often subtypes that represent different stages of HFpEF progression (disease severity) are instead identified | Identification of HFpEF subtypes that reflect disease severity/progression, can still be used to identify and stratify treatment targets, which would be clinically relevant. Upon identification of HFpEF subtypes, investigators should use multivariable analyses to determine whether subtypes are independent of markers of disease severity. Investigators can also use input features that are markers of disease severity prior to inclusion in ML models |
HFpEF heart failure with preserved ejection fraction, ML machine learning
Fig. 6Beneficial effects of interatrial shunt device therapy in patients with heart failure with preserved ejection fraction (HFpEF) with peak exercise pulmonary vascular resistance (PVR) < 1.74 WU enrolled in the REDUCE LAP-HF II trial. Top panel: In the sham control group, there was no association between peak exercise PVR and change in health status (Kansas City Cardiomyopathy Questionnaire overall summary score [KCCQ-OSS]) from baseline to 12 months (all patients improved approximately 10 points). However, in the atrial shunt device-treated patients, those with peak exercise PVR < 1.74 WU improved to a greater extent with the device compared with sham, whereas those with peak exercise PVR ≥ 1.74 WU did worse with the device compared with sham. Middle panel: Only HFpEF patients with peak PVR < 1.74 WU experienced a significant improvement in NYHA functional class in the trial. Bottom panel: Patients without latent pulmonary vascular disease (PVD; i.e., patients with peak exercise PVR < 1.74 WU) and no pacemaker at baseline had the highest win ratio, lowest HF event rate, and greatest improvement in health status in response to the atrial shunt device (compared with sham control). KCCQ-OSS Kansas City Cardiomyopathy Questionnaire, WU Wood units, NYHA New York Heart Association, LA left atrial. Reproduced with permission from Borlaug BA et al. [83]