| Literature DB >> 30062185 |
J Gustav Smith1,2,3.
Abstract
Heart failure (HF) is the end-stage of all heart disease and arguably constitutes the greatest unmet therapeutic need in cardiovascular medicine today. Classic epidemiological studies have established clinical risk factors for HF, but the cause remains poorly understood in many cases. Biochemical analyses of small case-control series and animal models have described a plethora of molecular characteristics of HF, but a single unifying pathogenic theory is lacking. Heart failure appears to result not only from cardiac overload or injury but also from a complex interplay among genetic, neurohormonal, metabolic, inflammatory, and other biochemical factors acting on the heart. Recent development of robust, high-throughput tools in molecular biology provides opportunity for deep molecular characterization of population-representative cohorts and HF cases (molecular epidemiology), including genome sequencing, profiling of myocardial gene expression and chromatin modifications, plasma composition of proteins and metabolites, and microbiomes. The integration of such detailed information holds promise for improving understanding of HF pathophysiology in humans, identification of therapeutic targets, and definition of disease subgroups beyond the current classification based on ejection fraction which may benefit from improved individual tailoring of therapy. Challenges include: 1) the need for large cohorts with deep, uniform phenotyping; 2) access to the relevant tissues, ideally with repeated sampling to capture dynamic processes; and 3) analytical issues related to integration and analysis of complex datasets. International research consortia have formed to address these challenges and combine datasets, and cohorts with up to 1 million participants are being collected. This paper describes the molecular epidemiology of HF and provides an overview of methods and tissue types and examples of published and ongoing efforts to systematically evaluate molecular determinants of HF in human populations.Entities:
Keywords: ATP, adenosine triphosphate; CAD, coronary artery disease; DCM, dilated cardiomyopathy; GWAS, genome-wide association study; HCM, hypertrophic cardiomyopathy; HF, heart failure; LVEF, left ventricular ejection fraction; MS, mass spectrometry; genetics; heart failure; metabolomics; molecular epidemiology; nMR, nuclear magnetic resonance; proteomics
Year: 2017 PMID: 30062185 PMCID: PMC6058947 DOI: 10.1016/j.jacbts.2017.07.010
Source DB: PubMed Journal: JACC Basic Transl Sci ISSN: 2452-302X
Central IllustrationTissues and Molecular Sets
Common signs of heart failure include enlargement of the heart (cardiomegaly) and fluid accumulation in the lungs (pulmonary edema), pleural cavity (pleural effusion), abdomen (ascites), legs (ankle edema), distended jugular veins, and liver (hepatomegaly). Locations for tissue sampling and sets of molecules for systematic studies of the molecular epidemiology of heart failure are indicated. Heart samples are obtained through endomyocardial biopsy (left section), from deceased donors, or donors undergoing heart transplantation. Plasma samples are obtained by centrifugation of blood from peripheral or central venipuncture. DNA for genome analysis can be isolated from any tissue, but most easily from leukocytes in blood samples. Microbial samples are obtained from urine, stool, genital, respiratory, skin, or oral swabs.
Glossary
| Epigenome | Chemical modifications of DNA or histones, such as methylation, that may influence gene expression and chromatin conformation |
| Genetic polymorphism | Genetic variant that is common in the general population, usually biallelic, limited to a single nucleotide, and defined as minor allele frequency ≥5% |
| Genome | Complete set of DNA in an organism, including all genes, that is present in all cells; the human genome consists of 3 billion base pairs |
| Genome-wide association study | Test for association of a genome-wide set of genetic variants with a phenotype, typically common variants (polymorphisms) |
| High-throughput sequencing | Methods to determine nucleotide sequence at high throughput based on multiple parallelized reactions; alternative to the classical, low-throughput Sanger sequencing |
| Linkage analysis | Method in genetics for discovery of chromosomal regions transmitted with disease in a family through genotyping of polymorphic sites distributed across the genome |
| lncRNA | Long, nonprotein-coding RNA transcripts with diverse and often unclear functions |
| Mendelian randomization | Observational method of using genetic variation to examine causal effect of a modifiable exposure on an outcome |
| Metabolome | A heterogeneous group of small molecules (metabolites), including amino acids, carbohydrates, lipids |
| Microarray | Technology for parallel testing of multiple analytes from mixture, based on a small glass or plastic slide to which multiple reagents are attached |
| Microbiome | Collection of all genomes in a microbial ecosystem |
| MicroRNA | Small non–protein coding RNA molecule (usually about 22 nucleotides) that functions in RNA silencing and post-transcriptional regulation of gene expression |
| Peripartum cardiomyopathy | A form of dilated cardiomyopathy with onset during the final month of pregnancy or in the 6 months after delivery |
| Proteome | Full set of proteins expressed in an organism |
| Sequencing depth | Number of times a given nucleotide has been read in a sequencing experiment, high depth facilitates distinguishing sequencing errors |
| Throughput | Number of samples and/or analytes undergoing analysis in a certain timeframe |
| Transcriptome | The full set of RNA transcripts in an organism |
Study Populations and Selected Findings
| General Community | Myocardial Injury | Intermediate HF Phenotypes | Symptomatic HF | Outcomes in HF | |
|---|---|---|---|---|---|
| Population-based, risk factors | Acute myocardial infarction or myocarditis | Myocardial remodeling | Symptom onset and diagnosis | Decompensation, arrhythmia, death, LVAD, or transplantation | |
| >300 independent SNPs for blood pressure, 113 for diabetes | 73 SNPs for CAD | >150 SNPs for systolic function, diastolic function, left ventricular mass, left ventricular diameter, QRS duration, QT duration and heart rate, 25 for AF, 7 for valvular disease | Sarcomere gene mutations (including MYBPC3 founder mutations, TTNtv), 8 SNPs for HF and cardiomyopathy | Mutations in LMNA, LAMP2, and MYH7; SNP at 5q22 (death), SNPs at 4q25 and 16q22 for AF, an SNP near hypocretin receptor-2 for myocardial recovery | |
| — | — | — | Sarcomere changes (α- to β-myosin switch, titin isoform switch), reduced calcium channel expression, increased NP expression, metabolic changes (reduced mitochondrial fatty acid and glucose oxidation, reduced phosphocreatine, increased ketone oxidation) | Same as for HF (death) | |
| BCAAs for diabetes | TMAO for CAD | — | Increased catecholamines, angiotensin II, NPs, ST2, galectin-3, endothelin, CRP, TNFα, ketone bodies, lactate, reduced iron | Same as for HF, TMAO (death) | |
| Strains that generate BCAAs for diabetes | Strains that generate TMAO for CAD | — | — | Strains that generate TMAO |
Overview of populations under study in the molecular epidemiology of heart failure, with a selection of key findings from different molecular sets. Some forms of HF are not preceded by myocardial injury or overload, such as monogenic cardiomyopathies, but likely influence intermediate phenotypes.
AF = atrial fibrillation; BCAAs = branched-chain amino acids; CAD = coronary artery disease; HF = heart failure; LVAD = left ventricular assist device; NPs = natriuretic peptides; SNP = single nucleotide polymorphism; TMAO = trimethylamine-N-oxide; TTNtv = titin-truncating variants.
Design Considerations in Prospective Molecular Epidemiology Studies of HF
| Study Design Feature | HF Implications | Cohort Examples |
|---|---|---|
| Sample size | Very large cohorts or combined cohorts needed due to disease heterogeneity | PMI (n = 1 million), UK Biobank (n = 500,000), consortia for genomics (CHARGE-HF, GENIUS-CHD, HERMES, MGC, TOPMed) and plasma profiles (HOMAGE, inHForm) |
| Phenotypic characterization | Heterogeneous phenotype: need for standardized definitions and cardiac imaging | ESC diagnostic algorithm in several cohorts, Framingham criteria |
| Tissue sampling | Multiple tissues involved, heart most important but difficult to obtain | Multi-tissue: GTEx, heart tissue: MAGNet |
| Follow-up | Repeat sampling warranted | Biannual examinations in Framingham Heart Study, annual sampling in inHForm project |
CHARGE-HF = Heart Failure working group of the Cohorts for Heart and Aging Research consortium; GENIUS-CHD = GENetIcs of sUbSequent Coronary Heart Disease; GTEx = Genotype-Tissue Expression project; HERMES = Heart Failure Molecular Epidemiology for Therapeutic Targets; HOMAGE = Heart Omics in AGEing consortium; inHForm = INtegrative omics of Heart Failure to infORM discovery of novel drug targets and clinical biomarkers; MAGNet = Myocardial Applied Genomics Network; MGC = Myocardial Genetics Consortium; PMI = Precision Medicine Initiative; TOPMed = National Heart Lung Blood Institute Trans-Omics for Precision Medicine Program; other abbreviations as in Table 2.
High-Throughput Molecular Assays in Molecular Epidemiology
| Molecular set | Method | Description |
|---|---|---|
| Genome | DNA microarrays | Genotyping of 100,000 to 1 million SNVs from which additional SNVs can be imputed for GWAS or exome-targeted analysis |
| Exome sequencing | Sequencing of the 30 million base pairs that encode proteins | |
| Whole-genome sequencing | Sequencing of the entire genome of an organism, typically at lower sequencing depth than exome sequencing | |
| Transcriptome | RNA microarrays | Gene expression profiling for all protein-coding genes and select noncoding transcripts |
| RNA sequencing | Sequencing of all RNA transcripts | |
| Proteome | Mass spectrometry | Method for unbiased detection of proteins in a mixture by mass and charge, usually coupled to liquid or gas chromatography or electrophoresis for sample separation |
| Affinity-based methods | Methods for protein detection based on protein isolation by an affinity reagent (e.g., antibody or aptamer) coupled to a reporter system for detection, can be robustly multiplexed for thousands of proteins with recent methods | |
| Metabolome | Mass spectrometry | Method for unbiased detection of metabolites in a mixture by mass and charge |
| nMR spectroscopy | Method for detection of soluble compounds in a mixture based on content of atoms of a certain spin, typically hydrogen | |
| Epigenome | ChIP | Method to identify histone modifications or transcription factor binding using antibodies, coupled to sequencing or microarrays to determine location |
| MeDIP | Method to identify DNA methylation using antibodies, coupled to sequencing or microarrays to determine location | |
| Bisulfite sequencing | Method to identify DNA methylation by bisulfite treatment and sequencing | |
| Hi-C | Method to determine chromatin conformation at high throughput | |
| Microbiome | 16S rRNA sequencing | Method to determine species of a prokaryotic organism by sequencing the rRNA gene |
ChIP = chromatin immunoprecipitation; MeDIP = methylation DNA immunoprecipitation; nMR = nuclear magnetic resonance; SNV = single-nucleotide variant.