| Literature DB >> 30062151 |
Kipp W Johnson1,2, Khader Shameer1,2, Benjamin S Glicksberg1,2, Ben Readhead1,2, Partho P Sengupta3,4, Johan L M Björkegren2,5, Jason C Kovacic3,4, Joel T Dudley1,2,4.
Abstract
The traditional paradigm of cardiovascular disease research derives insight from large-scale, broadly inclusive clinical studies of well-characterized pathologies. These insights are then put into practice according to standardized clinical guidelines. However, stagnation in the development of new cardiovascular therapies and variability in therapeutic response implies that this paradigm is insufficient for reducing the cardiovascular disease burden. In this state-of-the-art review, we examine 3 interconnected ideas we put forth as key concepts for enabling a transition to precision cardiology: 1) precision characterization of cardiovascular disease with machine learning methods; 2) the application of network models of disease to embrace disease complexity; and 3) using insights from the previous 2 ideas to enable pharmacology and polypharmacology systems for more precise drug-to-patient matching and patient-disease stratification. We conclude by exploring the challenges of applying a precision approach to cardiology, which arise from a deficit of the required resources and infrastructure, and emerging evidence for the clinical effectiveness of this nascent approach.Entities:
Keywords: CAD, coronary artery disease; EHR, electronic health record; GWAS, genome-wide association studies; HF, heart failure; cardiology; clinical informatics; multi-omics; precision medicine; translational bioinformatics
Year: 2017 PMID: 30062151 PMCID: PMC6034501 DOI: 10.1016/j.jacbts.2016.11.010
Source DB: PubMed Journal: JACC Basic Transl Sci ISSN: 2452-302X
Figure 1Genome and Phenome-Wide Associations of Coronary Atherosclerosis, a Fundamental Mechanism Driving Several Cardiovascular Diseases
(A) Circos plot representing genome-wide associations of coronary atherosclerosis with each section representing human chromosomes. Phenome-wide associations of cardiovascular disease variants across other disease categories are represented in different colors. (B) rs11209026, a coding variant (c.G>A: R381Q) localized on interleukin-23 receptor (IL-23R); and (C) rs3184504 coding variant (c.784T>C: W262R) localized to PH domain of the SH2B3 protein.
Central IllustrationMachine Learning-Driven Precision Cardiology
Multiple sources of heterogeneous data, including experimental evidence, bioinformatics databases, lifestyle measurements, electronic health records, environmental influences, and biobank findings, can be incorporated together using machine learning algorithms to identify causal disease networks, stratify patients, and ultimately predict more efficacious therapies.
Statistical Learning Approach to Precision Cardiology
| Type of Learning | Problem Tasks | Algorithms | Example Application in Cardiology | PMID |
|---|---|---|---|---|
| Supervised learning | ||||
| Regression | Ordinary least squares regression | Many | ||
| Classification | Logistic regression | Many | ||
| K nearest neighbors | Many | |||
| Predictive modeling | Lasso regression | Sex-dependent risk factors for myocardial infarction | ||
| Survival analysis | Ridge regression | Discovery of biomarkers associated with CAD prognosis | ||
| Elastic net regression | Discovery of biomarkers associated with CAD prognosis | |||
| Naïve Bayes | Classification of cardiovascular risk level | |||
| Support vector machines | Diagnosis of acute coronary syndrome | |||
| Information maximizing component analysis | Feature extraction of left ventricle shape changes following myocardial infarction | |||
| Bayesian networks | Meta-analysis of stroke prevention treatments; real-time prediction of cardiovascular events | |||
| Decision trees | Estimating risk in congenital heart surgery | |||
| Random forests | Predictive modeling of chemoreflex sensitivity; predictive modeling of pediatric dilated cardiomyopathy from miRNAs | |||
| AdaBoost classifiers | Myocardial perfusion analysis from CT imaging | |||
| Neural networks | Length of hospital stay prediction; automated detection of stent failure; pharmacokinetics of losartan; prediction of heart failure outcomes | |||
| Ensemble methods | All-cause mortality prediction | |||
| Unsupervised learning | ||||
| Dimensionality reduction | Hierarchical clustering | Many | ||
| Clustering | K means | Many | ||
| Principal components | Many | |||
| Self-organizing map neural network | Clustering ECG complexes | |||
| Linear discriminant analysis | Quantifying self-similarity of multimodal signals in the ICU; evaluation of atherosclerosis from multimodal imaging | |||
| Topological data analysis | Pulmonary embolism diagnosis; few other examples | |||
| Deep learning | Ultrasound image processing; causal phenotype discovery | |||
Table of selected statistical learning approaches with previous example applications in cardiology.
CAD = coronary artery disease; CT = computed tomography; ECG = electrocardiogram; ICU = intensive care unit; miRNA = microribonucleic acid; PMID = PubMed identifier number.
Figure 2Conceptualization of a Personalized Medicine Approach to Cardiology Contrasted With the Current Standard of Care
In the precision approach to cardiology, multi-omic information is incorporated to identify subtle strata of patients which can be differentially treated within the existing therapeutic space. ACE = angiotensin-converting enzyme.
Figure 3Comorbidity and Shared Genetic Architecture Between Hypertension and Coronary Artery Disease
(Top) Example of disease comorbidity networks for coronary artery disease (CAD) and hypertension (HTN), with comorbid diseases ascertained from Mount Sinai Hospital’s electronic health record data arranged around the central node. Distance from the central node is proportional to comorbidity odds ratio. We calculated comorbidity from ICD-9 codes using a logistic regression model controlling for age, sex, and self-reported ethnicity. Due to space limitations, we only show disease comorbidities with odds ratio ≥2. (Bottom) Networks of shared genetic architecture between CAD and HTN and other diseases, with shared genetic architecture defined as shared genome-wide association studies (GWAS) loci (gene level) between the 2 diseases. We compiled all data from GWASdb version 2 (August 2015) (159) and associated genes to a disease if they were GWAS threshold significant (p < 5 × 10−6) and conferred an increased risk. We calculated shared genetic architecture using a 1-sided Fisher exact test. Distance from central node is proportional to odds ratio. DNA = deoxyribonucleic acid.
Figure 4Hypertension Drug Repositioning Bipartite Network From RepurposeDB
Example of drugs originally developed or used for hypertension that have been repurposed for other indications.