Guo-Xing Wan1,2, Li-Hua Ji1,2, Wen-Bin Xia1,2, Lan Cheng1,2, Yong-Gang Zhang1,2. 1. Department of Cardiology, Second Affiliated Hospital Shantou University Medical College, Shantou, Guangdong 515041, P.R. China. 2. Cardiovascular Laboratory, Centre for Translational Medicine, Second Affiliated Hospital Shantou University Medical College, Shantou, Guangdong 515041, P.R. China.
Abstract
The care of individual patients requiring anthracyclines remains challenging as uncertainty persists on predictors of cardiotoxicity. The aim of the present study was to identify potential candidate blood indicators of doxorubicin-induced heart failure. The gene expression profiles of GSE40447 and GSE9128 microarray data were downloaded from the Gene Expression Omnibus database to identify differentially expressed genes (DEGs) using the R/Limma package or GEO2R. Functional and pathway enrichment analysis on DEGs were performed using DAVID database. The cardiovascular disease (CVD)-related DEGs were screen out based on the CardioGenBase database. The protein-protein interaction (PPI) network was constructed with STRING database and visualized by using Cytoscape. Then, the CVD-related DEGs were validated by intersection analysis with DEGs in GSE9128. The overlapping DEGs with a consistent expression pattern in GSE40447 and GSE9128 were identified as candidate indicators for doxorubicin-induced heart failure. A total of 516 DEGs potentially associated with doxorubicin-induced heart failure in GSE40447 were identified, which were mainly enriched in the gene ontology terms related to B cells, leukocytes, lymphocyte activation and B cell receptor signaling pathway. Of the DEGs, 42 were screened out as CVD-related DEGs by using CardioGenBase. Seven genes with high connectivity degree were presented in the PPI network. Finally, 5/6 CVD-related DEGs revealed by the intersection analysis were validated by GSE9128 and highlighted as candidate indicators of doxorubicin-induced heart failure: CD163, CD28, SLC25A20, ANPEP and TLR5. Several genes, including the 5 previously mentioned, were proposed as potential candidate blood indicators for doxorubicin-induced heart failure. Further experimental validations are greatly warranted for future clinical application.
The care of individual patients requiring anthracyclines remains challenging as uncertainty persists on predictors of cardiotoxicity. The aim of the present study was to identify potential candidate blood indicators of doxorubicin-induced heart failure. The gene expression profiles of GSE40447 and GSE9128 microarray data were downloaded from the Gene Expression Omnibus database to identify differentially expressed genes (DEGs) using the R/Limma package or GEO2R. Functional and pathway enrichment analysis on DEGs were performed using DAVID database. The cardiovascular disease (CVD)-related DEGs were screen out based on the CardioGenBase database. The protein-protein interaction (PPI) network was constructed with STRING database and visualized by using Cytoscape. Then, the CVD-related DEGs were validated by intersection analysis with DEGs in GSE9128. The overlapping DEGs with a consistent expression pattern in GSE40447 and GSE9128 were identified as candidate indicators for doxorubicin-induced heart failure. A total of 516 DEGs potentially associated with doxorubicin-induced heart failure in GSE40447 were identified, which were mainly enriched in the gene ontology terms related to B cells, leukocytes, lymphocyte activation and B cell receptor signaling pathway. Of the DEGs, 42 were screened out as CVD-related DEGs by using CardioGenBase. Seven genes with high connectivity degree were presented in the PPI network. Finally, 5/6 CVD-related DEGs revealed by the intersection analysis were validated by GSE9128 and highlighted as candidate indicators of doxorubicin-induced heart failure: CD163, CD28, SLC25A20, ANPEP and TLR5. Several genes, including the 5 previously mentioned, were proposed as potential candidate blood indicators for doxorubicin-induced heart failure. Further experimental validations are greatly warranted for future clinical application.
Cancer outcome is continually improved in recent years with the introduction of early detection, advances in the chemotherapy, radiotherapy, and targeted therapy strategies; and the implementation of multidisciplinary cancer care (1). In light of the prominent role of chemotherapy, anthracycline especially doxorubicin remains to be considered the footstone chemotherapy regimen of cancer treatment for a wide range of malignancies, particularly for breast cancer and lymphoma (2,3). Over decades from its discovery, the anti-tumor and cardiotoxic mechanisms of doxorubicin seem to continuously evoke considerable interest in basic science and clinical trial research. Despite the great achievement in the survival of cancerpatients, there is a serious alert nowadays that today's cancerpatients may be tomorrow's cardiac patients due to the increased awareness of the doxorubicin-induced cardiotoxicity (4,5).Paradoxically, apparently prolonged survival with doxorubicin is accompanid by increased risk of cardiac damage in cancerpatients, particularly in children, because the damage may not manifest for many years and sustains to be a life-long threat (6). Reportedly, the risk of cardiotoxicity varies according to the type and intensity of cancer treatment (7). A devastating cardiotoxic effect of doxorubicin is principally heart failure, with incidence rates from 0.14 to 48% (estimated risk ranges from 0.14 to 5% for doses >400 mg/m2, 7 to 26% for 550 mg/m2 and 18 to 48% for 700 mg/m2) (8). Clinically, in terms of the benefit as well as cardiotoxic effect with doxorubicin, the paradox could greatly undermine the decision making of clinicians with the lack of effective and convenient approaches to monitor and quantify the cardiotoxic effects.Although recent emphasis on the development of biomarkers indicative of doxorubicin-induced cardiac damage has resulted in the identification of several proposed biomarkers, such as troponins, myoglobin, actate dehydrogenase, creatine phosphokinase, C-redactive protein, brain-type natriuretic peptide (BNP), and pro-BNP (6,9–11), consistent findings focusing on concerns were not always presented. For instance, administration of doxorubicin was not always associated with elevated troponin level (12). Moreover, an increase in pro-BNP level has been found early after doxorubicin use, however, elevated pro-BNP level does not predict left ventricular dysfunction (13,14).Considering the uncertainty about the effectiveness of cardiac markers to quantify doxorubicin-induced cardiotoxicity, we therefore conducted a microarray-based study to systematically identify more potential candidate blood indicators for doxorubicin- induced heart failure via a bioinformatics analysis.
Materials and methods
Microarray data
The microarray expression profile dataset GSE40447 deposited by McCaffrey et al (15), was downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE40447) and was based on the platform of GPL16006[(HG-U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array]. A total of 15 blood samples from 5 women with doxorubicin chemotherapy-induced heart failure (HF) and 10 women with no HF but a history of doxorubicin chemotherapy, were used for the analysis. The microarray expression profile dataset GSE9128 including 12 blood samples from chronic heart failurepatients and 12 age- and sex-matched controls was used for validation analysis (16). The GSE9128 dataset deposited by Cappuzzello et al (16) was based on the platform of GPL96 [(HG-U133A) Affymetrix Human Genome U133A Array].
Data preprocessing and identification of differentially expressed genes (DEGs)
For GSE40447 data, the probe-level raw data (CEL format) were preprocessed by using a robust multiarray average algorithm with the Affy package in Bioconductor (17). Missing data were imputed by the KNN-based method and median data normalization was performed using robust multichip averaging for multiple probes corresponding to a common gene symbol (18,19). The DEGs following doxorubicin chemotherapy between HF and non-HF samples were identified by using the limma package in R/Bioconductor (20). For GSE9128 data, GEO2R (http://www.ncbi.nlm.nih.gov/geo/geo2r/), a R-based interactive web tool commonly used to compare two independent groups of samples for identifying DEGs with lack of raw CEL format data, was used to screen DEGs between HF and control samples. Similarly, the limma package involving empirical Bayes statistics was integrated for DEG analysis. |logFC|≥0.585 (|logFC|≥0.585 indicated that fold-change >1.5) and P-value <0.05 were considered as threshold values for DEGs in Limma and GEO2R.
Functional enrichment analysis
Gene Ontology (GO) functional enrichment including biological process and molecular function and Kyoto Encyclopedia of Gene and Genomes (KEGG) pathway enrichment analyses were performed for DEGs in GSE40447 by using the Database for Annotation, Visualization and Integrated Discovery (DAVID) to study differentially expressed clusters at the functional level (https://david.ncifcrf.gov/) (21). P<0.05 was considered as the cut-off criterion for the enrichment analysis.
Screening for cardiovascular disease (CVD)-related DEGs
CVD-related DEGs were identified by mapping to a Literature Based Multi-Omics Database for Major Cardiovascular Diseases: CardioGenBase (http://cardiogenbase.com/), which collected gene-disease association data from PubMed and MEDLINE and covered major CVDs such as cerebrovascular disease, coronary artery disease (CAD), hypertensive heart disease, inflammatory heart disease, ischemic heart disease and rheumatic heart disease (22). The CardioGenBase database documented ~1,500 CVD genes from ~2,4000 research articles.
Construction of protein-protein interaction (PPI) network
The DEGs in GSE40447 data were mapped to the overall PPIs by using the database Search Tool for the Retrieval of Interacting Genes (STRING, http://string-db.org/), which integrates a variety of predicted and experimentally validated interactions of proteins (23). Then, the primary PPI network was constructed with a threshold value of combined score >0.4 output by the STRING database and visualized by using Cytoscape (http://cytoscape.org/) (24).
Microarray-based validation for CVD-related DEGs
The CVD-related DEGs were validated by an intersection analysis with DEGs in the GSE9128 data. The overlap with consistent expression pattern of DEGs in GSE40447 and GSE9128 were identified as candidate indicators for doxorubicin- induced heart failure. Meanwhile, the expression files of candidate indicators in experiment and control groups in GSE9128 were obtained from the GEO database, of which the relative expression corresponded to the results of DEG analysis was graphically presented.
Results
Identification of DEGs in GSE40447 microarray data
The workflow of the proposed analysis was performed as Fig. 1. As a result shown in Fig. 2, a total of 516 DEGs in GSE40447 including 263 down-regulated genes and 253 up-regulated genes were identified in HF group following doxorubicin chemotherapy compared to non-HF group with a history of doxorubicin chemotherapy according to the established cut-off criterion. The heatmap for DEGs is shown in Fig. 3.
Figure 1.
Workflow of the proposed analysis.
Figure 2.
Intersection analysis between DEGs in GSE40447 microarray data and cardiovascular genes in CardioGenBase; the 42 overlapped genes were CVD-related genes. DEGs, differentially expressed genes; PPI, protein-protein interaction; GO, Gene Ontology; BP, biological process; MF, molecular function; CVD, cardiovascular disease.
Figure 3.
Heat maps of blood DEGs in patients with doxorubicin-induced heart failure compared to patients without heart failure but with a history of doxorubicin treatment. DEGs, differentially expressed genes.
GO and KEGG pathway enrichment analyses of DEGs in GSE40447 data
DAVID provided comprehensively functional annotation to understand the biological meaning behind the large list of genes. In the current study, GO enrichment analysis using DAVID was performed to explore DEGs functions from two aspects, including molecular function and biological process. As presented in Fig. 4, the results showed that the top10 overrepresented GO terms in biological processes were enriched in B-cell activation, leukocyte activation, cell activation, lymphocyte activation, regulation of transcription, transcription, negative regulation of B-cell proliferation, inflammatory response, regulation of lymphocyte proliferation and regulation of mononuclear cell proliferation. Additionally, the top10 enriched GO terms in molecular function were protein domain-specific binding, molecular adaptor activity, carbon-nitrogen ligase activity, with glutamine as an amido-N-donor, copper ion binding, phospholipid binding, SH3/SH2 adaptor activity, zinc ion binding, phosphoinositide binding, UDP-galactose: Glucosylceramide beta-1,4-galactosyltrans- ferase activity and transcription factor binding. On the other hand, we adopted KEGG enrichment analysis by clustering similar genes into a same network to understand the DEGs action mode. The B-cell receptor signaling pathway containing 6 DEGs (BLNK, CD79A, CD79B, CR2, PPP3R1, AKT1) indicated with red was enriched (Fig. 4).
Figure 4.
Functional enrichment analysis and KEGG pathway analysis of DEGs in GSE40447. (A) GO analysis of biological process. (B) GO analysis of molecular function. (C) KEGG pathway enrichment analysis. DEGs, differentially expressed genes.
Screening for CVD-related DEGs
Many DEGs generated may result from various responses to doxorubicin chemotherapy with multifarious etiologies but not just cardiac damage. In order to screen the CVD-related DEGs, the CardioGenBase database containing 1556 CVD-related genes was applied. Finally, the intersection analysis showed 42 DEGs associated with CVD (Fig. 2 and Table I). Of note, all 42 genes were literarily confirmed with a tight relevance to CVD in MEDLINE.
Table I.
CVD-related DEGs mapped in the CardioGenBase database.
The primary PPI network including all CVD-related DEGs was constructed using STRING database (Fig. 5) and was further constructed with a threshold value of a combined score >0.4 and visualized by Cytoscape. Overall, 20 nodes and 25 edges were mapped in the PPI network of identified DEGs, including 11 up-regulated genes and 9 down-regulated genes (Fig. 6). The 7 nodes with the higher degrees were screened as hub genes (more than 3 interactions for each hub gene), including AKT1, MPO, ANPEP, CD79A, CD40, IL3RA and TLR5.
Figure 5.
PPI of DEGs in GSE4047 constructed by STRING database. PPI, protein-protein interaction; DEGs, differentially expressed genes.
Figure 6.
PPI network of CVD-related DEGs constructed with STRING database and displayed with Cytoscape. Genes in orange were upregulated and those in blue were downregulated. PPI, protein-protein interaction; CVD, cardiovascular disease; DEGs, differentially expressed genes.
Validation for CVD-related DEGs by GSE9128
The study involving GSE9128 was designed to identify diagnostic/prognostic blood markers and gene expression profiles of chronic heart failure. Based on the established cut-off criterion for DEGs, 651 DEGs (330 down-regulated and 321 up-regulated) were identified in HF group compared to healthy control. To validate the CVD-related DEGs induced by doxorubicin treatment, an intersection analysis with DEGs in GSE9128 was performed, and 6 genes (CD163, CD28, SLC25A20, ANPEP, TLR5, CD40) was shown in the overlap (Fig. 7). However, 5 genes (CD163, CD28, SLC25A20, ANPEP, TLR5) were highlighted with a consistent expression pattern in GSE40447 and GSE9128, and were finally identified as potential candidate blood indicators for doxorubicin-induced heart failure (Table II). Corresponded to the detailed results of DEG analysis, the expression files of the candidate indicators obtained from GSE9128 in experiment and control groups were graphically presented. As shown in Fig. 8, 2.14, 0.52, 1.53, 1.74 and 1.68 fold-change transcriptional level for CD163, CD28, SLC25A20, ANPEP and TLR5 were observed compared to the non-HF healthy control, respectively.
Figure 7.
Intersection analysis between DEGs in GSE9128 data and CVD-related DEGs, proposing some potential indicators for heart failure. CVD, cardiovascular disease; DEGs, differentially expressed genes.
Table II.
Expression of CVD-related genes in GSE40447 and GSE9128 datasets.
Expression validation of CVD-related DEGs by GSE9128 compared to that in GSE40447. CVD, cardiovascular disease; DEGs, differentially expressed genes.
Discussion
Currently, bioinformatics analysis with high-throughput gene expression profiling has provided systematic insights into the mechanism of doxorubicin-induced cardiotoxic effects, related to the underlying gene activity changes and more importantly, has enabled the identification of targets for a diagnosis and prevention policy. In the present study, a total of 516 blood DEGs potentially related to doxorubicin-induced HF were initially identified with the GSE40447 microarray data. Functional enrichment analysis showed that these DEGs were mainly related to B -cell receptor signaling pathway. Of the DEGs, 42 were literarily evidenced as CVD-related genes. Importantly, the further validation analysis revealed that 5 CVD-related DEGs (CD163, CD28, SLC25A20, ANPEP, TLR5) may served as potential candidate blood indicators for doxorubicin-induced heart failure.The pathogenesis of doxorubicin-induced HF is a complex process driven by specific genetic alterations and susceptible drug toxicity. Hemoglobin scavenger receptor CD163 is a macrophage-specific protein and its up-regulation is one of the major changes in the macrophage switch to alternative activated phenotypes in inflammation (25). This upregulation may echo the activation of inflammation triggered by doxorubicin-related cardiac damage in that enhanced inflammation has been confirmed as a hallmark of cardiac injury (26). In support of this, several previous studies have revealed increased serum level of CD163 along with elevated inflammatory cytokines in patients with atrial fibrillation (27) and coronary heart disease (28). In addition, elevated serum CD163 level was also found in patients with chronic heart failure with reduced ejection fraction (29). Hence, the results of the current study are in line with previous findings, suggesting the potential role of increased CD163 expression in doxorubicin-induced cardiotoxicity. Moreover, the CD28 molecule is a type I transmembrane protein expressed on the surface of 80% of humanCD4+ T cells and 50% of humanCD8+ T cells (30), which is well known as one of the most important co-stimulatory receptors for T-lymphocyte activation and T-cell receptor signal transduction, thereby regulating the production of inflammatory cytokine/chemokines (31). Several separate studies have reported that CD28 might act as an accomplice but not always a defender by demonstrating the deleterious effect of CD4+CD28 null T-lymphocytes in patients with atrial fibrillation, chronic heart failure, coronary artery disease, atherosclerosis and other CVDs (32–34). Investigators have uncovered partial mechanisms responsible for the deleterious effect of activation of cytotoxic lymphocytes secreting pro-inflammatory cytokines to promote inflammation and the development of cardiovascular inflammatory diseases (35). Therefore, it is not difficult to speculate the detrimental role of CD28 down-regulation in doxorubicin-induced cardiotoxicity.The present study also suggested a role for Toll-like receptor 5 (TLR5) up-regulation in doxorubicin-induced cardiotoxicity. In the heart, previous research found that TLR5 activation may condition vascular dendritic cells (DCs) to support perivasculitic infiltrates possibly by increasing the suppressive activity of Treg cells and inducing the synthesis of immunosuppressive interleukins IL-10, IL-35, and transforming growth factor β (36,37). Supportively, Zhu et al (38) found that TLR5 polymorphism might be associated with rheumatic heart disease risk in a Chinese Han population. Platelet TLR5 was found associated with ratio of total cholesterol to high-density lipoprotein, thus contributing to cardiovascular risk (39). Therefore, it could not rule out the link between TLR5-involved vasculitic and doxorubicin-induced cardiotoxicity although the underlying mechanism remains uninvestigated.For the solute carrier family 25 member 20 (SLC25A20 or CACT), a key carnitine-acylcarnitine translocase exchanging for free carnitine across the mitochondrial membrane in mitochondrial beta-oxidation, little information has been acknowledged on its pathophysiologic mechanisms in CVD except for several case reports about the CACT-deficiency in cardiomyopathy, arrythmias and cardiogenic shock (40–42). However, the current study found for the first time a markedly up-regulation of SLC25A20 transcriptional level in doxorubicin-induced HF, which was further validated with a 1.74-fold change in a HF cohort compared to the healthy control. Nevertheless, the underlying mechanism remains unknown and that whether this is a compensatory upregulation to prevent injury needs to be studied.The present study also suggested a potential role of upregulated alanyl aminopeptidase (ANPEP, APN or CD13) in doxorubicin-induced heart failure. ANPEP is a conserved type II integral membrane zinc-dependent metalloprotease in the M1 family of ectoenzymes (43), widely expressed on all myeloid cells, activated endothelial cells and epithelium of the kidney and intestine (44). Evolving evidence supported a role for ANPEP in controlling arterial blood pressure and the pathogenesis of hypertension by inhibiting renal tubule Na flux and modulating salt-adaptation (43,45). Interestingly, circulating ANPEP was found to be up-regulated in patients with essential hypertension and in the angiogenic vessels in the infarct area and border zone after myocardial infarction (44,46). Further exploration revealed that ANPEP is essential for promoting optimal post-infarction healing for the ischaemic heart via compensatory mechanisms including the preparation and sustainment of the reparative response to relieve potential angiogenic defects (44). Thus, it is biologically reasonable to speculate that the increase of ANPEP expression may be compensatory, which might be predictive for doxorubicin-induced cardiotoxicity.Several limitations for the study should be acknowledged. The underlying molecular mechanism of doxorubicin-induced HF is not completely consistent with non-pharmaceutical HF. Some potential more important indicators specified for doxorubicin-induced HF may be concealed by the validation with the study GSE9128 designed for the identification of non-pharmaceutical HF biomarkers. For instance, the present study showed that DEGs in GSE40447 were mainly related to B cell receptor signaling pathway in KEGG enrichment analysis, whereas the focused indicators were not implicated, which may suggest the specificity of some DEGs that needed to be experimentally confirmed. Moreover, the potential blood indicators were initially proposed by the intersection between GSE40447 and CardioGenBase, which may exclude other important predictors of doxorubicin-induced heart injury that has not been documented before in CardioGenBase, and therefore may result in limited ability for diagnosis of doxorubicin-induced HF. Nevertheless, the highlighted indicators are biologically reasonable and reliable because of the identical molecular basis of heart failure and a systematic and scientific study approach in the study. Moreover, in light of applying predicament of the current proposed biomarkers in the clinic, these indicators may contribute to the diagnosis and prevention of doxorubicin-induced HF and the decision-making for clinicians when combined with several routine markers for cardiac injury such as troponins and pro-BNP. Definitely, the aforementioned results were based on microarray data with limited sample size and a lack of experimental verification, which was also a limitation of the study and required to be addressed by future studies.In conclusion, the present study aimed to identify potential candidate blood biomarkers to predict doxorubicin-induced heart failure with comprehensive bioinformatics analysis. A set of significantly altered genes were identified and five (CD163, CD28, SLC25A20, ANPEP, TLR5) were highlighted by bioinformatics validation. Our results suggest that data mining and integration could be a useful tool to predict doxorubicin-induced heart failure. Despite the absence of experimental validation, this study still has significance for further study, providing potential new key genes for further experimental validation, and shedding light on the molecular mechanisms of doxorubicin-induced cardiotoxicity.
Authors: Giuseppe Curigliano; Daniela Cardinale; Susan Dent; Carmen Criscitiello; Olexiy Aseyev; Daniel Lenihan; Carlo Maria Cipolla Journal: CA Cancer J Clin Date: 2016-02-26 Impact factor: 508.702
Authors: Ingrid E Dumitriu; Ernesto Trallero Araguás; Christina Baboonian; Juan Carlos Kaski Journal: Cardiovasc Res Date: 2008-09-25 Impact factor: 10.787
Authors: Nazanin Fallah-Rad; Jonathan R Walker; Anthony Wassef; Matthew Lytwyn; Sheena Bohonis; Tielan Fang; Ganhong Tian; Iain D C Kirkpatrick; Pawan K Singal; Marianne Krahn; Debjani Grenier; Davinder S Jassal Journal: J Am Coll Cardiol Date: 2011-05-31 Impact factor: 24.094
Authors: Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering Journal: Nucleic Acids Res Date: 2014-10-28 Impact factor: 16.971
Authors: Konstantin Byrgazov; Claes Anderson; Benjamin Salzer; Eva Bozsaky; Rolf Larsson; Joachim Gullbo; Manfred Lehner; Fredrik Lehmann; Ana Slipicevic; Leo Kager; Mårten Fryknäs; Sabine Taschner-Mandl Journal: Ther Adv Med Oncol Date: 2020-07-29 Impact factor: 5.485