Literature DB >> 30864351

Search for New Candidate Genes Involved in the Comorbidity of Asthma and Hypertension Based on Automatic Analysis of Scientific Literature.

Olga V Saik1, Pavel S Demenkov1, Timofey V Ivanisenko1, Elena Yu Bragina2, Maxim B Freidin2, Victor E Dosenko3, Olga I Zolotareva4, Evgeniy L Choynzonov5, Ralf Hofestaedt6, Vladimir A Ivanisenko1.   

Abstract

Comorbid states of diseases significantly complicate diagnosis and treatment. Molecular mechanisms of comorbid states of asthma and hypertension are still poorly understood. Prioritization is a way for identifying genes involved in complex phenotypic traits. Existing methods of prioritization consider genetic, expression and evolutionary data, molecular-genetic networks and other. In the case of molecular-genetic networks, as a rule, protein-protein interactions and KEGG networks are used. ANDSystem allows reconstructing associative gene networks, which include more than 20 types of interactions, including protein-protein interactions, expression regulation, transport, catalysis, etc. In this work, a set of genes has been prioritized to find genes potentially involved in asthma and hypertension comorbidity. The prioritization was carried out using well-known methods (ToppGene and Endeavor) and a cross-talk centrality criterion, calculated by analysis of associative gene networks from ANDSystem. The identified genes, including IL1A, CD40LG, STAT3, IL15, FAS, APP, TLR2, C3, IL13 and CXCL10, may be involved in the molecular mechanisms of comorbid asthma/hypertension. An analysis of the dynamics of the frequency of mentioning the most priority genes in scientific publications revealed that the top 100 priority genes are significantly enriched with genes with increased positive dynamics, which may be a positive sign for further studies of these genes.

Entities:  

Keywords:  ANDSystem; associative gene network; asthma; comorbid disease; dynamics of interest of genes in publications; gene prioritization; hypertension

Mesh:

Substances:

Year:  2018        PMID: 30864351      PMCID: PMC6348743          DOI: 10.1515/jib-2018-0054

Source DB:  PubMed          Journal:  J Integr Bioinform        ISSN: 1613-4516


Introduction

Asthma is a chronic inflammatory disease of the respiratory tract, the main characteristics of which are the hypersensitivity of the airways to various stimuli and reversible airflow obstruction [1]. More than 300 million patients worldwide are suffering from this disease [2]. Another widespread disease in the world is hypertension, which is characterized by a persistent increase in blood pressure from 140/90 mmHg. Art. and higher [3]. Annually about nine million people die due to complications associated with hypertension [4]. A number of studies have shown that asthma is related to an increase in the incidence of cardiovascular diseases [5], [6]. In this regard, the combination of asthma and arterial hypertension (comorbidity) in a large percentage of clinical cases is not surprising. Prevalence of hypertension in asthma patients is high with odds ratio = 1.66 [147], [188], p-value  <  0.00001 [7]. This may not be a simple coincidence, because standard asthma mechanisms have been implicated in key processes of initiation of hypertension. Good examples of this are the establishment of the role of arachidonic acid-leukotriene B4 production in spontaneously hypertensive rat [8] or the value of Th17 and IL17 in arterial hypertension [9]. Therapeutic control of allergic inflammation leads to better control of blood pressure [10]. The important role of the transcription factor STAT3, which is involved in signal transduction for a variety of cytokines and takes an active part in allergic inflammation [11], [12] and vascular remodeling [13], is well-known. On the basis on this information, it can be assumed that the imbalance between the pro-inflammatory and anti-inflammatory factors in the body creates the optimal situation for the realization of a hereditary predisposition to both asthma and arterial hypertension. In addition to the important role played by immune responses and control of inflammation in the pathogenesis of asthma and hypertension, it is also possible to assume the involvement of other mechanisms underlying the comorbidity of these diseases. For example, β-adrenoblockers and ACE inhibitors, which for a long time were contraindicated for patients with bronchial asthma due to possible bronchospasm, are widely used in the treatment of hypertension. Polymorphism of β-adrenoreceptor genes is associated with the risk of hypertension and bronchial asthma [14], [15]. It is assumed that mutations in the SLC26A4 gene can play an important role in the pathogenesis of bronchial asthma and hypertension, and therefore in the comorbidity of these diseases [16], [17], [18]. However, despite the accumulated data on the participation of specific genes simultaneously in asthma and hypertension, the mechanisms of interaction of these two diseases and the formation of their comorbid state are still poorly understood. Previously, we performed the prioritization of genes that are associated with asthma and hypertension simultaneously, to find the candidates which can potentially be involved in the mechanisms of comorbidity of these two diseases [19]. In the present work, a prioritization of an extended set of genes, which are associated either with asthma or with hypertension, was carried out. In addition, a new criterion for prioritization, based on the assessment of cross-talk centrality, previously used by us for the prioritization of neuronal apoptosis genes in the associative gene network of Parkinson’s disease [20], was introduced. When assessing the quality of prioritization, this criterion was of the best significance. Among the most significant genes were cytokines (CXCL10, CD40LG, IL1A, IL13 and IL15), toll-like receptor TLR2, death domain receptor FAS, transcription factor STAT3, complement factor C3 and amyloid beta precursor protein APP. An analysis of the overrepresentation of Gene Ontology biological processes for the top 100 highest-priority genes has shown the potential importance of the response to hypoxia, the inflammatory and immune responses and regulation of the apoptotic process in the mechanism of interaction of asthma and hypertension. In addition, we analyzed the dynamics of interest in genes by the frequency of their mention in scientific publications. It appeared that genes simultaneously associated with asthma and hypertension are enriched with genes that have a high positive dynamic of interest in them from researchers. This fact may indicate the involvement of these genes in many mechanisms of manifestation of various phenotypic traits that can determine the importance of their role in the development of comorbidity of diseases. An analysis of the most priority genes also showed significant enrichment with genes that have a high dynamic of interest in them. This may indicate their involvement in multiple phenotypic signs and, accordingly, in the comorbidity of asthma and hypertension. The genes identified as a result of prioritization can be used in the planning of experiments on genotyping and search for associations with the comorbid state of asthma and hypertension.

Methods

The sets of genes associated with asthma and hypertension were extracted from the ANDSystem knowledge base [21], [22]. The set of genes associated with asthma included 749 genes. The set of genes associated with hypertension included 706 genes. The number of genes simultaneously presented in the first and second sets was 202. A test set of genes, used for prioritization, was obtained by combining sets of asthma and hypertension genes, minus the 202 genes gained by the intersection of these two sets. The total number of genes in the test set was 1051. The over-represented Gene Ontology biological processes were identified using the DAVID 6.8 service [23] with the following parameters: the organism – "Homo sapiens", Gene_Ontology – "GOTERM_BP_DIRECT". The reconstruction of associative gene networks was carried out with ANDSystem tool [21], [22]. To prioritize the genes we used four criteria, which we previously discussed in the analysis of asthma, hypertension and Parkinson’s disease [19], [20]. Criteria 1 and 2 were based on the well-known methods of prioritization: ToppGene [24] (https://toppgene.cchmc.org/prioritization.jsp) and Endeavor [25] version 3.71 (https://endeavour.esat.kuleuven.be/), respectively. The ToppGene and Endeavor programs allow one to perform a ranking of a test set of genes by a training set of genes, according to specific criteria characterizing the proximity of genes from the test set to the genes from the training sample. The genetic information (co-localization in the genome), the functional properties of genes (participation in the same GO categories), etc., can be used by these programs as properties of the vertices of the graph of protein-protein networks. As a training set for each of these programs, a list of genes associated with asthma and hypertension, described above, was provided, while a list of 1051 genes, which is also described above, was used as a test set. For the ToppGene in the Training parameters section, all features were selected. The "Rank" indicator of the ToppGene program was used to rank the genes. In the case of the Endeavor system, all settings were set by default. The ranking of genes was carried out according to the "p-value" indicator of the Endeavor program. The lowest rank received genes with the lowest values of "p-value", and the largest rank – genes with the highest values of "p-value". Criterion 3 reflects the specificity of association of biological processes in which the analyzed gene participates with comorbidity mechanisms. Criterion 3 was calculated by formula N1/N2, where N1 is a number of the Gene Ontology biological processes, overrepresented for a set of genes associated with asthma and hypertension; N2 is a number of all Gene Ontology biological processes in which the analyzed gene was involved. Information indicating that the gene is involved in the Gene Ontology biological process was obtained from the AmiGO system [26]. Ranks for genes were determined by sorting the list of genes by descending the number of over-represented Gene Ontology biological processes. Thus, the lowest rank was attributed to the gene for which the number of the over-represented Gene Ontology biological processes was the highest. To calculate criterion 4, an associative gene network which describes the interactions between the analyzed gene/protein and the genes/proteins from the training set was used. The network was automatically reconstructed by ANDSystem. Criterion 4 was based on the use of the cross-talk centrality (CTC), calculated using the “Intelligent Filtration” function of the “ANDVisio” program. Within this criterion, the CTC value was calculated separately for vertices in the gene network corresponding to the gene and its product (protein). Thus, for the gene, two indices (CTCgene and CTCprotein) were determined. The CTCgene was calculated by the formula: where N – is a number of links of the i-th gene with members of the associative gene network of asthma/hypertension; M – is a number of vertices of the associative gene network of asthma/hypertension. CTCprotein for the i-th protein was determined the same way. The CTCgene and CTCprotein grades were determined by sorting the values in descending order, like in the case of criterion 3. In the case when several genes had the same value for the criterion under which the ranking was conducted, the rank of these genes was calculated the similar way as the Spearman rank correlation coefficient [27]. Such genes were assigned the same rank, equal to the average arithmetic ranks of these genes, according to their position, in the sorted list of genes. As the fifth criterion, the average value of the ranks obtained according to the criteria 1–4 was taken like it was done in [19]. For the reconstruction of the dynamics, the information on investigated genes from PubMed publications for the period from 1990 to 2017 were considered. For each year from this period, the number of publications in which the analyzed gene was mentioned was calculated. The marking of gene names in the texts of publications was carried out by using the ANDSystem. The trend reflecting the dynamics of the number of gene references in the articles was calculated using the Pearson correlation coefficient between the year and the total number of articles published this year in which the gene was mentioned. The enrichment of the gene set with genes with increased interest dynamics was calculated using the hypergeometric distribution realized in the hypergeom.sf function of the scipy.stats package of the Python programming language [28]. Genes with high dynamics of interest were genes for which the Pearson correlation coefficient exceeded 0.9.

Results and Discussion

To prioritize the genes for their potential simultaneous involvement in asthma and hypertension 749 genes of asthma and 706 genes of hypertension were considered. The training set of genes consisted of 202 genes, which were presented both in asthma and hypertension lists. Prioritization was carried out by using 4 criteria (see Online Appendix 1). The ranking of genes was carried out according to the value calculated as the mean for the ranks of the genes obtained using each of the 4 criteria. The top 10 of the most significant genes is listed in Table 1. Among them were cytokines (CXCL10, CD40LG, IL1A, IL13 and IL15), toll-like receptor TLR2, death domain receptor FAS, transcription factor STAT3, complement component C3 and amyloid beta precursor protein APP. It is interesting to note that all these genes are genes from the group of asthma. At the same time, the ratio of asthma genes to the genes of hypertension in the top 100 of the most significant genes appeared to be 2:1.
Table 1:

Top 10 of the most significant genes.

Gene symbolGeneIDCriterion 1Criterion 2Criterion 3Criterion 4 ACriterion 4 BAverage of criteria 1–4
ToppGeneEndeavourGO BPGene CTCProtein CTC
IL1A35521710280.0021142.4
CD40LG959372113.5023443.1
STAT36774504143.0018243.4
IL1536007212938.0013.536.557.8
FAS355351852.00177.52060.5
APP3512516263.0051965.6
TLR2709767584.001462266.6
C3718496784.0049.588.567.6
IL133596511033.50177.514.568.1
CXCL10362779286.0055101.568.3
Top 10 of the most significant genes. Figure 1 shows that the top 10 genes are relatively well connected to each other inside the associative gene network, reconstructed with ANDSystem (see Online Appendix 2). In particular, the transcription factor STAT3 regulates CXCL10 [29] and Fas [30]. In turn, TLR2, IL-13 and IL-15 regulate the activity of STAT3 [31], [32], [33].
Figure 1:

An associative gene network reconstructed with ANDSystem, illustrating the interactions of the top 10 of most significant genes.

An associative gene network reconstructed with ANDSystem, illustrating the interactions of the top 10 of most significant genes. For better understanding which biological processes can underlie the molecular mechanisms of interaction between asthma and hypertension, an analysis of the overrepresentation of Gene Ontology biological processes on two sets of data was conducted: the training set of genes and the top 100 of the highest priority genes from the table in Online Appendix 1 were analyzed. For the set of genes associated with asthma and hypertension simultaneously, there were found 69 significantly over-represented biological processes (p-value < 0.05 with FDR correction). For the top 100 of the most priority genes, there were 42 such processes. Twenty-two biological processes were over-represented simultaneously for these two sets of genes (see Online Appendix 3). For example, the response to hypoxia had the best relevance for the training set of genes (p-value < 10−12). In the second set of genes, the p-value for this process was <10−8, and the most significant process appeared to be the inflammatory response (p-value < 10−19). The intersection of over-represented processes was more than 50% of the total number of all overrepresented processes for a set of top 100 priority genes. Such a high value was partly related to the 3rd criterion of prioritization, which was based on considering the overrepresented biological processes. But since the ranking was performed by using the rank averaged over all 4 criteria, such intersection indirectly confirms the correctness of the prioritization. It should be noted that the manual analysis of the full-text articles of PubMed Central showed that for genes from the top 10 there was additional information that was not represented in ANDSystem that speaks in favor of their possible connection with hypertension. For example, for the IL1A gene, it is known that in Chinese Han population genetic variations in IL1A are associated with the risk of preeclampsia, which is characterized by hypertension and proteinuria [34]. In addition in Chinese children with type 1 diabetes the prevalence of hypertension in carriers of T allele in the IL1A gene was 4.2-fold higher than for C allele carriers with p-value < 0.001 [35]. The article [36] stated that the down-regulation of CD40LG might be related to kidney dysfunction in an early phase of hypertension, and [37] discussed that CD40LG was included in pulmonary hypertension gene subnetwork. Gene STAT3 regulate angiogenesis and vasoconstriction and is involved in the development of portal hypertension [38]. In patients with essential hypertension, IL-15 might be involved in the development of cardiovascular diseases [39]. For gene FAS it was shown that its expression was significantly increased in spontaneously hypertensive rats compared to Wistar-Kyoto rats [40]. In [41] APP was identified to undergo differential ubiquitination during pulmonary hypertension pathogenesis. Hypomethylation of the TLR2 gene increased the risk of essential hypertension in the Chinese population [42]. It was shown that C3 is highly expressed in mesenchymal tissues in spontaneously hypertensive rats (SHR) [43] and increased C3 induces salt-sensitive hypertension in SHR [44]. In [45] was discussed that IL-13 was involved in the induction of pulmonary hypertension. CXCL10 appeared to be involved in the pathogenesis of hypertension [46] and its level in serum was elevated in untreated essential hypertension [47]. One of the interesting characteristics of a gene may be the dynamics of increasing researcher’s interest in this gene, which may reflect the prospect of its study. It turned out that the genes from the training set, simultaneously associated with asthma and hypertension, are statistically significantly enriched with genes that have a high positive dynamic of their mention in the scientific publications of PubMed (p-value < 10−15). The total number of genes with a positive growing trend (r > 0.9) was 872 of the 17,868 human genes, presented in the ANDSystem. Among the 202 genes of the training set, it appeared to be 20% of such genes (42 genes). The HMOX1 gene, which encodes heme oxygenase 1, possessed the greatest positive dynamics of interest (r = 0.99). It is interesting to note that the top 100 of most priority genes contained 25% of genes possessing increased positive dynamics of interest. This enrichment was also statistically significant (p-value < 10−11). Analysis of the top 5 genes showed that among them STAT3 belongs to the group of 25% of genes possessing increased positive dynamics of interest (r = 0.96). The differences in the dynamics of interest for genes from the top 5 group are shown in Figure 2.
Figure 2:

The dynamics of researcher’s interest for the top 5 most priority genes, according to scientific publications of PubMed.

The dynamics of researcher’s interest for the top 5 most priority genes, according to scientific publications of PubMed. Gene STAT3 encodes a protein that in response to cytokines and growth factors is phosphorylated and translocated to the nucleus where it starts to act as a transcription activator [48]. STAT3 is involved in many cellular processes including cell growth and apoptosis [49].

Concluding Remarks

It is well known that many diseases are comorbid, which significantly complicates their treatment, and also prevents their proper diagnosis. The study of the molecular mechanisms of comorbid states is an urgent task of the modern biomedicine. Asthma and hypertension are examples of widespread comorbid diseases. In this study, we used prioritization methods, including existing well-known methods (ToppGene and Endeavor), and an approach (cross-talk centrality), based on the analysis of associative gene networks from ANDSystem, to find genes that could potentially be involved in asthma and hypertension simultaneously. Cytokines (CXCL10, CD40LG, IL1A, IL13 and IL15), toll-like receptor TLR2, death domain receptor FAS, transcription factor STAT3, complement factor C3 and amyloid beta precursor protein (APP), appeared to be the most significant. In addition, an analysis of the dynamics of the frequency of mentioning genes in scientific publications was carried out. It turned out that the top 100 of the priority genes are significantly enriched by genes with increased positive dynamics of researcher’s interest in them. This may indicate a promise for further researches on these genes. A further study of the predicted genes, simultaneously involved in asthma and hypertension, can help to illuminate the mechanisms of the comorbid state of these two diseases. Click here for additional data file.
  45 in total

1.  Characterization of IL-4 and IL-13 signals dependent on the human IL-13 receptor alpha chain 1: redundancy of requirement of tyrosine residue for STAT3 activation.

Authors:  R Umeshita-Suyama; R Sugimoto; M Akaiwa; K Arima; B Yu; M Wada; M Kuwano; K Nakajima; N Hamasaki; K Izuhara
Journal:  Int Immunol       Date:  2000-11       Impact factor: 4.823

2.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

3.  Cooperation between STAT3 and c-jun suppresses Fas transcription.

Authors:  V N Ivanov; A Bhoumik; M Krasilnikov; R Raz; L B Owen-Schaub; D Levy; C M Horvath; Z Ronai
Journal:  Mol Cell       Date:  2001-03       Impact factor: 17.970

4.  Treatment of allergic rhinitis can improve blood pressure control.

Authors:  E Magen; C Yosefy; R J Viskoper; J Mishal
Journal:  J Hum Hypertens       Date:  2006-09-07       Impact factor: 3.012

5.  Serum interleukin-15 concentration in patients with essential hypertension.

Authors:  Masaharu Kaibe; Mitsuru Ohishi; Norihisa Ito; Ming Yuan; Takashi Takagi; Minako Terai; Yuji Tatara; Norio Komai; Hiromi Rakugi; Toshio Ogihara
Journal:  Am J Hypertens       Date:  2005-08       Impact factor: 2.689

6.  Airway epithelial STAT3 is required for allergic inflammation in a murine model of asthma.

Authors:  Marina C Simeone-Penney; Mariano Severgnini; Powen Tu; Robert J Homer; Thomas J Mariani; Lauren Cohn; Amy R Simon
Journal:  J Immunol       Date:  2007-05-15       Impact factor: 5.422

7.  Complement 3 is involved in the synthetic phenotype and exaggerated growth of vascular smooth muscle cells from spontaneously hypertensive rats.

Authors:  Zhi-Hong Lin; Noboru Fukuda; Xue-Qing Jin; En-Hui Yao; Takahiro Ueno; Morito Endo; Satoshi Saito; Koichi Matsumoto; Hideo Mugishima
Journal:  Hypertension       Date:  2004-05-10       Impact factor: 10.190

8.  Polymorphisms in signal transducer and activator of transcription 3 and lung function in asthma.

Authors:  Augusto A Litonjua; Kelan G Tantisira; Stephen Lake; Ross Lazarus; Brent G Richter; Stacey Gabriel; Eric S Silverman; Scott T Weiss
Journal:  Respir Res       Date:  2005-06-03

9.  AmiGO: online access to ontology and annotation data.

Authors:  Seth Carbon; Amelia Ireland; Christopher J Mungall; ShengQiang Shu; Brad Marshall; Suzanna Lewis
Journal:  Bioinformatics       Date:  2008-11-25       Impact factor: 6.937

10.  Improved human disease candidate gene prioritization using mouse phenotype.

Authors:  Jing Chen; Huan Xu; Bruce J Aronow; Anil G Jegga
Journal:  BMC Bioinformatics       Date:  2007-10-16       Impact factor: 3.169

View more
  2 in total

1.  Comorbidity of asthma and hypertension may be mediated by shared genetic dysregulation and drug side effects.

Authors:  Olga Zolotareva; Olga V Saik; Cassandra Königs; Elena Yu Bragina; Irina A Goncharova; Maxim B Freidin; Victor E Dosenko; Vladimir A Ivanisenko; Ralf Hofestädt
Journal:  Sci Rep       Date:  2019-11-08       Impact factor: 4.379

Review 2.  A Survey of Gene Prioritization Tools for Mendelian and Complex Human Diseases.

Authors:  Olga Zolotareva; Maren Kleine
Journal:  J Integr Bioinform       Date:  2019-09-09
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.