Literature DB >> 26306270

Disease Comorbidity Network Guides the Detection of Molecular Evidence for the Link Between Colorectal Cancer and Obesity.

Yang Chen1, Li Li2, Rong Xu3.   

Abstract

Epidemiological studies suggested that obesity increases the risk of colorectal cancer (CRC). The genetic connection between CRC and obesity is multifactorial and inconclusive. In this study, we hypothesize that the study of shared comorbid diseases between CRC and obesity can offer unique insights into common genetic basis of these two diseases. We constructed a comorbidity network based on mining health data for millions of patients. We developed a novel approach and extracted the diseases that play critical roles in connecting obesity and CRC in the comorbidity network. Our approach was able to prioritize metabolic syndrome and diabetes, which are known to be associated with obesity and CRC through insulin resistance pathways. Interestingly, we found that osteoporosis was highly associated with the connection between obesity and CRC. Through gene expression meta-analysis, we identified novel genes shared among CRC, obesity and osteoporosis. Literature evidences support that these genes may contribute in explaining the genetic overlaps between obesity and CRC.

Entities:  

Keywords:  association rule mining; colorectal cancer; comorbidity network; gene expression; obesity; osteoporosis

Year:  2015        PMID: 26306270      PMCID: PMC4525229     

Source DB:  PubMed          Journal:  AMIA Jt Summits Transl Sci Proc


I. INTRODUCTION

Comorbidity studies often detect unexpected disease links [1] and offer novel insights into the genetic mechanisms of diseases [2, 3]. A number of epidemiological studies suggest that obesity increases the risk of colorectal cancer (CRC) [4-6]. Based on these evidences of co-occurrence, many genetic factors have been proposed to explain the role of obesity in the development of CRC. For example, both animal and human studies have demonstrated that the increased release of insulin and reduced insulin signaling play roles in obesity and colorectal carcinogenesis [7-9]. Experiments also show that obesity leads to altered level of adipocytokines, such as Adiponectin [10-12] and leptin [13, 14], which may either prevent or foster carcinogenesis. The mechanism for the association between obesity and CRC is multifactorial and inconclusive [6, 15, 16]. Shared comorbidities between obesity and CRC can provide unique insights into the common genetic basis for the two diseases. For example, type 2 diabetes is highly correlated with obesity and was identified as a risk factor for CRC [17]. A few studies then discovered that genetic factors of insulin resistance, which occur in type 2 diabetes, contribute in explaining the role of obesity in CRC [18]. However, both obesity and CRC are heterogeneous conditions. Over 40% of the obese population is not characterized by the presence of insulin resistance [19]. We hypothesize that systems approaches to studying the diseases that are phenotypically-significant to both CRC and obesity may offer new insights into the common molecular mechanisms between the two interconnected diseases. Systematic comorbidity studies have been conducted previously, but mostly focused on pairwise comorbidities and their genetic overlaps. Rhetsky et al. developed a statistical model to estimate the co-occurrence relationship for each pair of 160 diseases [20], and demonstrated that comorbidities are genetically linked. Park et al. [21] and Hidalgo et al. [22] detected the comorbidities pairs from the Medicare claims (which only contain senior patients ages 65 or older) with statistical measures. Roque et al. mined pairwise disease correlations using similar measures from medical records of a psychiatric hospital [23]. Recently, we extracted comorbidity patterns from a publically accessible database, which contains disease records for millions of patients at all ages, using an association rule mining approach [24, 25]. In this study, we constructed a disease comorbidity network based on our previous work. We developed a novel approach to detect diseases that have strong connections with both obesity and CRC in the comorbidity network. Specifically, we extracted the local network consisting of all the paths between obesity and CRC, and prioritized the nodes (diseases) that play critical roles in maintaining the connection between the two diseases (Fig. 1). Substantial literature evidences can support that the top ranked diseases have associations with both obesity and CRC. We investigated the gene expression profiles of a prioritized comorbid disease to facilitate detecting novel genetic basis underlying the link between obesity and CRC. Our approach is generalizable to study the genetic basis for other disease associations.
Fig. 1.

Approach to detect the diseases that have strong connections with both obesity and CRC in the comorbidity network. Nodes D1, D2 and D3 were prioritized because they play important roles in maintaining the network structure and the connection

II. MATERIALS AND METHODS

Fig. 2 shows the three steps of our approach. We first mined disease comorbidity relationships from large amounts of patient records and constructed a disease comorbidity network. We then extracted the local comorbidity cluster for obesity and CRC and prioritize the candidate comorbidity that plays a critical role in connecting the two diseases. Finally we conducted gene expression meta-analysis to identify common genes shared by obesity, CRC and the prioritized comorbidity.
Fig. 2.

Our approach contains three steps: (1) We constructed a comorbidity network based on data mining; (2) we extracted the local network that contains paths from obesity to CRC, and analyzed the local network to pin point the strong comorbidity for both obesity and CRC; (3) we conducted gene expression meta-analysis to identify common genes shared among obesity, CRC and the comorbidity.

A. Construct Disease Comorbidity Network

We mined disease comorbidity relationships from the FDA adverse event reporting system. The database contains records (2004–2013) of 3,354,043 patients (male and female at all age levels) and 10,112 disorders. Our previous studies [24, 25] have demonstrated that this database is useful in mining comorbidity patterns among diverse patient populations. We applied the association rule mining approach to detect disease comorbidity relationships from the patient-disease pairs. Association rule mining can flexibly detect strong co-occurrence relationships among sets of diseases, and alleviates the intrinsic bias of traditional comorbidity measures (such as relative risk and φ-correlation) towards rare diseases [24, 25]. We constructed an undirected and unweighted comorbidity network based on the result of association rule mining, which is a list of patterns between two sets of diseases, represented in the form x→y. We collected all diseases in the set x and y in each pattern, assuming they have comorbidity relationships with each other, and established an edge between each pair of diseases in xUy to construct the comorbidity network [24].

B. Prioritize the Diseases That Have Strong Associations with Both Obesity and CRC

We extracted the local network consisting of the paths from obesity to CRC in the disease comorbidity network. The local network thus includes the nodes that may represent different aspects of the relationship between obesity and CRC. We implemented breath first search to enumerate the paths, and limited the paths within four steps. Then we ranked the nodes in the local network, except obesity and CRC, based on how important they are in maintaining the local network structure and the connection between obesity and CRC. We used the degree and betweenness centrality to characterize the importance of each node in the flowing of the network. The degree of a node becomes higher if more paths between obesity and CRC pass through this node. The betweeness evaluates the number of times that the node acts as the bridge along the shortest paths. Removing the nodes with highest degree or betweenness can easily break down the connection between obesity and CRC. We investigated the top ranked diseases based on both ranking methods, and used the unexpected ones to guide the detection of genetic associations between obesity and CRC.

C. Identify Gene Overlaps Through Gene Expression Meta-analysis

We chose a top ranked disease on the path between obesity and CRC, and then conducted gene expression meta-analysis for the prioritized disease, obesity and CRC, respectively, to detect new genetic explanations for the relationship between obesity and CRC. Gene expression normalized data (SOFT files) were downloaded from NCBI GEO omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) using the R package GEOquery [28]. Then, we performed microarray meta-analyses for each disease independently using the R package MetaDE [29]. MetaDE implements meta-analysis methods for differential expression analysis, and we used the Fisher’s method. Significant differentially expressed genes (DEGs) were selected as those displaying a FDR corrected p-value <0.05. Last, we extracted the common significant genes for the three diseases.

III. RESULTS

A. Local Disease Comorbidity Network Models the Connection Between Obesity and CRC

We extracted 7006 comorbidity association rules with the confidence larger than 50% from the patient records across ten years. The comorbidity network based on these rules contains 771 nodes and 15,667 edges. Fig. 3 shows the local network consisting of all the 119 paths (no longer than four steps) from obesity to CRC. A total of 24 nodes in the local network are the candidate diseases, which have associations with both obesity and CRC, and may indicate different aspects of the relationship between the two diseases.
Fig. 3.

The local network that contains all paths from obesity to colorectal cancer in the comorbidity network.

B. Osteoporosis Shows High Comorbidity Associations with Both CRC and Obesity

Table 1 shows the top five nodes sorted by degree and betweenness in the local network. In either way of ranking, hypertension, diabetes and hyperlipaemia were in top three and closely related with both obesity and CRC. Substantial literature evidences support that the metabolic syndrome components, hypertension and hyperlipaemia, as well as diabetes have association with obesity and CRC through insulin resistance in substantial literature [6–9, 18]. These three disorders also independently increase the risk of CRC and colorectal adenoma [6, 17, 18]. The top ranked comorbidities demonstrated the validity of our network analysis approach.
TABLE I

TOP FIVE DISEASE NODES IN THE LOCAL NETWORK THAT CONTAINS ALL PATHS FROM OBESITY TO COLORECTAL CANCER. THE DISEASES WERE RANKED BY DEGREE AND BETWEENNESS, RESPECTIVELY.

RankRanked by degreeRanked by betweenness
NodesDegreeNodesBetweenness
1Hypertension26Hypertension60.2
2Diabetes mellitus24Diabetes mellitus55.9
3Hyperlipaemia22Hyperlipaemia35.2
4Osteoporosis14Osteoporosis12.3
5Hypothyroid14Hypothyroid9.5
Significantly, osteoporosis was ranked highly by both centrality ranking methods. Epidemiological studies suggested an inverse association between bone mineral density and CRC [30], colon cancer among postmenopausal women [31], and colorectal adenoma [32]. On the other hand, patients of obesity and osteoporosis may share common genetic and environmental factors [33]. Different from previous studies, our result shows that osteoporosis is crucial for the association between CRC and obesity. Fig. 4 shows the paths of obesity-osteoporosis-CRC. We further investigate the gene expression profiles of osteoporosis patients to gain novel insight of the genetic basis for the link between obesity and CRC.
Fig. 4.

The paths from obesity to colorectal cancer that pass through osteoporosis.

C. Innovative Genes Shared Among Osteoporosis, Obesity and CRC Were Detected Using Gene Expression Meta-analysis

We downloaded five microarray series (GSE4017, GSE9348, GSE4183, GSE8671, GSE20916) for CRC, three (GSE48964, GSE29718, GSE55205) for obesity and three (GSE7429, GSE2208, GSE7158) for osteoporosis. Through meta-analysis, we obtained 9058 significant differentially expressed genes for CRC, 275 for obesity and 91 for osteoporosis. CRC and obesity shared a total of 192 genes. Among them, we found genes on insulin signaling pathways, such as PDK1, PRKAG2 and PDE3B, and adipocytokines, such as IL6 and IL8. The three diseases osteoporosis, obesity and CRC shared six genes. Table II lists the genes and literature evidences, which support their relationships with each of the three diseases. Among them, FOS, JUN, and FOSB are oncogenes. FOS and JUN are known on the insulin signaling pathway. FOSB is on the AP1 pathway, which is associated with the proliferation of colon cancer cells [55]. Several studies suggested that overexpression of FOSB increases the responding of high fat reward while decreases energy expenditure and promotes adiposity [40, 56].
TABLE II

COMMON GENES SHARED BY OBESITY, CRC AND OSTEOPOROSIS, AND PLAUSIBLE EVIDENCE SUPPORTING THEIR RELATIONSHIPS WITH THE THREE DISEASES.

GENESOBESITYCRCOSTEOPOROSIS
PPP1R15A*In the bone morphogenetic protein (BMP) signaling pathway, which regulates appetite [34]Mutations in the BMP pathway are related with colorectal carcinogenesis [35]In the bone morphogenetic protein signaling pathway, which are associated with bone-related diseases, such as osteoporosis [36]
FOSdiet-induced obesity is accompanied by alteration of FOS expression [37]Proto-oncogene, in the KEGG pathway of colorectal cancer [38]Mice lacking c-fos develop severe osteopetrosis [39]
FOSBpositive association between maternal obesity [40]Oncogene, regulators of cell proliferation, has a debatable impact on CRC patient survival [41]Overexpression of FosB increases bone formation [42]
HADHA*Associated with multiple fatty acid metabolism pathways [43]Unknown. Associated with breast cancer [44]Unknown.
JUNThe c-Jun NH2-terminal Kinase Promotes Insulin Resistance [45]Proto-oncogene, in the KEGG pathway of colorectal cancer [38]Associated with osteogenesis [46, 47]
NRIP1*Down-regulated in obese subjects, may suggest a compensatory mechanism to favor energy expenditure and reduce fat accumulation in obesity states [48]Unknown. Involved in regulation of E2F1, an oncogene [49]Modulates transcriptional activity of the estrogen receptor. Interact with ESR1 and ESR2 in osteoporosis [50]

novel genes not involving insulin resistance pathways

Interestingly, we found several genes not involving insulin signaling. Gene PPP1R15A is in the bone morphogenetic protein signaling (BMP) pathway and its superfamily, the TGF beta signaling pathway. The mutation of BMP pathway has been found in patients with juvenile polyposis, which is rare syndrome with an increased risk for developing CRC [51, 52]. Mutations in TGF beta signaling also have been found susceptibility to CRC through genome-wide association studies [53]. A recent mouse experiment also showed that the BMP pathway regulates brown adipogenesis, energy expenditure and appetite, thus is highly associated with diet-induced obesity [54]. These evidences support our result. Further investigation is required to confirm and elucidate the role of the BMP pathway in the connection between obesity and CRC. Gene NRIP1 regulates the estrogen receptor. Its interaction with sex hormone receptors plays a role in both obesity [48] and osteoporosis [50]. Its relationship with CRC is unclear yet, but studies suggested that estrogen may have protective effect on CRC [57]. Gene HADHA is on multiple pathways of fatty acid metabolism. But its role in CRC and osteoporosis is unknown yet. To identify the common genes among obesity, CRC and osteoporosis, we currently analyzed the gene expression data, which can be noisy. While we found literature evidences to support the detected genes and their relationships with both obesity and CRC, these candidate genes need further investigations, for example, through mouse model experiments.

IV. DISCUSSIONS AND CONCLUSIONS

The genetic connection between CRC and obesity is multifactorial and inconclusive. In this study, we developed a comorbidity network analysis approach, which suggested that osteoporosis is important for the connection between obesity and CRC. We identified common genes among obesity, CRC and osteoporosis, and found these genes are associated with the regulation of sex hormone receptors and growth factors inducing bone formation. These genes are candidates in explaining the genetic overlaps between obesity and CRC. Our comorbidity network may be not inclusive and biased toward the diseases whose drugs have high toxicity. The FDA adverse event reporting system collects data from medical product manufacturers, health professionals, and the public. The diseases without drug treatments are not included in the data, and the disease comorbidity relationships were often under-estimated in practice based on these data. In this study, we developed a network analysis approach to compensate the bias of the comorbidity data. In the future, including more complete patient disease data may facilitate the detection of new interesting comorbidities other than osteoporosis for obesity and CRC. In addition, we currently detect comorbidities based on disease co-occurrence. The co-occurrence patterns may indicate the increase of the risk between two diseases in a mutual way. Incorporating more comprehensive patient-level data, such as time series data, may help refine the disease relationships and control confounding factors.
  55 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Germline mutations of the gene encoding bone morphogenetic protein receptor 1A in juvenile polyposis.

Authors:  J R Howe; J L Bair; M G Sayed; M E Anderson; F A Mitros; G M Petersen; V E Velculescu; G Traverso; B Vogelstein
Journal:  Nat Genet       Date:  2001-06       Impact factor: 38.330

3.  Overweight, obesity, and mortality from cancer in a prospectively studied cohort of U.S. adults.

Authors:  Eugenia E Calle; Carmen Rodriguez; Kimberly Walker-Thurmond; Michael J Thun
Journal:  N Engl J Med       Date:  2003-04-24       Impact factor: 91.245

Review 4.  Tgf-beta signaling alterations and colon cancer.

Authors:  Naresh Bellam; Boris Pasche
Journal:  Cancer Treat Res       Date:  2010

5.  The transcriptional coregulator RIP140 represses E2F1 activity and discriminates breast cancer subtypes.

Authors:  Aurélie Docquier; Pierre-Olivier Harmand; Samuel Fritsch; Maïa Chanrion; Jean-Marie Darbon; Vincent Cavaillès
Journal:  Clin Cancer Res       Date:  2010-04-21       Impact factor: 12.531

6.  Association of HADHA expression with the risk of breast cancer: targeted subset analysis and meta-analysis of microarray data.

Authors:  Manju Mamtani; Hemant Kulkarni
Journal:  BMC Res Notes       Date:  2012-01-12

Review 7.  TGF-β and BMP signaling in osteoblast differentiation and bone formation.

Authors:  Guiqian Chen; Chuxia Deng; Yi-Ping Li
Journal:  Int J Biol Sci       Date:  2012-01-21       Impact factor: 6.580

8.  Mining cancer-specific disease comorbidities from a large observational health database.

Authors:  Yang Chen; Rong Xu
Journal:  Cancer Inform       Date:  2014-10-13

9.  The impact of cellular networks on disease comorbidity.

Authors:  Juyong Park; Deok-Sun Lee; Nicholas A Christakis; Albert-László Barabási
Journal:  Mol Syst Biol       Date:  2009-04-07       Impact factor: 11.429

10.  Maternal pregravid obesity changes gene expression profiles toward greater inflammation and reduced insulin sensitivity in umbilical cord.

Authors:  Keshari M Thakali; Jessica Saben; Jennifer B Faske; Forrest Lindsey; Horacio Gomez-Acevedo; Curtis L Lowery; Thomas M Badger; Aline Andres; Kartik Shankar
Journal:  Pediatr Res       Date:  2014-05-12       Impact factor: 3.756

View more
  10 in total

1.  Context-sensitive network-based disease genetics prediction and its implications in drug discovery.

Authors:  Yang Chen; Rong Xu
Journal:  Bioinformatics       Date:  2017-04-01       Impact factor: 6.937

2.  Phenome-based gene discovery provides information about Parkinson's disease drug targets.

Authors:  Yang Chen; Rong Xu
Journal:  BMC Genomics       Date:  2016-08-31       Impact factor: 3.969

3.  Mining comorbidities of opioid use disorder from FDA adverse event reporting system and patient electronic health records.

Authors:  Yiheng Pan; Rong Xu
Journal:  BMC Med Inform Decis Mak       Date:  2022-06-16       Impact factor: 3.298

4.  Towards precision medicine-based therapies for glioblastoma: interrogating human disease genomics and mouse phenotypes.

Authors:  Yang Chen; Zhen Gao; Bingcheng Wang; Rong Xu
Journal:  BMC Genomics       Date:  2016-08-22       Impact factor: 3.969

5.  Interrogating Patient-level Genomics and Mouse Phenomics towards Understanding Cytokines in Colorectal Cancer Metastasis.

Authors:  Xiaoshu Cai; Yang Chen; Chunlei Zheng; Rong Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26

6.  Identification of key genes and pathways associated with obesity in children.

Authors:  Ling Li; Guangyu Wang; Ning Li; Haiyan Yu; Jianping Si; Jiwen Wang
Journal:  Exp Ther Med       Date:  2017-06-13       Impact factor: 2.447

7.  Linking glycemic dysregulation in diabetes to symptoms, comorbidities, and genetics through EHR data mining.

Authors:  Isa Kristina Kirk; Christian Simon; Karina Banasik; Peter Christoffer Holm; Amalie Dahl Haue; Peter Bjødstrup Jensen; Lars Juhl Jensen; Cristina Leal Rodríguez; Mette Krogh Pedersen; Robert Eriksson; Henrik Ullits Andersen; Thomas Almdal; Jette Bork-Jensen; Niels Grarup; Knut Borch-Johnsen; Oluf Pedersen; Flemming Pociot; Torben Hansen; Regine Bergholdt; Peter Rossing; Søren Brunak
Journal:  Elife       Date:  2019-12-10       Impact factor: 8.140

8.  Explore Small Molecule-induced Genome-wide Transcriptional Profiles for Novel Inflammatory Bowel Disease Drug.

Authors:  Xiaoshu Cai; Yang Chen; Zhen Gao; Rong Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2016-07-20

9.  Combining phenome-driven drug-target interaction prediction with patients' electronic health records-based clinical corroboration toward drug discovery.

Authors:  Mengshi Zhou; Chunlei Zheng; Rong Xu
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

10.  Context-sensitive network analysis identifies food metabolites associated with Alzheimer's disease: an exploratory study.

Authors:  Yang Chen; Rong Xu
Journal:  BMC Med Genomics       Date:  2019-01-31       Impact factor: 3.063

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.