Literature DB >> 27146020

Colorectal cancer risk genes are functionally enriched in regulatory pathways.

Xi Lu1, Mingming Cao2, Su Han3, Youlin Yang1, Jin Zhou4.   

Abstract

Colorectal cancer (CRC) is a common complex disease caused by the combination of genetic variants and environmental factors. Genome-wide association studies (GWAS) have been performed and reported some novel CRC susceptibility variants. However, the potential genetic mechanisms for newly identified CRC susceptibility variants are still unclear. Here, we selected 85 CRC susceptibility variants with suggestive association P < 1.00E-05 from the National Human Genome Research Institute GWAS catalog. To investigate the underlying genetic pathways where these newly identified CRC susceptibility genes are significantly enriched, we conducted a functional annotation. Using two kinds of SNP to gene mapping methods including the nearest upstream and downstream gene method and the ProxyGeneLD, we got 128 unique CRC susceptibility genes. We then conducted a pathway analysis in GO database using the corresponding 128 genes. We identified 44 GO categories, 17 of which are regulatory pathways. We believe that our results may provide further insight into the underlying genetic mechanisms for these newly identified CRC susceptibility variants.

Entities:  

Mesh:

Year:  2016        PMID: 27146020      PMCID: PMC4857176          DOI: 10.1038/srep25347

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Colorectal cancer (CRC) is the third most common form of cancer and the second leading cause of cancer-related death in the western world and12. CRC is a leading cause of cancer-related deaths in the United States, and its lifetime risk in the United States is about 7%13. CRC is a common complex disease caused by the combination of genetic variants and environmental factors1. Genome-wide association studies (GWAS) are considered to be new and power approaches to detect the genetic variants of human complex diseases. Recently, GWAS have been performed and reported some novel CRC susceptibility single nucleotide polymorphisms (SNPs) with genome-wide significance (P < 5.00E-08), and these SNPs have been repliciated by meta-anaysis methods45678910111213. In 2012, Loo et al. conducted a cis-expression quantitative trait loci (cis-eQTLs) analysis to investigate the possible regulatory functions of 19 CRC risk variants on the expression of neighboring genes (<2 Mb up- or down-stream)14. They identified three variants including rs10795668, rs4444235 and rs9929218 to be significantly associated with expression levels of nearby genes14. In 2014, Closa et al. analyzed the association between 26 CRC SNPs and the expression of genes within a 2 Mb region (cis-eQTLs) using 47 healthy colonic mucosa tissues and 97 normal mucosa tissues adjacent to colon cancer, and 97 paired tumor tissues15. Using Bonferroni correction, they identified three significant cis-eQTLs including rs3802842 in 11q23.1 associated with the expression of C11orf53, COLCA1 and COLCA2; rs7136702 in 12q13.12 associated with the expression of DIP2B and rs5934683 in Xp22.3 associated with the expression of SHROOM2 and GPR143. Closa et al. also reported other SNPs including rs7130173 for 11q23.1 and rs61927768 for 12q13.12, which are in linkage disequilibrium (LD) with rs3802842 and rs7136702, and are more strongly associated with the expression of the identified genes and are better functional candidates. In 2014, Yao et al. select 25 CRC SNPs, and test the hypothesis that the CRC SNPs and/or correlated SNPs are in elements that regulate gene expression3. They identified 23 promoters, 28 enhancers, and 66 putative target genes of the risk-associated enhancers3. Evidence shows that most variants for common human diseases are not correlated with protein-coding changes, indicating that susceptibility variants in regulatory regions may contribute to disease phenotypes16. For CRC, most risk variants also reside outside the coding regions of genes31415. Until now, as described above, comprehensive functional studies of CRC SNPs on nearby gene expression have been reported31415. Evidence from the National Human Genome Research Institute (NHGRI) GWAS catalog shows that 85 CRC susceptibility variants, which reach suggestive association P < 1.00E-05, have been identified until now1718. However, the exact genetic mechanisms for these newly identified CRC susceptibility variants are still unclear now. In order to investigate the potential regulatory functions for 85 newly identified CRC susceptibility variants, we conducted a pathway analysis of these CRC susceptibility genes around these CRC susceptibility variants.

Results

CRC susceptibility genes from ProxyGeneLD

Using the ProxyGeneLD and the LD information from the HapMap phase II Europe (CEU), 74 of these 85 unique CRC susceptibility variants were included in HapMap, and were successfully mapped to the corresponding genes 53 unique CRC susceptibility genes (Table 1). However, another 11 SNPs including rs11196172, rs73376930, rs11255841, rs10849432, rs35509282, rs140355816, rs34245511, rs12412391, rs4849303, rs57046232 and rs7999699 are not found in HapMap.
Table 1

The main information for 85 CRC susceptibility variants.

SNPPubmed IDDisease/TraitGeneP valueOR or beta95% CI
rs493982718372901CRCSMAD78.00E-281.2[1.16–1.24]
rs701434618372901CRCLOC1019300339.00E-261.19[1.15–1.23]
rs17453724836286CRCMYRF9.00E-211.16[1.12–1.19]
rs1689276618372905CRCLINC00536-EIF3H3.00E-181.27[1.20–1.34]
rs1079566824836286CRCRNA5SP299-LINC007095.00E-151.15[1.11–1.19]
rs698326717618284CRCCCAT2, LOC1019300331.00E-141.27[1.16–1.39]
rs64716124836286CRCC5orf662.00E-141.15[1.11–1.19]
rs698326724836286CRCCCAT2, LOC1019300335.00E-141.14[1.10–1.18]
rs1079566818372905CRCRNA5SP299-LINC007093.00E-131.12[1.10–1.16]
rs1050547724737748CRCLOC1019300338.00E-131.2[NR]
rs493982717934461CRCSMAD71.00E-121.16[1.09–1.27]
rs1119617224836286CRCTCF7L21.00E-121.14[1.10–1.18]
rs242327924836286CRCSRSF10P23.00E-121.13[1.09–1.17]
rs7337693024737748CRCGREM1, LOC1001313151.00E-111.25[NR]
rs698326723266556CRCCCAT2, LOC1019300331.00E-111.13[1.09–1.18]
rs1050547717618283CRCLOC1019300333.00E-111.17[1.12–1.23]
rs722963924448986CRCSMAD73.00E-111.22[1.15–1.29]
rs242730824737748CRCCABLES23.00E-111.24[NR]
rs103520924737748CRCNKX2-3-SLC25A285.00E-111.12[1.08–1.16]
rs698326718372905CRCCCAT2, LOC1019300337.00E-111.24[1.17–1.33]
rs1125584124737748CRCRNA5SP299-LINC007097.00E-111.19[NR]
rs132131122634755CRCN/A1.00E-101.1[1.07–1.13]
rs1077421424836286CRCRPL18P9-CCND21.00E-101.14[1.09–1.18]
rs64716123263487CRCC5orf661.00E-101.11[1.08–1.15]
rs96125319011631CRCFGFR3P3-CASC202.00E-101.12[1.08–1.16]
rs1116955220972440CRCDIP2B-ATF12.00E-101.09[1.05–1.11]
rs492538620972440CRCLAMA52.00E-101.08[1.05–1.10]
rs493982723266556CRCSMAD72.00E-101.12[1.09–1.16]
rs382499922634755CRCPOLD34.00E-101.08[1.05–1.10]
rs64716123263487CRCC5orf664.00E-101.17[1.11–1.22]
rs1077421423263487CRCRPL18P9-CCND25.00E-101.17[1.11–1.23]
rs380284218372901CRCCOLCA2, COLCA16.00E-101.11[1.08–1.15]
rs1084943224836286CRCLOC1027237676.00E-101.14[1.09–1.18]
rs593468322634755CRCGPR143-SHROOM27.00E-101.07[1.04–1.10]
rs444423519011631CRCRPS3AP46-MIR55808.00E-101.11[1.08–1.15]
rs669117020972440CRCDUSP10-QRSL1P21.00E-091.06[1.03–1.09]
rs1224100825105248CRCVTI1A1.00E-091.13[1.09–1.18]
rs668775820972440CRCDUSP10-QRSL1P22.00E-091.09[1.06–1.12]
rs1041121019011631CRCRHPN25.00E-091.15[1.10–1.20]
rs242327923263487CRCSRSF10P27.00E-091.1[1.06–1.14]
rs3550928225023989CRCMTHFD2P4-TOMM22P48.00E-091.53[1.39–1.67]
rs775822921242260CRCSLC22A38.00E-091.28[1.18–1.39]
rs668775824836286CRCDUSP10-QRSL1P29.00E-091.12[1.08–1.17]
rs414309424743840CRCbGATA39.00E-091.17[1.11–1.23]
rs992921819011631CRCCDH11.00E-081.1[1.06–1.12]
rs180046924836286CRCTGFB1, B9D21.00E-081.09[1.06–1.12]
rs477958421761138CRCSCG5-GREM12.00E-081.18[1.11–1.24]
rs70401724836286CRCZMIZ1-AS12.00E-081.1[1.06–1.13]
rs722963924836286CRCSMAD72.00E-081.2[1.16–1.25]
rs14035581624737748CRCLINC00536-EIF3H2.00E-081.59[NR]
rs698326721242260CRCCCAT2, LOC1019300332.00E-081.18[1.11–1.25]
rs1093659920972440CRCMYNN3.00E-081.04[1.04–1.10]
rs1260352624836286CRCNXN3.00E-081.1[1.06–1.14]
rs1077421423263487CRCRPL18P9-CCND23.00E-081.09[1.06–1.13]
rs3424551124737748CRCLIMA13.00E-081.14[NR]
rs36761523300701CRCKRT18P42-MAN2A14.00E-081.35[1.20–1.49]
rs1190375723266556CRCNABP1-SDPR4.00E-081.16[1.10–1.22]
rs646965624836286CRCLINC00536-EIF3H5.00E-081.09[1.06–1.13]
rs321781023266556CRCCCND26.00E-081.2[1.12–1.28]
rs494831724836286CRCBICC17.00E-081.1[1.06–1.13]
rs1334395424737748CRCRHPN27.00E-081.18[NR]
rs1091125123266556CRCLAMC19.00E-081.09[1.06–1.13]
rs493982721761138CRCSMAD71.00E-071.14[1.08–1.18]
rs242327923263487CRCSRSF10P22.00E-071.14[1.08–1.19]
rs380284224836286CRCCOLCA2, COLCA13.00E-071.09[1.05–1.12]
rs321790123266556CRCCCND23.00E-071.1[1.06–1.14]
rs1313078723266556CRCATOH1-HMGB3P153.00E-071.09[1.06–1.13]
rs1689276621761138CRCLINC00536-EIF3H4.00E-071.24[1.14–1.34]
rs380284221761138CRCCOLCA2, COLCA14.00E-071.14[1.08–1.20]
rs380284223266556CRCCOLCA2, COLCA14.00E-071.11[1.06–1.15]
rs5933623266556CRCTBX34.00E-071.09[1.06–1.13]
rs477958418372905CRCSCG5-GREM15.00E-071.23[1.14–1.34]
rs477958423266556CRCSCG5-GREM15.00E-071.12[1.08–1.19]
rs1241239124836286CRCLOC101927324, LINC014757.00E-071.08[1.05–1.11]
rs102816625192705CRCaN/A7.00E-071.49[1.27–1.74]
rs290187924743840CRCbMEIS1-AS2-DNMT3AP17.00E-071.11[1.06–1.16]
rs187143824743840CRCbNPM1P5-ST8SIA28.00E-071.11[1.07–1.16]
rs166565023263487CRCHSPA12A9.00E-071.13[1.08–1.19]
rs484930324743840CRCbACOXL1.00E-061.12[1.08–1.18]
rs414704524743840CRCbKRT18P63-RPL21P461.00E-061.2[1.11–1.3]
rs818004023350875CRCKLHL18-PTPN232.00E-061.28[1.15–1.41]
rs493982718372905CRCSMAD72.00E-061.18[1.10–1.25]
rs3945323300701CRCSNRPCP19-CYCS2.00E-061.28[1.16–1.43]
rs685588525192705CRCaCCSER12.00E-061.11[1.06–1.16]
rs193375525192705CRCaTMEM200A-SMLR12.00E-061.19[1.11–1.28]
rs137091624743840CRCbMRPL42P4-TNS32.00E-061.11[1.06–1.15]
rs698901024743840CRCbLINC00681-KIAA14562.00E-061.64[1.33–2]
rs259395724743840CRCbMORC12.00E-061.15[1.09–1.2]
rs1253470124743840CRCbDPP62.00E-061.17[1.09–1.24]
rs485569524743840CRCbMORC1-FLJ227632.00E-061.14[1.08–1.20]
rs1254802123350875CRCDUSP4-RPL17P333.00E-061.28[1.155–1.42]
rs1230927424836286CRCWNK13.00E-061.11[1.06–1.16]
rs1041121024836286CRCRHPN23.00E-061.12[1.07–1.17]
rs1773092923300701CRCRPS23P3-CENPC3.00E-061.47[1.25–1.72]
rs1011440823300701CRCMIR4291-BARX13.00E-061.37[1.20–1.56]
rs459151723300701CRCSALL4P5-RPL24P73.00E-061.06[0.88–1.29]
rs1087935723300701CRCTPH23.00E-061.25[1.14–1.39]
rs5704623224737748CRCFGFR3P3-CASC203.00E-061.11[NR]
rs205731423266556CRCDCBLD13.00E-061.08[1.04–1.11]
rs1709498323266556CRCDACT1-RPL31P43.00E-061.13[1.08–1.20]
rs799969924743840CRCbRPL27AP8-SUCLA23.00E-061.32[1.18–1.47]
rs310496423350875CRCLOC100616530, C8orf37-AS14.00E-061.27[1.14–1.40]
rs936572323300701CRCSYNJ24.00E-061.27[1.15–1.41]
rs1167110424743840CRCbGOLGA2P9-ZNF4924.00E-061.25[1.13–1.37]
rs1075288124836286CRCKRT18P28-LAMC15.00E-061.07[1.04–1.10]
rs724888824743840CRCbPNMAL15.00E-061.37[1.19–1.56]
rs731543821761138CRCTBX3-UBA52P76.00E-061.11[1.06–1.15]
rs481380223266556CRCCASC20-BMP27.00E-061.1[1.05–1.14]
rs212838223266556CRCGSDMC-FAM49B8.00E-061.11[1.06–1.16]
rs191245323266556CRCC1orf1109.00E-061.07[1.04–1.11]

CRC, colorectal cancer; CRCa, CRC (calcium intake interaction); CRCb, CRC (diet interaction); OR, odds ratio; CI, confidence interval; Chr, Chromosome; NR, not reported.

CRC susceptibility genes for pathway analysis

Using the nearest upstream and downstream gene method in NHGRI GWAS catalog, we got 106 unique CRC susceptibility gene IDs as described above. We compared these 106 genes with 53 unique CRC susceptibility genes from the ProxyGeneLD, and found 31 shared genes. In the end, we got 128 unique CRC susceptibility genes, which is the union of genes from both methods.

Pathway analysis preprocessing

In WebGestalt database, 120 of 128 genes were successfully mapped to 120 unique Entrez Gene IDs19. Other 8 genes were mapped to multiple Entrez Gene IDs or could not be mapped to any Entrez Gene ID. The following pathway analysis will be based upon the 120 unique gene IDs.

Pathway analysis using GO database

Our pathway analysis in GO database showed that these 120 CRC susceptibility genes were significantly enriched in 40 biological processes, 1 molecular function and 3 cellular components with adjusted P < 0.01. 17 of these 44 significant signals are regulatory pathways, such as regulation of epithelial to mesenchymal transition, negative regulation of Wnt receptor signaling pathway, regulation of pathway-restricted SMAD protein phosphorylation, positive regulation of nucleocytoplasmic transport, regulation of muscle organ development, positive regulation of intracellular protein transport. Interestingly, regulation of epithelial to mesenchymal transition (GO:0010717) is the most significant signal (Table 2). More detailed information including the gene IDs is described in supplementary Table 1.
Table 2

Significant GO pathways from pathway analysis of 128 CRC susceptibility genes.

GO categoriesIDNameCOERrawPadjP
biological processGO:0010717regulation of epithelial to mesenchymal transition3850.1728.737.73E-078.55E-05
biological processGO:0060675ureteric bud morphogenesis6860.3119.276.43E-078.55E-05
biological processGO:0030178negative regulation of Wnt receptor signaling pathway11270.5113.657.60E-078.55E-05
biological processGO:0035295tube development4361226.015.73E-078.55E-05
biological processGO:0060389pathway-restricted SMAD protein phosphorylation3750.1729.516.74E-078.55E-05
biological processGO:0001657ureteric bud development10470.4814.74.59E-078.55E-05
biological processGO:0060393regulation of pathway-restricted SMAD protein phosphorylation3350.1533.083.72E-078.55E-05
biological processGO:0046824positive regulation of nucleocytoplasmic transport7360.3317.959.82E-079.50E-05
biological processGO:0048468cell development1461216.693.141.20E-061.00E-04
biological processGO:0048634regulation of muscle organ development10460.4812.67.84E-062.00E-04
biological processGO:0090316positive regulation of intracellular protein transport8860.414.892.96E-062.00E-04
biological processGO:0032989cellular component morphogenesis1000164.583.498.23E-062.00E-04
biological processGO:0048729tissue morphogenesis472112.165.098.81E-062.00E-04
biological processGO:0048869cellular developmental process28292912.962.246.09E-062.00E-04
biological processGO:0051222positive regulation of protein transport16370.759.389.29E-062.00E-04
biological processGO:0001658branching involved in ureteric bud morphogenesis6450.2917.061.07E-052.00E-04
biological processGO:0061138morphogenesis of a branching epithelium16970.779.041.18E-052.00E-04
biological processGO:0009887organ morphogenesis802153.674.082.53E-062.00E-04
biological processGO:0090092regulation of transmembrane receptor protein serine/threonine kinase signaling pathway16870.779.11.13E-052.00E-04
biological processGO:0045598regulation of fat cell differentiation6450.2917.061.07E-052.00E-04
biological processGO:0016202regulation of striated muscle tissue development10260.4712.847.01E-062.00E-04
biological processGO:0060485mesenchyme development16270.749.448.93E-062.00E-04
biological processGO:0042476odontogenesis10360.4712.727.41E-062.00E-04
biological processGO:0035239tube morphogenesis30091.376.558.57E-062.00E-04
biological processGO:0022612gland morphogenesis11060.511.911.08E-052.00E-04
biological processGO:0048732gland development27791.277.094.49E-062.00E-04
biological processGO:0048762mesenchymal cell differentiation14170.6510.843.57E-062.00E-04
biological processGO:0032388positive regulation of intracellular transport9560.4413.794.64E-062.00E-04
biological processGO:0002009morphogenesis of an epithelium370101.695.96.51E-062.00E-04
biological processGO:2000027regulation of organ morphogenesis14770.6710.44.71E-062.00E-04
biological processGO:0034330cell junction organization19580.898.962.87E-062.00E-04
biological processGO:0051153regulation of striated muscle cell differentiation6450.2917.061.07E-052.00E-04
biological processGO:0042307positive regulation of protein import into nucleus6150.2817.98.45E-062.00E-04
biological processGO:0045662negative regulation of myoblast differentiation1030.0565.511.08E-052.00E-04
biological processGO:0001763morphogenesis of a branching structure19680.98.912.98E-062.00E-04
biological processGO:0000902cell morphogenesis945164.333.74.00E-062.00E-04
biological processGO:0048646anatomical structure formation involved in morphogenesis1594217.32.884.88E-062.00E-04
biological processGO:0000904cell morphogenesis involved in differentiation704133.224.031.52E-053.00E-04
biological processGO:0060284regulation of cell development495112.274.851.38E-053.00E-04
biological processGO:1901213regulation of transcription from RNA polymerase II promoter involved in heart development1230.0554.591.96E-053.00E-04
molecular functionGO:0008013beta-catenin binding6350.2818.18.03E-066.00E-04
cellular componentGO:0005913cell-cell adherens junction4640.1921.183.73E-053.30E-03
cellular componentGO:0070161anchoring junction21860.896.73.00E-048.80E-03
cellular componentGO:0005912adherens junction20160.837.272.00E-048.80E-03

C, the number of reference genes in the category; O, the number of genes in the gene set and also in the category; E, expected number in the category; R, the ratio of enrichment, rawP, the p value from hypergeometric test; adjP, the p value adjusted by the multiple test adjustment.

Discussion

Until now, 85 CRC susceptibility variants with suggestive association P < 1.00E-05 have been reported1718. To investigate the underlying genetic pathways where these newly identified CRC susceptibility genes are significantly enriched, we conducted a functional annotation. Using two kinds of SNP to gene mapping methods including the nearest upstream and downstream gene method and the ProxyGeneLD, we got 128 unique CRC susceptibility genes. We then conducted a pathway analysis in GO database using the corresponding 128 genes. We identified 44 GO categories, 17 of which are regulatory pathways. Here, we identified the regulation of epithelial to mesenchymal transition (GO:0010717) to be the most significant signal in all the 44 GO categories and the most signal in all the 17 regulatory pathways. It is reported that the epithelial-mesenchymal transition-like dedifferentiation of the tumor cells is a character of CRC invasion20. Several studies have reviewed the association between epithelial-mesenchymal transition and CRC progression212223. Our results show that these newly identified CRC susceptibility SNPs or genes may regulate epithelial-mesenchymal transition. The negative regulation of Wnt receptor signaling pathway (GO:0030178) is the third significant signal in all the 44 GO categories and the second significant signal in all the 17 regulatory pathways. Evidence shows that aberrant regulation of the Wnt/β-catenin signaling pathway can cause CRC24. It is reported that the loss-of-function mutations in APC gene are common in CRC, and can cause inappropriate activation of Wnt signaling24. Recently, several studies have reviewed the involvement of Wnt signalling in CRC development252627. Masuda et al. reported Wnt signaling to be the potential therapeutic target by targeting TNIK in CRC28. Here, our results show that these newly identified CRC susceptibility SNPs or genes may regulate Wnt receptor signaling pathway. The positive regulation of nucleocytoplasmic transport pathway (GO:0046824) is the 8th significant signal in all the 44 GO categories and the 4th significant signal in all the 17 regulatory pathways. Hill et al. reviewed the mechanisms and role of nucleocytoplasmic transport in cancer therapy29. Here, we identified the pathway-restricted SMAD protein phosphorylation (GO:0060389) and regulation of pathway-restricted SMAD protein phosphorylation (GO:0060393) to be 5th and 7th significant association signals, respectively. Interestingly, evidence shows that protein phosphorylation is a post-translational modification central to cancer biology30. Protein phosphorylation affects most eukaryotic cellular processes and its deregulation is considered a hallmark of cancer31. We also found that these newly identified CRC susceptibility SNPs or genes may regulate five GO categories related with cell differentiation including regulation of fat cell differentiation (GO:0045598), mesenchymal cell differentiation (GO:0048762), regulation of striated muscle cell differentiation (GO:0051153), negative regulation of myoblast differentiation (GO:0045662), and cell morphogenesis involved in differentiation (GO:0000904). Evidence showed the involvement of differentiation in CRC. Breaking the balance between proliferation and differentiation in animal cells can cause cancer32. PPAR-γ is a nuclear receptor with a dominant regulatory role in differentiation of cells of the adipose lineage33. PPAR-γ can modulate the growth and differentiation of CRC cells33. Differentiated human CRC cells protect tumor-initiating cells from irinotecan34. The resistance of colorectal tumors to irinotecan requires the cooperative action of tumor-initiating ALDHhigh/ABCB1negative cells and their differentiated, drug-expelling, ALDHlow/ABCB1positive daughter cells34. The calcium activated chloride channel A1 (CLCA1) is a member of the calcium sensitive chloride conductance family of proteins and is expressed mainly in the colon32. Recent study shows that CLCA1 plays an important role in differentiation and proliferation of Caco-2 cells, which can regulate the transition from proliferation to differentiation in CRC and may be a potential diagnostic marker for CRC prognosis32. Take together, our findings suggest that most CRC susceptibility variants are located in the intron region of protein encoding genes and are not correlated with protein-coding changes. Most of these 120 CRC susceptibility genes are involved in kinds of regulatory pathways. Our results may provide further insight into the underlying genetic mechanisms for these newly identified CRC susceptibility variants.

Materials and Methods

CRC susceptibility variants

The CRC susceptibility variants were available from the NHGRI GWAS catalog, which collected the results of published GWAS in online database18. We selected 85 unique CRC susceptibility variants with P < 1.00E-05 from the GWAS CRC, CRC (calcium intake interaction), and CRC (diet interaction).

Data preprocessing

In NHGRI GWAS catalog, these 85 unique CRC susceptibility variants were successfully mapped to 167 nearest upstream and downstream genes. We further analyzed these 167 genes and got 106 unique CRC susceptibility gene IDs. The detailed information was described in Table 1.

Mapping SNPs to genes using the ProxyGeneLD

In addition to the nearest upstream and downstream gene method, we also used a Perl software named ProxyGeneLD. ProxyGeneLD can map these 85 SNPs to their corresponding genes using the linkage disequilibrium (LD) information from the HapMap genotyping data (HapMap phase II Europe (CEU), release 22)35. For more detailed algorithms, please refer to the original study35.

CRC susceptibility genes

Here, we map these 85 SNPs to their corresponding genes using both methods as described above. The final CRC susceptibility gene set is the union of genes from both methods.

Pathway analysis using WebGestalt

We used the GO pathways in WebGestalt database for pathway analysis19. The hypergeometric test was used to detect the overrepresentation of differently expressed AD genes among all of the genes in a given pathway19. The reference gene list is the entire Entrez gene set. The minimum number of genes for a category is 3. The FDR test was used to correct for multiple testing. GO pathways with an adjusted P < 0.05 are considered to be significantly associated with CRC.

Additional Information

How to cite this article: Lu, X. et al. Colorectal cancer risk genes are functionally enriched in regulatory pathways. Sci. Rep. 6, 25347; doi: 10.1038/srep25347 (2016).
  35 in total

Review 1.  The canonical Wnt signalling pathway and its APC partner in colon cancer development.

Authors:  Jean Schneikert; Jürgen Behrens
Journal:  Gut       Date:  2006-07-13       Impact factor: 23.059

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

3.  Protein phosphorylation analysis in archival clinical cancer samples by shotgun and targeted proteomics approaches.

Authors:  Angelo Gámez-Pozo; Iker Sánchez-Navarro; Enrique Calvo; Esther Díaz; María Miguel-Martín; Rocío López; Teresa Agulló; Emilio Camafeita; Enrique Espinosa; Juan Antonio López; Manuel Nistal; Juan Ángel Fresno Vara
Journal:  Mol Biosyst       Date:  2011-05-26

Review 4.  Colorectal cancer and genetic alterations in the Wnt pathway.

Authors:  S Segditsas; I Tomlinson
Journal:  Oncogene       Date:  2006-12-04       Impact factor: 9.867

5.  Differentiated human colorectal cancer cells protect tumor-initiating cells from irinotecan.

Authors:  Benjamin L Emmink; Winan J Van Houdt; Robert G Vries; Frederik J H Hoogwater; Klaas M Govaert; Andre Verheem; Maarten W Nijkamp; Ernst J A Steller; Connie R Jimenez; Hans Clevers; Inne H M Borel Rinkes; Onno Kranenburg
Journal:  Gastroenterology       Date:  2011-04-01       Impact factor: 22.682

6.  Strategies and issues in the detection of pathway enrichment in genome-wide association studies.

Authors:  Mun-Gwan Hong; Yudi Pawitan; Patrik K E Magnusson; Jonathan A Prince
Journal:  Hum Genet       Date:  2009-05-01       Impact factor: 4.132

Review 7.  Colorectal cancer as a complex disease: defining at-risk subjects in the general population - a preventive strategy.

Authors:  Annika Lindblom; Xiao-Lei Zhou; Tao Liu; Annelie Liljegren; Johanna Skoglund; Tatjana Djureinovic
Journal:  Expert Rev Anticancer Ther       Date:  2004-06       Impact factor: 4.512

8.  Common variant in 6q26-q27 is associated with distal colon cancer in an Asian population.

Authors:  R Cui; Y Okada; S G Jang; J L Ku; J G Park; Y Kamatani; N Hosono; T Tsunoda; V Kumar; C Tanikawa; N Kamatani; R Yamada; M Kubo; Y Nakamura; K Matsuda
Journal:  Gut       Date:  2011-01-17       Impact factor: 23.059

9.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations.

Authors:  Danielle Welter; Jacqueline MacArthur; Joannella Morales; Tony Burdett; Peggy Hall; Heather Junkins; Alan Klemm; Paul Flicek; Teri Manolio; Lucia Hindorff; Helen Parkinson
Journal:  Nucleic Acids Res       Date:  2013-12-06       Impact factor: 16.971

10.  The transition from proliferation to differentiation in colorectal cancer is regulated by the calcium activated chloride channel A1.

Authors:  Bo Yang; Lin Cao; Bin Liu; Colin D McCaig; Jin Pu
Journal:  PLoS One       Date:  2013-04-12       Impact factor: 3.240

View more
  2 in total

1.  CCDC12 promotes tumor development and invasion through the Snail pathway in colon adenocarcinoma.

Authors:  Fengying Du; Lipan Peng; Qiang Wang; Kangdi Dong; Wenting Pei; Hongqing Zhuo; Tao Xu; Changqing Jing; Leping Li; Jizhun Zhang
Journal:  Cell Death Dis       Date:  2022-02-25       Impact factor: 8.469

2.  Ovarian cancer variant rs2072590 is associated with HOXD1 and HOXD3 gene expression.

Authors:  Liyuan Guo; Yan Peng; Lei Sun; Xia Han; Juan Xu; Dongwei Mao
Journal:  Oncotarget       Date:  2017-10-13
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.