| Literature DB >> 29942513 |
Michelle E Penney1, Patrick S Parfrey2, Sevtap Savas1,3, Yildiz E Yilmaz1,2,4.
Abstract
BACKGROUND: Colorectal cancer has significant impact on individuals and healthcare systems. Many genes have been identified to influence its pathogenesis. However, the genetic basis of mucinous tumor histology, an aggressive subtype of colorectal cancer, is currently not well-known. This study aimed to identify common and rare genetic variations that are associated with the mucinous tumor phenotype.Entities:
Keywords: Colorectal cancer; Common single nucleotide polymorphisms; Genome-wide association study; Mucinous adenocarcinoma; Rare single nucleotide polymorphisms
Year: 2018 PMID: 29942513 PMCID: PMC5998544 DOI: 10.1186/s40364-018-0133-z
Source DB: PubMed Journal: Biomark Res ISSN: 2050-7771
Baseline features of the study cohort and the results of univariable logistic regression analysis
| Mucinous | Non-mucinous | ||||
|---|---|---|---|---|---|
| Characteristics | No. (%) | No. (%) | OR (95% CI) | ||
| Agea | ≤60 | 20 (9) | 203 (91) | ||
| 60–65 | 17 (18) | 78 (82) | 2.21 (1.09–4.44) | 0.025 | |
| >65 | 20 (11) | 167 (89) | 1.22 (0.63–2.34) | 0.558 | |
| Sex | Female | 29 (15) | 169 (85) | ||
| Male | 28 (9) | 279 (91) | 0.58 (0.34–1.02) | 0.057 | |
| Location | Colon | 47 (14) | 287 (86) | ||
| Rectum | 10 (6) | 161 (94) | 0.38 (0.18–0.74) | 0.007 | |
| Stage | I | 3 (3) | 90 (97) | ||
| II | 27 (14) | 169 (86) | 4.79 (1.64–20.45) | 0.012 | |
| III | 19 (11) | 147 (89) | 3.88 (1.28–16.83) | 0.033 | |
| IV | 8 (16) | 42 (84) | 5.71 (1.57–27.09) | 0.013 | |
| Grade | Well/moderately diff. | 48 (10) | 416 (90) | ||
| Poorly diff. | 7 (19) | 30 (81) | 2.02 (0.78–4.62) | 0.115 | |
| Unknown | 2 | 2 | |||
| MSI status | MSI-low/MSS | 49 (11) | 382 (89) | ||
| MSI-high | 6 (11) | 47 (89) | 1.00 (0.37–2.29) | 0.992 | |
| Unknown | 3 | 18 | |||
| Lymphatic | Absent | 31 (10) | 267 (90) | ||
| Present | 23 (14) | 144 (86) | 1.38 (0.77–2.44) | 0.278 | |
| Unknown | 3 | 37 | |||
| Absent | 45 (11) | 366 (89) | |||
| Present | 9 (19) | 38 (81) | 1.93 (0.83–4.09) | 0.104 | |
| Unknown | 3 | 44 | |||
aThe age at diagnosis was separated into 3 groups: ≤60, 60–65, and > 65 since the odds ratio does not remain constant for each year increase in age at diagnosis under the logistic regression model and this particular grouping gave the most efficient odds ratio estimates with no significant change in the results when considering slightly different groupings. CI confidence interval, diff. Differentiated, MSI microsatellite instability, MSS microsatellite stable, No number, OR odds ratio (compares the odds of having mucinous tumors with the corresponding factor level to the odds of having mucinous tumors with the reference factor level)
Top ten promising SNPs identified in univariable analysis and the subsequent multivariable analysis under their plausible genetic models
| Univariable | Multivariablee | |||||||
|---|---|---|---|---|---|---|---|---|
| Genomic location | SNP ID (Genotypea) | Geneb | Information in RegulomeDB | Plausible modelc | OR (95% CI) | OR (95% CI) | ||
| Chr6:110750552 | rs9481067 (GG) |
| ND | Recessive | 4.17 (2.33–7.43) | 1.24E-06 | 4.75 (2.53–8.95) | 1.24E-06 |
| Chr3:114121019 | rs10511330 (CT + CC) |
| Minimal binding evidence | Dominant | 3.77 (2.06–6.81) | 1.24E-05 | 4.85 (2.54–9.23) | 1.40E-06 |
| Chr3:114117327 | rs16822593 (AG + AA) |
| ND | Dominant | 3.70 (2.02–6.68) | 1.59E-05 | 4.83 (2.53–9.20) | 1.50E-06 |
| Chr2:179860562 | rs13019215 (TC + TT) |
| ND | Dominant | 0.27 (0.14–0.48) | 1.56E-05 | 0.23 (0.12–0.43) | 8.20E-06 |
| Chr2:179867985 | rs12471607 (TC + TT) |
| ND | Dominant | 0.27 (0.14–0.48) | 1.65E-05 | 0.23 (0.12–0.43) | 8.42E-06 |
| Chr5:80483574 | rs716897 (CT + CC) |
| Minimal binding evidence | Dominant | 0.27 (0.15–0.47) | 5.33E-06 | 0.26 (0.14–0.47) | 1.12E-05 |
| Chr16:86077637 | rs4843335 (AG + AA) | intergenic | Minimal binding evidence | Dominant | 4.11 (2.11–7.79) | 2.06E-05 | 4.67 (2.98–9.34) | 1.48E-05 |
| Chr6:118634698 | rs11968293 (CA + CC) |
| Minimal binding evidence | Dominant | 0.28 (0.16–0.50) | 1.27E-05 | 0.26 (0.14–0.48) | 1.48E-05 |
| Chr9:131923949 | rs4837345 (TT) | intergenic | Minimal binding evidence | Recessive | 4.72 (2.40–9.05) | 4.00E-06 | 4.56 (2.24–9.11) | 1.97E-05 |
| Chr9:131930494 | kgp10457679/ | intergenic | Likely to affect binding and linked to expression of a gene target | Recessive | 4.72 (2.40–9.05) | 4.00E-06 | 4.56 (2.24–9.11) | 1.97E-05 |
aRisk increasing/decreasing genotype. bBased on Ensembl [51] or dbSNP databases [61]. cUnder the recessive genetic model, minor allele homozygous patients are compared to major allele homozygous and heterozygous patients combined. Under the dominant genetic model, minor allele homozygous and heterozygous patients are combined and compared to major allele homozygous patients. dThe rs number for the kgp10457679 polymorphism was obtained from the UCSC genome browser [62]. eMultivariable models adjusted for sex, age at diagnosis, stage, tumor location, and tumor grade. Patients with missing/unknown data for any of these variables were excluded from the analysis. Chr chromosome, CI confidence interval, ND data not available at RegulomeDB, OR odds ratio (compares the odds of having mucinous tumors with the specified genotype(s)a to the odds of having mucinous tumors with the reference (other) genotype(s))
Genes identified in the common and rare analyses
| Gene symbola | Gene nameb | Function |
|---|---|---|
|
| solute carrier family 22 member 16 | codes for a human ʟ-carnitine transporter protein hCT2. hCT2 has been shown to have undetectable expression in a colon cancer cell line. [ |
|
| coiled-coil domain containing 141 | codes for a protein that plays a critical role in centrosome positioning and movement, particularly radial migration. Centrosome aberrations have been shown to be present in early-stage colorectal cancers and could contribute to chromosomal instability. [ |
|
| solute carrier family 35 member F1 | codes for a member of the solute carrier family 35, a family of nucleotide sugar transporters. [ |
|
| zinc finger and BTB domain containing 20 | codes for a transcriptional repressor. Upregulation of ZBTB20 has been shown to promote cell proliferation in non-small cell lung cancer and is a potential druggable target for the disease. Similarly, overexpression of ZBTB20 has been associated with poor prognosis in patients with hepatocellular carcinoma. [ |
|
| Ras protein specific guanine nucleotide releasing factor 2 | codes for a signalling molecule. RasGRF2 contains regulatory domains for both Ras and Rho GTPases, suggesting it can influence both pathways. The Rho pathway has been thought to be involved in cell migration, while the Ras pathway has been thought to be involved in cell proliferation and survival, which are all processes related to cancer. [ |
|
| SEC24 homolog B, COPII coat complex component | codes for a protein that is a part of the COPII vesicle coat, facilitating molecular transport from the endoplasmic reticulum to the Golgi apparatus. It has been suggested that alterations in vesicle trafficking proteins may be facilitators of epithelial carcinogenesis. [ |
|
| coiled-coil domain containing 109B | also known as |
|
| long intergenic non-protein coding RNA 596 | no literature data available. |
|
| SEC24B antisense RNA 1 | long non-coding RNA (lncRNA) that is involved in gene expression regulation. [ |
|
| NA | no literature data available. |
|
| family with sequence similarity 87 member A | no literature data available. |
aAccording to Ensembl database [51]. bAccording to HUGO Gene Nomenclature Committee (HGNC) [4]. NA Not available
Fig. 1ROC curves and corresponding AUC values for multivariable models. Due to high LD among some of the top 10 SNPs, ROC analysis was performed on only the following SNPs: rs9481067, rs10511330, rs13019215, rs716897, rs4843335, rs11968293, and kgp10457679. AUC: area under the ROC curve, CI: confidence interval, LD: linkage disequilibrium, ROC: receiver operator characteristic
Most significant gene regions identified from SKAT-O multivariable analysis
| Genomic locationa | Geneb | Descriptionc | Other genes in the gene-based regiond | # of SNPs | SNPs | |
|---|---|---|---|---|---|---|
| Chr4:110349928–110467052 |
| protein coding | 5 | rs10516557, kgp21293502, rs10003981, rs17040515, rs17040519 | 1.81E-05 | |
| Chr4:110476361–110614874 |
| protein coding | 6 | rs17619262, rs7654187, rs6831048, rs17619310, rs9997940, rs1053680 | 3.29E-05 | |
| Chr14:24386456–24408777 |
| long intergenic non-protein coding RNA | 6 | rs8010486, rs1159372, rs10135026, rs8005541, rs8019962, kgp19564619 | 3.34E-05 | |
| Chr4:110263631–110359973 |
| noncoding RNA; antisense RNA | 7 | rs10031399, rs17040364, rs17040369, rs11098033, rs17040401, rs12648138, rs11098035 | 4.21E-05 | |
| Chr1:207074273–207084738 |
| pseudogene | 10 | rs3093428, kgp15249933, kgp15191074, rs3093447, kgp22852559, rs3093434, rs3093437, rs3093438, rs3093440, rs41304091 | 5.47E-05 | |
| Chr8:320931–338174 |
| non-coding RNA | – | 7 | rs4527844, kgp20525414, kgp20198205, rs11785854, rs7461388, rs17064450, rs17064458 | 6.58E-05 |
aThese genomic locations describe the region containing the gene as well as 5 kb long sequences before and after the gene. bBased on the information in the UCSC database [62]. cNCBI’s Gene Entrez database [77]. dIn some cases, the gene regions examined also contained sequences of other genes. Chr chromosome