| Literature DB >> 34070617 |
Ewa E Hennig1,2, Anna Kluska2, Magdalena Piątkowska2, Maria Kulecka1,2, Aneta Bałabas2, Natalia Zeber-Lubecka1,2, Krzysztof Goryca2, Filip Ambrożkiewicz2, Jakub Karczmarski2, Tomasz Olesiński3, Łukasz Zyskowski3, Jerzy Ostrowski1,2.
Abstract
Despite great efforts, most of the genetic factors contributing to the risk of colorectal cancer (CRC) remain undetermined. Including small but homogenous populations in genome-wide association studies (GWAS) can help us discover new common risk variants specific to the studied population. In this study, including 465 CRC patients and 1548 controls, a pooled DNA samples-based GWAS was conducted in search of genetic variants associated with CRC in a Polish population. Combined with a new method of selecting single-nucleotide polymorphisms (SNPs) for verification in individual DNA samples, this approach allowed the detection of five new susceptibility loci not previously reported for CRC. The discovered loci were found to explain 10% of the overall risk of developing CRC. The strongest association was observed for rs10935945 in long non-coding RNA LINC02006 (3q25.2). Three other SNPs were also located within genes (rs17575184 in NEGR1, rs11060839 in PIWIL1, rs12935896 in BCAS3), while one was intergenic (rs9927668 at 16p13.2). An expression quantitative trait locus (eQTL) bioinformatic analysis suggested that these polymorphisms may affect transcription factor binding sites. In conclusion, four of the identified variants were located within genes likely involved in tumor invasiveness and metastasis. Therefore, they could possibly be markers of poor prognosis in CRC patients.Entities:
Keywords: colorectal cancer; genome-wide association study; long non-coding RNA; metastasis; polymorphism; tumor progression
Year: 2021 PMID: 34070617 PMCID: PMC8229782 DOI: 10.3390/biology10060465
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
The demographic and clinical characteristics of patients and controls.
| CRC | Control | |
|---|---|---|
| Female | 176 (38) | 969 (63) |
| Male | 289 (62) | 579 (37) |
| Age (mean ± SD) | 66 ± 11 | 55 ± 11 |
| Age (median) | 66 | 58 |
| Age (min.–max.) | 20–91 | 19–95 |
| Tumor localization (%) | ||
| rectum | 173 (37.2) | |
| sigmoid | 79 (17.0) | |
| sigmoid-rectum | 72 (15.5) | |
| caecum | 55 (11.8) | |
| ascendant | 39 (8.4) | |
| other | 47 (10.1) | |
| Tumor size (%) | ||
| 0 | 4 (0.9) | |
| 1 | 40 (8.6) | |
| 2 | 85 (18.3) | |
| 3 | 277 (59.6) | |
| 4 | 56 (12.0) | |
| Tis | 3 (0.6) | |
| Node status (%) | ||
| 0 | 245 (52.7) | |
| 1 | 126 (27.1) | |
| 2 | 78 (16.8) | |
| 3 | 9 (1.9) | |
| Nx | 7 (1.5) | |
| Grade (%) | ||
| 1 | 27 (5.8) | |
| 2 | 284 (61.1) | |
| 3 | 44 (9.5) | |
| Gx | 110 (23.6) | |
| Metastasis (%) | 49 (10.5) |
CRC, colorectal cancer; N, number of subjects; SD, standard deviation; Tis, tumor in situ; Nx, indeterminate; Gx, indeterminate.
The allelic and genotypic association of GWAS-selected, single-nucleotide polymorphisms (SNPs) with colorectal cancer.
| Allele Frequency (%) | Genotype Frequency (%) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| dbSNP ID a | Region | MA | MAF b | Control | CRC | OR (95% CI) | Genotype | Control | CRC | OR (95% CI) | |||||
|
| 1p31.1 | A | 0.088 | 232 (10.8) | 60 (6.5) | 0.57 (0.42–0.76) |
| AA | 11 (1.0) | 1 (0.2) | 0.22 (0.01–1.12) |
| |||
|
| 3q25.2 | T | 0.399 | 906 (42.2) | 478 (51.5) | 1.46 (1.25–1.70) |
| TT | 195 (18.2) | 117 (25.2) | 2.11 (1.54–2.90) |
| |||
| rs10838094 | 11p15.4 | A | 0.378 | 445 (41.4) | 422 (45.5) | 1.18 (0.99–1.41) | 8.03 × 10−2 | AA | 97 (18.1) | 92 (19.8) | 1.34 (0.93–1.92) | 8.12 × 10−2 | |||
| rs12424924 | 12p12.1 | A | 0.194 | 223 (20.5) | 165 (17.9) | 0.85 (0.68–1.06) | 0.147 | AA | 25 (4.6) | 16 (3.5) | 0.71 (0.37–1.36) | 0.152 | |||
|
| 12q24.33 | A | 0.169 | 332 (15.6) | 194 (21.1) | 1.45 (1.19–1.76) |
| AA | 27 (2.5) | 21 (4.6) | 2.06 (1.13–3.71) |
| |||
|
| 16p13.2 | C | 0.290 | 840 (39.1) | 285 (30.9) | 0.70 (0.59–0.82) |
| CC | 179 (16.7) | 46 (10.0) | 0.48 (0.33–0.68) |
| |||
|
| 17q23.2 | C | 0.400 | 545 (25.4) | 194 (20.9) | 0.77 (0.64–0.93) |
| CC | 68 (6.4) | 21 (4.5) | 0.63 (0.37–1.03) |
| |||
Allelic frequencies of all studied SNPs were in Hardy–Weinberg equilibrium. Bold denotes significant association after Benjamini–Hochberg algorithm adjustment (p < 0.05). CRC, colorectal cancer; MA, minor allele; MAF, MA frequency; OR, odds ratio; CI, confidence interval. a/ SNP identifier based on NCBI SNP database (http://www.ncbi.nlm.nih.gov/snp/; accessed on 20 November 2020). b/ MAF based on NCBI SNP database (http://www.ncbi.nlm.nih.gov/snp/; accessed on 20 November 2020).
The results of the stepwise selection for the logistic regression model.
| dbSNP ID a | AIC b | AIC Change (%) |
| |
|---|---|---|---|---|
| rs9927668 | 1309.45 | 0.028 | ||
| rs10935945 | 1294.45 | 15.0 (1.15) | 0.054 | 0.026 (92) |
| rs17575184 | 1285.31 | 9.14 (0.74) | 0.071 | 0.017 (33) |
| rs12935896 | 1279.74 | 5.57 (0.43) | 0.084 | 0.013 (18) |
| rs11060839 | 1274.75 | 4.99 (0.39) | 0.095 | 0.012 (14) |
| rs10838094 | 1274.26 | 0.49 (0.04) | 0.101 | 0.006 (6) |
Six significant SNPs (p < 0.05) ranked by Akaike information criterion (AIC) values were sequentially implemented into the model, starting with SNP rs9927668 with the lowest AIC value. All six SNPs were included in the final prediction model. a/ SNP identifier based on NCBI SNP database (http://www.ncbi.nlm.nih.gov/SNP/; accessed on 20 November 2020). b/ AIC value calculated after sequential implementation of the ranked SNPs. c/ Nagelkerke pseudo-R2 value calculated after sequential implementation of the ranked SNPs.
Figure 1Possibly altered regulatory motifs. Based on HaploReg database (https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php) [22]. Position weight matrix motifs in biological sequences [34]. Allele variants are indicated in bold. Ref, reference sequence; Alt, alternative sequence.