| Literature DB >> 32193401 |
Ying Duan1, Gaowen Liu2, Yao Sun3, Jiamin Wu3, Zichao Xiong3, Tianbo Jin3, Mingwei Chen4.
Abstract
The abundant expression of collagen type VI α5 (COL6A5) exists in lung tissue, and its role in lung cancer is still unknown. We performed a genetic association study with an attempt to detect the relationships between single nucleotide polymorphisms (SNPs) in COL6A5 and lung cancer predisposition in Chinese Han population. We finally selected six tag-SNPs to determine their genotypes among 510 lung cancer patients and 495 healthy controls with the MassARRAY platform. The associations of SNPs and lung cancer risk were estimated by logistic regression method with adjustment for confounding factors. Two available databases were used for gene expression and prognosis analysis. COL6A5 rs13062453, rs1497305, and rs77123808 were significantly associated with the risk of lung cancer in the whole population or stratified subgroups (p < 0.05). Among them, COL6A5 rs13062453 and rs1497305 were also linked to the susceptibility of lung adenocarcinoma. Additionally, rs1497305 was found to be strongly related to the TNM staging under five genetic models (p < 0.05). Results from databases suggested the important role of COL6A5 in lung cancer development. COL6A5 polymorphisms rs13062453, rs1497305 and rs77123808 were associated with lung cancer risk in Chinese Han population. These findings first yield new insight of COL6A5 in lung cancer.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32193401 PMCID: PMC7081318 DOI: 10.1038/s41598-020-61614-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1COL6A5 gene schematic representation. The specific location of each SNP is marked by asterisk. Minimum unit of the measure is 1 kb. The flanking sequences of SNPs were provided by dbSNP database (http://www.bioinfo.org.cn/relative/dbSNP%20Home%20Page.htm).
Figure 2Comparison of the COL6A5 gene expression in lung adenocarcinoma (LUAD, N = 483), squamous cell carcinoma (LUSC, N = 486) tumors and healthy tissues (LUAD normal tissues: N = 347; LUSC normal tissues: N = 338) in GEPIA database. The expression of COL6A5 gene was down-regulated in both lung cancer types. *p < 0.05.
Figure 3Low expression of COL6A5 gene is associated with poor overall survival in lung cancer patients. The Kaplan-Meier curves presented the overall survival of lung cancer patients with high (red) and low (black) COL6A5 expression levels and generated by Kaplan–Meier method.
Demographic information of the lung cancer cases and healthy controls in this study.
| Variables | Lung Cancer Cases (N = 510) | Healthy Controls (N = 495) | |
|---|---|---|---|
| Age (years, mean ± SD) | 60.78 ± 9.96 | 61.94 ± 7.72 | 0.038a |
| Male | 355 | 346 | 0.920b |
| Female | 155 | 149 | |
| Lung Adenocarcinoma | 188 | ||
| Squamous Cell Carcinoma | 120 | ||
| Others | 202 | ||
| I+II | 84 | ||
| III+IV | 261 | ||
| Absence | 165 | ||
| Positive | 215 | ||
| Negative | 84 | ||
| Absence | 211 | ||
ap value was calculated with independent samples t-test.
bp value was calculated with Pearson’s chi-squared test.
Descriptive information of the selected SNPs in COL6A5.
| SNP | Chromosome | Position | Functional Consequence | Alleles A < B | Minor Allele Frequency | Call Rate | |
|---|---|---|---|---|---|---|---|
| Case | Control | ||||||
| rs77123808 | Chr 3 | 130368023 | intron variant | C < A | 0.338 | 0.356 | 100.00% |
| rs2034664 | Chr 3 | 130370233 | intron variant | T < A | 0.183 | 0.170 | 99.90% |
| rs10212241 | Chr 3 | 130385219 | transcript variant (synonymous codon) | T < C | 0.438 | 0.438 | 99.90% |
| rs13062453 | Chr 3 | 130419022 | intron variant | A < G | 0.365 | 0.328 | 99.80% |
| rs1497305 | Chr 3 | 130428544 | intron variant | A < G | 0.289 | 0.276 | 99.30% |
| rs2403340 | Chr 3 | 130439499 | intron variant | A < G | 0.175 | 0.174 | 99.90% |
SNP: Single nucleotide polymorphism; Chr: Chromosome.
Evaluation of the correlation between COL6A5 variants and lung cancer susceptibility among Chinese Han population.
| SNP | Allele/Genotype | Control N (%) | Case N (%) | OR (95% CI) | |
|---|---|---|---|---|---|
| rs77123808 | A | 638 (64.44%) | 675 (66.18%) | 1.00 | 0.415 |
| C | 352 (35.56%) | 345 (33.82%) | 0.93 (0.77–1.11) | ||
| AA | 205 (41.41%) | 221 (43.33%) | 1.00 | ||
| AC | 228 (46.06%) | 233 (45.69%) | 0.96 (0.73–1.24) | 0.735 | |
| CC | 62 (12.53%) | 56 (10.98%) | 0.85 (0.56–1.27) | 0.423 | |
| rs10212241 | C | 555 (56.17%) | 573 (56.18%) | 1.00 | 0.999 |
| T | 433 (43.83%) | 447 (43.82%) | 1.00 (0.84–1.19) | ||
| CC | 156 (31.58%) | 163 (31.96%) | 1.00 | ||
| CT | 243 (49.19%) | 247 (48.43%) | 0.97 (0.73–1.29) | 0.858 | |
| TT | 95 (19.23%) | 100 (19.61%) | 1.01 (0.71–1.45) | 0.950 | |
| rs13062453 | G | 664 (67.21%) | 646 (63.46%) | 1.00 | 0.078 |
| A | 324 (32.79%) | 372 (36.54%) | 1.18 (0.98–1.42) | ||
| GG | 230 (46.56%) | 203 (39.88%) | 1.00 | ||
| GA | 204 (41.30%) | 240 (47.15%) | |||
| AA | 60 (12.15%) | 66 (12.97%) | 1.21 (0.81–1.81) | 0.346 | |
| rs1497305 | G | 710 (72.45%) | 722 (71.06%) | 1.00 | 0.492 |
| A | 270 (27.55%) | 294 (28.94%) | 1.07 (0.88–1.30) | ||
| GG | 265 (54.08%) | 247 (48.62%) | 1.00 | ||
| GA | 180 (36.73%) | 228 (44.88%) | |||
| AA | 45 (9.18%) | 33 (6.50%) | 0.76 (0.47–1.24) | 0.269 |
SNP: Single nucleotide polymorphism; OR: Odds ratio; 95% CI: 95% confidence interval.
The OR and 95% CI were calculated by logistic regression analysis.
p value was obtained by Wald test with adjustment for age and gender.
*Statistically significant results are in bold (p < 0.05).
Significant COL6A5 variants associated with lung cancer susceptibility in stratified analysis for age or gender.
| SNP | Allele/Genotype | > 61 | <61 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Control N (%) | Case N (%) | OR (95% CI) | Control N (%) | Case N (%) | OR (95% CI) | ||||
| rs1497305 | G | 395 (73.42%) | 355 (73.35%) | 1.00 | 0.979 | 315 (71.27%) | 367 (68.98%) | 1.00 | 0.439 |
| A | 143 (26.58%) | 129 (26.65%) | 1.00 (0.76–1.33) | 127 (28.73%) | 165 (31.02%) | 1.12 (0.85–1.47) | |||
| GG | 146 (54.28%) | 124 (51.24%) | 1.00 | 119 (53.85%) | 123 (46.24%) | 1.00 | |||
| GA | 103 (38.29%) | 107 (44.21%) | 1.19 (0.82–1.71) | 0.360 | 77 (34.84%) | 121 (45.49%) | |||
| AA | 20 (7.43%) | 11 (4.55%) | 0.61 (0.28–1.34) | 0.217 | 25 (11.31%) | 22 (8.27%) | 0.83 (0.44–1.56) | 0.559 | |
| rs77123808 | A | 450 (65.03%) | 464 (65.35%) | 1.00 | 0.899 | 188 (63.09%) | 209 (67.86%) | 1.00 | 0.217 |
| C | 242 (34.97%) | 246 (34.65%) | 0.99 (0.79–1.23) | 110 (36.91%) | 99 (32.14%) | 0.81 (0.58–1.13) | |||
| AA | 152 (43.93%) | 145 (40.85%) | 1.00 | 53 (35.57%) | 75 (48.70%) | 1.00 | |||
| AC | 146 (42.20%) | 174 (49.01%) | 1.28 (0.93–1.76) | 0.132 | 82 (55.03%) | 59 (38.31%) | |||
| CC | 48 (13.87%) | 36 (10.14%) | 0.81 (0.49–1.32) | 0.396 | 14 (9.40%) | 20 (12.99%) | 1.01 (0.47–2.18) | 0.980 | |
| rs13062453 | G | 468 (67.63%) | 461 (65.11%) | 1.00 | 0.319 | 196 (66.22%) | 184 (59.74%) | 1.00 | 0.100 |
| A | 224 (32.37%) | 247 (34.89%) | 1.12 (0.90–1.40) | 100 (33.78%) | 124 (40.26%) | 1.32 (0.95–1.84) | |||
| GG | 163 (47.11%) | 152 (42.94%) | 1.00 | 67 (45.27%) | 51 (33.12%) | 1.00 | |||
| GA | 142 (41.04%) | 157 (44.35%) | 1.20 (0.87–1.65) | 0.261 | 62 (41.89%) | 82 (53.25%) | |||
| AA | 41 (11.85%) | 45 (12.71%) | 1.12 (0.69–1.81) | 0.651 | 19 (12.84%) | 21 (13.64%) | 1.46 (0.71–3.00) | 0.305 | |
| rs1497305 | G | 503 (72.90%) | 515 (72.95%) | 1.00 | 0.984 | 207 (71.38%) | 206 (66.88%) | 1.00 | 0.235 |
| A | 187 (27.10%) | 191 (27.05%) | 1.00 (0.79–1.26) | 83 (28.62%) | 102 (33.12%) | 1.24 (0.87–1.75) | |||
| GG | 187 (54.20%) | 182 (51.56%) | 1.00 | 78 (53.79%) | 65 (42.21%) | 1.00 | |||
| GA | 129 (37.39%) | 151 (42.78%) | 1.22 (0.89–1.66) | 0.223 | 51 (35.17%) | 76 (49.35%) | 0.98 (0.44–2.18) | 0.951 | |
| AA | 29 (8.41%) | 20 (5.67%) | 0.66 (0.36–1.21) | 0.180 | 16 (11.03%) | 13 (8.44%) | |||
SNP: Single nucleotide polymorphism; OR: Odds ratio; 95% CI: 95% confidence interval.
The OR and 95% CI were calculated by logistic regression analysis.
p value was obtained by Wald test with adjustment for age and gender.
*Statistically significant results are in bold (p < 0.05).
Significant COL6A5 variants in stratified analysis for pathological type.
| SNP | Allele/Genotype | Control (N = 495) | Lung Adenocarcinoma | Squamous Cell Carcinoma | ||||
|---|---|---|---|---|---|---|---|---|
| Case (N = 188) | OR (95% CI) | Case (N = 120) | OR (95% CI) | |||||
| rs13062453 | G | 664 (67.21%) | 233 (61.97%) | 1.00 | 0.068 | 157 (65.97%) | 1.00 | 0.715 |
| A | 324 (32.79%) | 143 (38.03%) | 1.26 (0.98–1.61) | 81 (34.03%) | 1.06 (0.78–1.43) | |||
| GG | 230 (46.56%) | 70 (37.23%) | 1.00 | 52 (43.70%) | 1.00 | |||
| GA | 204 (41.30%) | 93 (49.47%) | 53 (44.54%) | 1.18 (0.76–1.83) | 0.456 | |||
| AA | 60 (12.15%) | 25 (13.30%) | 1.27 (0.74–2.20) | 0.391 | 14 (11.76%) | 1.07 (0.55–2.09) | 0.847 | |
| rs1497305 | G | 710 (72.45%) | 261 (69.41%) | 1.00 | 0.267 | 168 (70.59%) | 1.00 | 0.566 |
| A | 270 (27.55%) | 115 (30.59%) | 1.16 (0.89–1.50) | 70 (29.41%) | 1.10 (0.80–1.50) | |||
| GG | 265 (54.08%) | 93 (46.73%) | 1.00 | 59 (49.58%) | 1.00 | |||
| GA | 180 (36.73%) | 97 (48.74%) | 50 (42.02%) | 1.24 (0.81–1.91) | 0.323 | |||
| AA | 45 (9.18%) | 9 (4.52%) | 0.59 (0.27–1.27) | 0.180 | 10 (8.40%) | 1.08 (0.50–2.31) | 0.848 | |
SNP: Single nucleotide polymorphism; OR: Odds ratio; 95% CI: 95% confidence interval.
The OR and 95% CI were calculated by logistic regression analysis.
p value was obtained by Wald test with adjustment for age and gender.
*Statistically significant results are in bold (p < 0.05).
Significant COL6A5 variants associated with TNM staging of lung cancer among Chinese Han population.
| SNP | Allele/Genotype | Control N (%) | Case N (%) | OR (95% CI) | |
|---|---|---|---|---|---|
| rs1497305 | G | 105 (62.50%) | 377 (72.78%) | 1.00 | |
| A | 63 (37.50%) | 141 (27.22%) | |||
| GG | 32 (38.10%) | 134 (51.74%) | 1.00 | ||
| GA | 41 (48.81%) | 109 (42.08%) | 0.63 (0.37–1.07) | 0.089 | |
| AA | 11 (13.10%) | 16 (6.18%) |
SNP: Single nucleotide polymorphism; OR: Odds ratio; 95% CI: 95% confidence interval.
The OR and 95% CI were calculated by logistic regression analysis.
p value was obtained by Wald test with adjustment for age and gender.
*Statistically significant results are in bold (p < 0.05).