| Literature DB >> 26288819 |
Xian-Jun Yu1, Min-Jun Yang2, Bo Zhou3, Gui-Zhen Wang3, Yun-Chao Huang4, Li-Chuan Wu3, Xin Cheng3, Zhe-Sheng Wen5, Jin-Yan Huang6, Yun-Dong Zhang7, Xiao-Hong Gao3, Gao-Feng Li4, Shui-Wang He8, Zhao-Hui Gu6, Liang Ma3, Chun-Ming Pan6, Ping Wang9, Hao-Bin Chen10, Zhi-Peng Hong11, Xiao-Lu Wang3, Wen-Jing Mao1, Xiao-Long Jin12, Hui Kang2, Shu-Ting Chen2, Yong-Qiang Zhu2, Wen-Yi Gu2, Zi Liu3, Hui Dong2, Lin-Wei Tian13, Sai-Juan Chen6, Yi Cao7, Sheng-Yue Wang2, Guang-Biao Zhou3.
Abstract
Air pollution has been classified as Group 1 carcinogenic to humans, but the underlying tumorigenesis remains unclear. In Xuanwei City of Yunnan Province, the lung cancer incidence is among the highest in China attributed to severe air pollution generated by combustion of smoky coal, providing a unique opportunity to dissect lung carcinogenesis of air pollution. Here we analyzed the somatic mutations of 164 non-small cell lung cancers (NSCLCs) from Xuanwei and control regions (CR) where smoky coal was not used. Whole genome sequencing revealed a mean of 289 somatic exonic mutations per tumor and the frequent C:G → A:T nucleotide substitutions in Xuanwei NSCLCs. Exome sequencing of 2010 genes showed that Xuanwei and CR NSCLCs had a mean of 68 and 22 mutated genes per tumor, respectively (p < 0.0001). We found 167 genes (including TP53, RYR2, KRAS, CACNA1E) which had significantly higher mutation frequencies in Xuanwei than CR patients, and mutations in most genes in Xuanwei NSCLCs differed from those in CR cases. The mutation rates of 70 genes (e.g., RYR2, MYH3, GPR144, CACNA1E) were associated with patients' lifetime benzo(a)pyrene exposure. This study uncovers the mutation spectrum of air pollution-related lung cancers, and provides evidence for pollution exposure-genomic mutation relationship at a large scale.Entities:
Keywords: Air pollution; Exome sequencing; Exposure; Lung cancer; Whole genome sequencing
Mesh:
Substances:
Year: 2015 PMID: 26288819 PMCID: PMC4534757 DOI: 10.1016/j.ebiom.2015.04.003
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
The demographic characteristics of the 164 NSCLC patients from HPR or CR.
| Characteristics | Total (n = 164) | HPR (n = 79) | CR (n = 85) |
|---|---|---|---|
| Male | 101 | 42 | 59 |
| Female | 61 | 37 | 24 |
| n.d. | 2 | 0 | 2 |
| < 65 | 122 | 61 | 61 |
| ≥ 65 | 40 | 18 | 22 |
| n.d. | 2 | 0 | 2 |
| Median, range | 56 [34, 78] | 57 [36, 76] | 59 [34, 78] |
| Xuanwei/Fuyuan | 79 | 79 | 0 |
| Rest of Yunnan | 24 | 0 | 24 |
| Guangdong | 61 | 0 | 61 |
| Smoker | 81 | 38 | 43 |
| Non-smoker | 81 | 41 | 40 |
| n.d. | 2 | 0 | 2 |
| Adenocarcinoma | 112 | 64 | 48 |
| Squamous-cell carcinoma | 46 | 14 | 32 |
| Adenosquamous carcinoma | 0 | 0 | 0 |
| Large-cell carcinoma | 6 | 0 | 0 |
| IA | 18 | 12 | 6 |
| IB | 47 | 25 | 22 |
| IIA | 13 | 2 | 11 |
| IIB | 22 | 9 | 13 |
| IIIA | 28 | 13 | 15 |
| IIIB | 15 | 9 | 6 |
| IV | 16 | 9 | 7 |
| n.d. | 5 | 0 | 5 |
n.d.: not determined.
Fig. 1Mutation landscape of lung cancer from HPR. (A): A stacked bar graph representing the total number of non-synonymous versus synonymous mutations in each patient. (B): Summary of somatic genic rearrangements in each patient. “Other Genic” indicates rearrangements linking an intergenic region to the 3′ portions of a genic footprint. (C): Total numbers of recurring and non-recurring mutations in each patient. (D): The 381 recurring mutated genes (with dN/dS > 2), classified into 18 categories. (E): Mutations and copy number variations in calcium signal and ion channel genes.
Comparison of mutations in the 2010 genes in HPR and CR NSCLCs.
| HPR (n = 79) | CR (n = 85) | p (HPR vs CR) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | S (n = 38) | NS (n = 41) | p | Total | S (n = 43) | NS (n = 40) | p | ||
| Mutation/Mb (mean) | 15.90 | 18.31 | 13.66 | 0.818 | 6.34 | 7.98 | 4.49 | 0.0297 | 3.611E-06 |
| G:C → T:A substitution | 54.79% | 54.03% | 55.73% | 0.114 | 41.92% | 45.26% | 38.08% | 3.421E-05 | 1.711E-39 |
| Mutated genes/tumor | 67.99 | 74.87 | 61.61 | 0.930 | 22.06 | 28.16 | 15.58 | 0.017 | 9.966E-07 |
| Non-silent mutations/tumor | 72.66 | 81.24 | 64.71 | 0.895 | 23.13 | 29.67 | 16.22 | 0.015 | 8.796E-07 |
| Genes mutated in > 3% samples | 785 | 703 | 617 | 0.062 | 213 | 273 | 97 | 6.339E-10 | 9.052E-21 |
S, smoker; NS, non-smoker.
The smoking status of two CR patients was unknown.
Fig. 2Comparison of mutations in HPR lung cancer with those in CR NSCLCs. (A): HPR NSCLCs bore more mutated genes than CR patients. (B): Comparison of recurrent mutated genes in HPR with those in CR NSCLCs. (C): Genes mutated in > 5% tumor samples from HPR and CR. (D): Mutations in 59 genes whose mutation frequencies are > 10% of HPR NSCLCs. Tumors are arranged from left to right in the top track.
Fig. 3Mutations in some representative genes. Schematic representations of proteins encoded by the genes are shown. Numbers refer to amino acid residues. Mutations found in HPR and CR patients are shown in red and black, respectively. Each “+” corresponds to an independent, mutated tumor sample, and “*” indicates a nonsense (truncating) mutation. Mutations underlined with a same-colored line are found in the same patient. (A): The five genes which were mutated in > 10% tumor samples in both regions. (B): Representative five genes whose mutation rates in HPR lung cancer were significantly higher than in CR NSCLCs. (C): Mutations in TMEM132C which were mutated in 6/79 (7.59%) HPR NSCLCs and 12/85 (14.12%) CR lung cancers (p = 0.182).
Fig. 4Association between mutations and exposure to BaP. (A): Estimated doses of the patients' exposure to BaP. Sources of combustion of smoky coal and cigarette smoke were included. (B): Exposure–response relationship between BaP exposure and the mutation probability of 24 representative genes. See also Figure S9.