| Literature DB >> 28083984 |
Xiao Wang1,2,3, Jing Li1,2,3, Yong Duan1,2,3, Huifei Wu1,2,3, Qiuyue Xu1,2,3, Yanliang Zhang1,2,3.
Abstract
BACKGROUND: The lung cancer mortality rate in Xuanwei city is among the highest in China and adenocarcinoma is the major histological type. Lung cancer has been associated with exposure to indoor smoky coal emissions that contain high levels of polycyclic aromatic hydrocarbons; however, the pathogenesis of lung cancer has not yet been fully elucidated.Entities:
Keywords: zzm321990Lung adenocarcinoma; single nucleotide variants; somatic mutations; whole genome sequencing
Mesh:
Year: 2017 PMID: 28083984 PMCID: PMC5334298 DOI: 10.1111/1759-7714.12411
Source DB: PubMed Journal: Thorac Cancer ISSN: 1759-7706 Impact factor: 3.500
Sequence coverage summary for normal and tumor genomes
| Sequence coverage | Normal | Tumor |
|---|---|---|
| Gross mapping yield (Gb) | 307.288 | 324.531 |
| Both mates mapped yield (Gb) | 242.639 | 259.218 |
| Average haploid coverage | 81× | 86× |
| Fully called genome fraction | 0.975 | 0.976 |
| SNVs | 3 416 989 | 3 428 060 |
| Substitution | 89 536 | 90 455 |
SNV, single nucleotide variant.
Coverage percentage and variations are with respect to National Center for Biotechnology Information Build 37 of the human genome reference assembly.
Figure 1Flow chart describing the methodology used in filtering single nucleotide variants (SNVs) to obtain candidate novel somatic mutations.
Figure 2Distribution of the 964 novel somatic single nucleotide variants (SNVs) based on their genomic location. Of the 964 potentially novel somatic mutations, 32% were in an intronic region, 65% in an intergenic region, 1% in upstream or downstream regions of a gene, and an additional 1% were in an untranslated (UTR) 3′ or 5′ region.
Details of predicted non‐synonymous single nucleotide variations
| Chromosome | Start | End | Reference | Alt | Gene symbol | Gene function impaction | SIFT |
|---|---|---|---|---|---|---|---|
| 12 | 112 673 325 | 112 673 325 | G | A | HECTD4 | Nonsynonymous SNV | 0 |
| 13 | 49 075 918 | 49 075 918 | C | A | RCBTB2 | Stopgain SNV | 0 |
| 3 | 126 071 687 | 126 071 687 | C | A | KLF15 | Nonsynonymous SNV | 0.01 |
| 12 | 2 721 085 | 2 721 085 | G | T | CACNA1C | Nonsynonymous SNV | 0.05 |
| 14 | 92 482 047 | 92 482 047 | T | A | TRIP11 | Nonsynonymous SNV | 0.11 |
| 14 | 105 405 385 | 105 405 385 | T | A | AHNAK2 | Nonsynonymous SNV | 0.17 |
| 22 | 21 330 800 | 21 330 800 | G | T | AIFM3 | Nonsynonymous SNV | 0.27 |
| 5 | 94 764 399 | 94 764 399 | C | T | FAM81B | Nonsynonymous SNV | 0.32 |
| 9 | 136 421 040 | 136 421 040 | C | A | ADAMTSL2 | Nonsynonymous SNV | 0.34 |
| 16 | 309 499 | 309 499 | G | T | ITFG3 | Nonsynonymous SNV | 0.39 |
| 2 | 189 904 214 | 189 904 214 | T | G | COL5A2 | Nonsynonymous SNV | 0.69 |
| 6 | 47 847 014 | 47 847 014 | G | C | PTCHD4 | Stopgain SNV | 1 |
SNV, single nucleotide variant.
Figure 3(a) Somatic single nucleotide mutation trends. Somatic mutations were primarily C·G‐A·T transversions. Distribution of specific nucleotide changes among germline and somatic variations in the lung adenocarcinoma genome. C·G‐A·T transversions accounted for 56% of high confidence somatic mutations, whereas most germline variations were A·T‐G·C and C·G‐T·A transitions. (b) Somatic single‐nucleotide mutations of all types occurred at a lower frequency on the transcribed strand (TS). TS represent single nucleotide somatic mutation rates, with different types of base substitutions on the coding regions. The non‐transcribed strand (NTS) represents the same classes of mutations occurring on the NTS.
Copy number variation increased in chromosomes 5 and 7
| Chromosome | Start | End | Called score | CNV type score | Overlapping gene |
|---|---|---|---|---|---|
| chr5 | 9 590 000 | 9 632 000 | 3 | 32 |
|
| chr5 | 1 156 000 | 1 228 000 | 3 | 33 |
|
| chr5 | 1 478 000 | 1 498 000 | 3 | 6 |
|
| chr5 | 1 532 000 | 2 582 000 | 3 | 50 |
|
| chr5 | 15 934 000 | 16 328 000 | 3 | 31 |
|
| chr5 | 15 122 000 | 15 890 000 | 3 | 43 |
|
| chr5 | 1 248 000 | 1 410 000 | 3 | 52 |
|
| chr5 | 19 856 000 | 20 046 000 | 3 | 22 |
|
| chr5 | 2 720 000 | 4 094 000 | 3 | 42 |
|
| chr5 | 2 582 000 | 2 720 000 | 4 | 69 | |
| chr5 | 4 194 000 | 4 382 000 | 3 | 26 | |
| chr5 | 4 412 000 | 4 486 000 | 3 | 32 | |
| chr5 | 18 718 000 | 18 852 000 | 3 | 19 | |
| chr5 | 19 256 000 | 19 346 000 | 3 | 4 | |
| chr7 | 55 018 000 | 55 776 000 | 3 | 54 |
|
CNV, copy number variation.
Figure 4Gene pathway network BioCarta analysis. EGFR was found to play an important role in telomerase, cellular aging, and immortality.