| Literature DB >> 29257265 |
Zongwen Liu1, Ting Chai2, Yan Zhang3, Alan Chu1, Bing Liang2, Xing Guo1, Zhenjiang Guo1, Rui Song1, Ge Hou1, Jinjin Yuan1, Yaohe Liu1, Yuhao Zhang4.
Abstract
The present study aimed to detect the mutation characteristics of mitochondrial DNA (mtDNA) in Eca109 of Ec9706 cells, and to investigate their association with the nuclear genome (nDNA), thus providing a basis for gene targeting therapies for esophageal squamous cell carcinoma (ESCC). In vitro‑cultured Ec9706 and Eca109 cells were analyzed the changes of single‑nucleotide polymorphisms (SNPs), insertions/deletions (INDELs), copy number varia-tion, and structure variation (SV) of their genome by high‑throughput sequencing. The loci with SV on chromosome 1‑12 of the two ESCC cell lines were ≥5% of the mtDNA, but SV on chromosome 13‑22, X and Y was ≤3%; >40% of loci exhibited gain or loss; intergenic loci with INDEL changes and SNP features accounted for the majority of mutations. The affected genes encoded proteins including nDNA‑encoding intra‑mitochondrial‑transporting proteins, ATP energy generation‑associated proteins and mitochondrial electron respiratory chain proteins, and these proteins were all nucleus‑encoded mitochondrial proteins. The transcription, duplication, and translation of the abnormally expressed mtDNA in Ec9706 and Eca109 cells were closely associated with disorders of nuclear DNA products.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29257265 PMCID: PMC5783496 DOI: 10.3892/mmr.2017.8228
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Figure 1.Workflow of the high-throughput sequencing.
Data coverage analysis.
| Quality control statistics sample | WGC024118D | WGC024119D |
|---|---|---|
| Paired-end read length | 150*2 | 150*2 |
| Total effective data yield (Gb) | 116.434 | 118.01 |
| Total reads number (M) | 776.23 | 786.73 |
| Reads mapping rate | 99.04% | 98.70% |
| Properly paired mapping reads rate | 96.97% | 96.81% |
| No-mismatch mapping reads rate | 61.04% | 60.11% |
| Mismatch alignment bases rate | 0.77% | 0.74% |
| Mean coverage sequencing depth | 39.1 | 39.6 |
| Reference genome coverage | 98.99% | 98.99% |
| Reference genome coverage ≥4X | 98.53% | 98.53% |
| Reference genome coverage ≥10X | 97.21% | 97.28% |
| Reference genome coverage ≥20X | 90.00% | 90.49% |
| Polymerase chain reaction duplication rate | 8.62% | 8.64% |
Figure 2.Detection results of partial single nucleotide polymorphisms. The ‘+’ or ‘−’ before the sequence indicates the sequence alignment orientation for each read; mismatches or inserts/deletions to the hg19 reference genome are brown; low-quality bases (<20 phred score) are in lower case and in gray if they match the reference genome.
Figure 3.Structure variation loci within chromosomes. In nest-generation sequencing, the loci with structure variation on Chr 1–12 were ≥5%, and structure variation on Chr 13–22, X and Y were ≤3%. Chr, chromosome.
Figure 4.CNVs in the Ec9706 (WGC024118D) and Eca109 (WGC024119D) genome. (A) CNVs in the whole genome; the CNV change trend in the whole genome is that loci with gain or loss account for >40% of the entire sequence sites, the main form of CNV in WGC024118D is loss, and for WGC024119D is gain. Red indicates loss; green indicates gain; and blue indicates sequence matching the reference hg19. (B) CNVs in WGC024118D and (C) WGC024119D. Almost all of the loci on Chr 4, 17, and 18 in WGC024118D have gain or loss; more than half loci on Chr 3, 9, 11, 13, 14, 19 and 20 in WGC024118D, and Chr 1, 5, 7, 9, 11, 20 and Y in WGC024119D have gain or loss; almost half of loci on Chr 6, 10, 22, X and Y in WGC024118D and Chr 3, 8, 10, 12, 13, 15, and 16 in WGC024119D have gain or loss; few loci on Chr 8, 12, 15, 16, and 17 in WGC024118D and Chr 4 and 22 in WGC024119D have gain or loss, maintaining nearly normal sequence. Red indicates gain; green indicates loss; and blue indicates normal sequence. Chr, chromosome.
Summary of INDEL detection results.
| Feature | WGC024118D | WGC024119D |
|---|---|---|
| High-confidence INDEL no. | 740783 | 766563 |
| Deletion | 392984 | 408253 |
| Insertion | 347799 | 358310 |
| Heterozygotes | 386393 | 409200 |
| Homozygotes | 327829 | 330279 |
| dbSNP | 455370 (61.5%) | 465321 (60.7%) |
| 1000 Genomes Project | 237781 (32.1%) | 239886 (31.3%) |
| NA | 17882 | 18327 |
| 3′UTR | 6195 | 6354 |
| 5′UTR | 810 | 865 |
| 3′ UTR5, UTR3 | 6 | 3 |
| Downstream | 5402 | 5600 |
| Exonic | 615 | 645 |
| Exonic, splicing | 3 | 2 |
| Intergenic | 403796 | 419365 |
| Intronic | 270105 | 277821 |
| ncRNA 3′UTR | 127 | 128 |
| ncRNA 5′UTR | 23 | 26 |
| ncRNA 5′UTR, ncRNA 3′UTR | 1 | 1 |
| ncRNA exonic | 1241 | 1299 |
| ncRNA intronic | 29459 | 30801 |
| ncRNA splicing | 13 | 14 |
| Splicing | 91 | 95 |
| Upstream | 4851 | 5049 |
| Upstream, downstream | 163 | 168 |
| Exon frameshift deletion | 99 | 108 |
| Exon frameshift insertion | 87 | 89 |
| Exon nonframeshift deletion | 189 | 194 |
| Exon nonframeshift insertion | 148 | 158 |
| Exon stopgain SNV | 7 | 8 |
| Exon stoploss SNV | 1 | 2 |
| Exon unknown | 87 | 88 |
INDEL, insertion/deletion; UTR, untranslated region; SNV, single-nucleotide variant; NA, no annotation.
Genetic screening of nuclear DNA with INDEL.
| Chr | Start | End | Reference sequence | Sequence alteration | QUALITY | Alteration ratio (%) | Variation type | Gene location | Gene | Name (description) |
|---|---|---|---|---|---|---|---|---|---|---|
| Chr 1 | 39500839 | 39500839 | C | – | 841.73 | 100 | Deletion | Downstream | NDUFS5 | NADH:Ubiquinone oxidoreductase subunit S5, (nuclear gene encoding mitochondrial protein, transcript variant 2) |
| Chr 1 | 45804431 | 45804431 | – | T | 819.73 | 100 | Insertion | Intronic | MUTYH | mutY DNA glycosylase, (nuclear gene encoding mitochondrial protein, transcript variant α3) |
| Chr 2 | 191091321 | 191091334 | CAAAAAAAAAAAAA | – | 1600.73 | 100 | Deletion | Intronic | HIBCH | 3-hydroxyisobutyryl-CoA hydrolase (nuclear gene encoding mitochondrial protein, transcript variant 2) |
| Chr 2 | 219652424 | 219652424 | – | CCTCTTACCTG | 3035.73 | 100 | Insertion | Intronic | CYP27A1 | Cytochrome P450 family 27 subfamily A member 1 (nuclear gene encoding mitochondrial protein) |
| Chr 3 | 179339343 | 179339343 | – | GGTCTCGG | 1811.73 | 100 | Insertion | Intronic | NDUFB5 | NADH:Ubiquinone oxidoreductase subunit B5 (nuclear gene encoding mitochondrial protein, transcript variant 1) |
| Chr 3 | 128614563 | 128614563 | – | CTC | 2388.73 | 100 | Insertion | Intronic | ACAD9 | acyl-CoA dehydrogenase family member 9 (nuclear gene encoding mitochondrial protein, transcript variant 1) |
| Chr 4 | 106312189 | 106312189 | – | C | 806.73 | 100 | Insertion | Intronic | PPA2 | Pyrophosphatase (inorganic) 2 (nuclear gene encoding mitochondrial protein, transcript variant 1) |
| Chr 4 | 89197868 | 89197875 | GACTGTCC | – | 1078.74 | 100 | Deletion | Intronic | PPM1K | Protein phosphatase, Mg2+/Mn2+ dependent, 1K (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 29538224 | 29538225 | CT | – | 1835.73 | 100 | Deletion | Intronic | MECR | Mitochondrial |
| Chr 1 | 47107042 | 47107042 | – | G | 1521.73 | 100 | Insertion | Intronic | ATPAF1 | ATP synthase mitochondrial F1 complex assembly factor 1 (nuclear gene encoding mitochondrial protein) |
Summary of SNP detection results.
| Feature | WGC024118D | WGC024119D |
|---|---|---|
| High-confidence SNP no. | 3851527 | 3861523 |
| Heterozygotes | 1938738 | 1944372 |
| Homozygotes | 1911032 | 1915229 |
| dbSNP | 3685859 (95.7%) | 3694088 (95.7%) |
| 1000 Genomes Project | 3520593 (91.4%) | 3520791 (91.2%) |
| 3′UTR | 25886 | 25946 |
| 5′UTR | 5578 | 5570 |
| 5′UTR, 3′UTR | 10 | 12 |
| Downstream | 23646 | 23744 |
| Exonic | 23196 | 23198 |
| Exonic, splicing | 6 | 7 |
| Intergenic | 2255882 | 2264134 |
| Intronic | 1327100 | 1327665 |
| ncRNA 3′UTR | 615 | 618 |
| ncRNA 5′UTR | 97 | 105 |
| ncRNA exonic | 9941 | 10103 |
| ncRNA intronic | 156029 | 156787 |
| ncRNA splicing | 52 | 55 |
| Splicing | 71 | 68 |
| Upstream | 22691 | 22772 |
| Upstream, downstream | 727 | 739 |
| Exon nonsynonymous SNV | 10720 | 10774 |
| Exon stopgain SNV | 98 | 92 |
| Exon stoploss SNV | 15 | 13 |
| Exon synonymous SNV | 11962 | 11915 |
| Exon unknown | 407 | 411 |
SNP, single nucleotide polymorphism; UTR, untranslated region; SNV, single-nucleotide variant.
Screening of nuclear DNA genes with SNP features.
| Chr | Start | End | Reference sequence | Sequence alteration | QUALITY | Alteration ratio (%) | Variation type | Gene location | Gene | Name (description) |
|---|---|---|---|---|---|---|---|---|---|---|
| Chr 1 | 28564279 | 28564279 | A | G | 1000.77 | 100 | SNP | Exonic | ATPIF1 | ATPase inhibitory factor 1 (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 29542637 | 29542637 | A | G | 1442.77 | 100 | SNP | Exonic | MECR | Mitochondrial |
| Chr 1 | 24134721 | 24134721 | T | C | 794.77 | 100 | SNP | Exonic | HMGCL | 3-Hydroxymethyl-3-methylglutaryl-CoA lyase (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 36927733 | 36927733 | G | A | 1452.77 | 100 | SNP | Exonic | MRPS15 | Mitochondrial ribosomal protein S15 (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 47133811 | 47133811 | T | C | 457.77 | 100 | SNP | Exonic | ATPAF1 | ATP synthase mitochondrial F1 complex assembly factor 1 (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 24134721 | 24134721 | T | C | 1025.77 | 100 | SNP | Exonic | HMGCL | 3-Hydroxymethyl-3-methylglutaryl-CoA lyase (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 29529699 | 29529699 | T | C | 982.77 | 100 | SNP | Exonic | MECR | Mitochondrial |
| Chr 1 | 17380497 | 17380497 | G | T | 896.77 | 100 | SNP | Exonic | SDHB | Homo sapiens succinate dehydrogenase complex iron sulfur subunit B (nuclear gene encoding mitochondrial protein) |
| Chr 1 | 19201956 | 19201956 | A | G | 847.77 | 100 | SNP | Exonic | ALDH4A1 | Aldehyde dehydrogenase 4 family member A1 (nuclear gene encoding mitochondrial protein) |