| Literature DB >> 27955616 |
Levent Albayrak1,2,3, Kamil Khanipov1,2,3, Maria Pimenova1,2, George Golovko1,2, Mark Rojas1,2, Ioannis Pavlidis3, Sergei Chumakov4, Gerardo Aguilar4, Arturo Chávez4, William R Widger5, Yuriy Fofanov6,7.
Abstract
BACKGROUND: Low-abundance mutations in mitochondrial populations (mutations with minor allele frequency ≤ 1%), are associated with cancer, aging, and neurodegenerative disorders. While recent progress in high-throughput sequencing technology has significantly improved the heteroplasmy identification process, the ability of this technology to detect low-abundance mutations can be affected by the presence of similar sequences originating from nuclear DNA (nDNA). To determine to what extent nDNA can cause false positive low-abundance heteroplasmy calls, we have identified mitochondrial locations of all subsequences that are common or similar (one mismatch allowed) between nDNA and mitochondrial DNA (mtDNA).Entities:
Keywords: Heteroplasmy; High throughput sequencing; Low-abundance mutation; Minor allele; Mitochondria; NUMT; Rare variant
Mesh:
Year: 2016 PMID: 27955616 PMCID: PMC5153897 DOI: 10.1186/s12864-016-3375-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The nucleotide-by-nucleotide perfect and approximate match nDNA interference maps for the mitochondrial genome. 8F, 8R, 10F, and 10R are locations of long-range PCR primers used in Li et al. study [27]. “Maximum” (colored in blue) represents the length of the longest identified LCS (for exact match map) or LSS (for approximate match map) that covers an individual nucleotide position in mtDNA. “Average” (yellow) represents the average length of the identified LCS or LSS that covers an individual nucleotide position in mtDNA
Longest common and longest similar subsequences in mitochondrial genes
| Gene Name | Length | Start Position | End Position | Maximum Length of Longest Common Subsequences | Average Length of Longest Common Subsequences | Maximum Length of Longest Similar Subsequences | Average Length of Longest Similar Subsequences |
|---|---|---|---|---|---|---|---|
|
| 71 | 577 | 647 | 93 | 55.2 | 127 | 85.2 |
|
| 954 | 648 | 1601 | 219 | 108.3 | 228 | 139.9 |
|
| 69 | 1602 | 1670 | 97 | 95.0 | 133 | 128.8 |
|
| 1559 | 1671 | 3229 | 99 | 51.3 | 155 | 81.1 |
|
| 75 | 3230 | 3304 | 42 | 28.6 | 46 | 37.3 |
|
| 956 | 3307 | 4262 | 207 | 74.0 | 263 | 108.6 |
|
| 69 | 4263 | 4331 | 207 | 174.8 | 417 | 285.9 |
|
| 72 | 4400 | 4329 | 137 | 137.0 | 417 | 417.0 |
|
| 68 | 4402 | 4469 | 279 | 162.7 | 417 | 417.0 |
|
| 1042 | 4470 | 5511 | 279 | 131.9 | 417 | 242.3 |
|
| 68 | 5512 | 5579 | 81 | 81.0 | 322 | 322.0 |
|
| 69 | 5655 | 5587 | 240 | 240.0 | 322 | 322.0 |
|
| 73 | 5729 | 5657 | 240 | 240.0 | 322 | 322.0 |
|
| 66 | 5826 | 5761 | 240 | 219.8 | 322 | 316.3 |
|
| 66 | 5891 | 5826 | 182 | 144.7 | 380 | 351.6 |
|
| 1542 | 5904 | 7445 | 245 | 145.7 | 380 | 255.8 |
|
| 69 | 7514 | 7446 | 204 | 204.0 | 333 | 333.0 |
|
| 68 | 7518 | 7585 | 204 | 129.8 | 333 | 333.0 |
|
| 684 | 7586 | 8269 | 188 | 86.4 | 333 | 169.4 |
|
| 70 | 8295 | 8364 | 188 | 188.0 | 251 | 251.0 |
|
| 207 | 8366 | 8572 | 188 | 76.2 | 251 | 177.3 |
|
| 681 | 8527 | 9207 | 141 | 97.4 | 224 | 183.6 |
|
| 784 | 9207 | 9990 | 126 | 71.4 | 207 | 140.4 |
|
| 68 | 9991 | 10058 | 32 | 29.4 | 50 | 45.0 |
|
| 346 | 10059 | 10404 | 41 | 24.4 | 76 | 36.6 |
|
| 65 | 10405 | 10469 | 57 | 49.4 | 83 | 81.6 |
|
| 297 | 10470 | 10766 | 57 | 33.6 | 84 | 63.6 |
|
| 1378 | 10760 | 12137 | 71 | 37.1 | 117 | 70.6 |
|
| 69 | 12138 | 12206 | 52 | 44.9 | 78 | 78.0 |
|
| 59 | 12207 | 12265 | 58 | 50.3 | 78 | 62.3 |
|
| 71 | 12266 | 12336 | 58 | 51.8 | 61 | 59.6 |
|
| 1812 | 12337 | 14148 | 77 | 34.1 | 92 | 53.0 |
|
| 525 | 14673 | 14149 | 91 | 52.4 | 110 | 78.5 |
|
| 69 | 14742 | 14674 | 75 | 75.0 | 110 | 89.0 |
|
| 1141 | 14747 | 15887 | 62 | 27.4 | 87 | 43.3 |
|
| 66 | 15888 | 15953 | 26 | 20.0 | 30 | 24.0 |
|
| 68 | 15956 | 16023 | 32 | 26.6 | 40 | 36.1 |
Fig. 2Poisson distributions. Approximation of average length of LCS and LSS covering each position in the mitochondrial genome by two (a, c) and three (b, d) mixed Poisson distributions. The red dashed line represents the joint approximation of the mixed distributions
Characteristics of four long-range mtDNA PCR primers used by Li et al [27] and four proposed high fidelity long-range primers
| Primers set | Primer | Sequence | Number of similar sequences in nDNA | Primer length/LSS starting from primers position | Number of positions in mtDNA with less value of LSS | Percent of positions in mtDNA with less value of LSS |
|---|---|---|---|---|---|---|
| Long-range mtDNA PCR primers used by | 10 F (forward_10kb) | CCGTGCAAAGGTAGCATAATC | 8 | 21/84 | 11,965 | 72.21% |
| 10R (reverse_10kb) | TTACTTTTATTTGGAGTTGCACCA | 16 | 24/52 | 9,125 | 55.07% | |
| 8 F (forward_8kb) | GGCTTTCTCAACTTTTAAAGGATA | 10 | 24/34 | 6,504 | 39.25% | |
| 8R (reverse_8kb) | TGTCCTGATCCAACATCGAG | 14 | 20/72 | 11,089 | 66.93% | |
| Proposed long-range mtDNA PCR primers | AF (forward_10kb) | TACTACCAGACAACCTTAGCCA | 0 | 22/18 | 218 | 1.32% |
| AR (reverse_10kb) | GAGTCATAAGTGGAGTCCGTAA | 0 | 22/18 | 217 | 1.31% | |
| BF (forward_8kb) | CACCATTTCCGACGGCATCTAC | 0 | 22/16 | 1 | 0.01% | |
| BR (reverse_8kb) | TGCGCCAGGTTTCAATTTCTAT | 0 | 22/18 | 217 | 1.31% |
Fig. 3The length distribution of LSS in the mitochondrial genome. 8F, 8R, 10F, and 10R are locations of forward (F) and reverse (R) primers used for long-range mtDNA amplification by Li et al [27]. AF, AR, BF, and BR are the primers designed to be located in the regions with the lowest similarity to the nuclear genome
Error and parameter values identified for average length of LCS and LSS distributions using two and three Poisson mixtures
| LCS | LSS | |||
|---|---|---|---|---|
| Mixture of 2 Poisson Distributions | Mixture of 3 Poisson Distributions | Mixture of 2 Poisson Distributions | Mixture of 3 Poisson Distributions | |
|
| 0.4071 | 0.1709 | 0.7556 | 0.3993 |
|
| 0.5929 | 0.4195 | 0.2444 | 0.2574 |
|
| 0.4097 | 0.3433 | ||
|
| 0.5729 | 0.1030 | 0.0263 | 0.0277 |
|
| 0.0472 | 0.5691 | 0.5456 | 0.5980 |
|
| 0.1090 | 0.1511 | ||
|
| 11.3771 | 14.0520 | 2.0610 | 4.2692 |
|
| 2.5450 | 11.0717 | 14.1339 | 14.5746 |
|
| 4.96376 | 7.9806 | ||
|
| 2.8675E-04 | 3.3724E-05 | 4.9989E-04 | 3.6505E-05 |