| Literature DB >> 31700535 |
Hannah O'Keefe1,2, Rachel Queen3, Phillip Lord2, Joanna L Elson1,4.
Abstract
Mitochondrial disorders are heterogeneous, showing variable presentation and penetrance. Over the last three decades, our ability to recognize mitochondrial patients and diagnose these mutations, linking genotype to phenotype, has greatly improved. However, it has become increasingly clear that these strides in diagnostics have not benefited all population groups. Recent studies have demonstrated that patients from genetically understudied populations, in particular those of black African heritage, are less likely to receive a diagnosis of mtDNA disease. It has been suggested that haplogroup context might influence the presentation and penetrance of mtDNA disease; thus, the spectrum of mutations that are associated with disease in different populations. However, to date there is only one well-established example of such an effect: the increased penetrance of two Leber's hereditary optic neuropathy mutations on a haplogroup J background. This paper conducted the most extensive investigation to date into the importance of haplogroup context on the pathogenicity of mtDNA mutations. We searched for proven human point mutations across 726 multiple sequence alignments derived from 33 non-human species absent of disease. A total of 58 pathogenic point mutations arise in the sequences of these species. We assessed the sequence context and found evidence of population variants that could modulate the phenotypic expression of these point mutations masking the pathogenic effects seen in humans. This supports the theory that sequence context is influential in the presentation of mtDNA disease and has implications for diagnostic practices. We have shown that our current understanding of the pathogenicity of mtDNA point mutations, primarily built on studies of individuals with haplogroups HVUKTJ, will not present a complete picture. This will have the effect of creating a diagnostic inequality, whereby individuals who do not belong to these lineages are less likely to receive a genetic diagnosis.Entities:
Keywords: comparative genomics; haplogroup; mitochondrial disease; mtDNA
Year: 2019 PMID: 31700535 PMCID: PMC6824070 DOI: 10.1111/eva.12851
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Number of sequences per species before and after quality control
|
|
|
|
|
|
|---|---|---|---|---|
| Primates |
| Banobo | 54 | 54 |
|
| Central Chimpanzee | 56 | 54 | |
|
| Easter Chimpanzee | 33 | 33 | |
|
| Western Chimpanzee | 30 | 30 | |
|
| Crab‐eating Macaque | 44 | 44 | |
| Rodentia |
| Mouse | 50 | 50 |
|
| House Mouse | 59 | 59 | |
|
| Brown Rat | 66 | 66 | |
|
| Bank Vole | 35 | 35 | |
| Anguilliformes |
| European Eel | 55 | 55 |
|
| American Eel | 51 | 51 | |
| Artiodactyla |
| Cow | 275 | 274 |
|
| Yak | 83 | 83 | |
|
| Sheep | 94 | 94 | |
| Clupeiformes |
| Atlantic Herring | 100 | 100 |
| Salmoniformes |
| European Whitefish | 80 | 80 |
| Perissodactyla |
| Horse | 247 | 244 |
| Galliformes |
| Red Jungle Fowl | 66 | 65 |
| Carcharhiniformes |
| Speartooth Shark | 94 | 94 |
| Cypriniformes |
| Silver Carp | 30 | 29 |
|
| Bighead Carp | 36 | 35 | |
| Cetartiodactyla |
| Fin Whale | 154 | 148 |
|
| Bison | 34 | 34 | |
|
| Killer Whale | 87 | 87 | |
|
| Wild Boar | 150 | 131 | |
|
| African Buffalo | 45 | 45 | |
|
| Common Bottlenose Dolphin | 50 | 50 | |
| Carnivora |
| Dog | 391 | 389 |
|
| Island Fox | 41 | 41 | |
|
| Island Fox | 33 | 33 | |
|
| Island Fox | 42 | 42 | |
|
| Brown Bear | 74 | 74 | |
|
| Cave Bear (Extinct) | 34 | 20 |
MitoTip pathogenicity prediction versus the results of the Yarham et al scoring system
| Yarham et al scoring | |||||
|---|---|---|---|---|---|
| Pathogenic (113) | Probably (2) | Possibly (56) | Neutral (100) | ||
| MitoTip prediction | Pathogenic | 33/113 | 2/2 | 2/56 | 0/100 |
| Probably | 23/113 | 0/2 | 16/56 | 8/100 | |
| Possibly | 27/113 | 0/2 | 17/56 | 19/100 | |
| Neutral | 26/113 | 0/2 | 19/56 | 71/100 | |
| No score | 4/113 | 0/2 | 2/56 | 2/100 | |
(A) Mutations present in 100% of the sequences from one or more species. (B) Mutations that are polymorphic in one or more species
| (A) | ||||||
|---|---|---|---|---|---|---|
| Monomorphic | ||||||
| Gene | Mutation | Stem | Secondary structure | Tertiary interactions | ||
| WC Pair | Changed in species? | Mutation | WC Pair | |||
| Ala | 5591G > A | ACC | 5652C > T | Some | – | – |
| Arg | 10450A > G | T | 10460T > A | Some | (13T > C‐22C > T)‐46A > G | 56A > C‐19A |
| Asn | 5703G > A | T | 5687C > T | All | – | – |
| Asn | 5728T > C | ACC | 5659A > G | None | – | – |
| Asp | 7554G > A | AC | 7544C > T | None | – | – |
| Gly | 10010T > C | D | 10002A > G | None | – | – |
| His | 12183G > A | T | 12197C > T | All | (13A‐22A)‐46G > A | – |
| Ile | 4267A > G | ACC | 4326T > C | All | – | – |
| Ile | 4269A > G | ACC | 4324T > C | All | – | – |
| Ile | 4274T > C | D | 4281A > G | All | – | 56C > T‐19T > C |
| Ile | 4281A > G | D | 4274T > C | All | 56C > T‐19A > G | – |
| Ile | 4300A > G | AC | 4286T > C | All | – | – |
| Leu | 3273T > C | AC | 3259A > G | All | 44T > C‐26C | – |
| Leu | 3302A > G | ACC | 3231T > C | All | – | – |
| Leu2 | 12276G > A | D | 12288C > T | None | – | 9G‐23C |
| Lys | 8355T > C | T | 8339A > G | Some | – | (25A > G‐10G)‐45A > G |
| Met | 4403G > A | ACC | 4467C > T | All | – | – |
| Phe | 582T > C | ACC | 641A > G | All | – | – |
| Phe | 583G > A | ACC | 640C > T | All | – | – |
| Phe | 602C > T | D | 586G > A | None | 44A‐26C > T | (25C‐10C)‐45G |
| Phe | 617G > A | AC | 607C > T | None | – | – |
| Pro | 15967G > A | T | 15975C > T | Some | – | – |
| Ser | 7497G > A | D | 7503C > T | All | – | 58A‐54C |
| Ser | 7511T > C | ACC | 7450A > G | None | – | – |
| Ser | 7512T > C | ACC | 7449A > G | All | – | – |
| Trp | 5538G > A | AC | 5552C > T | All | – | – |
| Val | 1606G > A | ACC | 1665C > T | All | – | – |
| Val | 1624C > T | D | 1611G > A | None | 9A‐23C > T | (25C > T‐10G)‐45T |
| Val | 1630A > G | AC | 1638T > C | All | – | – |
Figure 1All pathogenic mutations presenting in 100% of the sequences for every species in which they are identified are classified in this study as monomorphic mutations
Conservation index and GenBank frequency of the 58 Mutations found amongst these 33 species
| Gene | Position | Mutation | Conservation index (%) | GenBank frequency |
|---|---|---|---|---|
| Ala | 5,591 | G > A | 91.11 | 0 |
| 5,628 | T > C | 95.56 | 88 | |
| 5,650 | G > A | 64.44 | 1 | |
| Arg | 10,450 | A > G | 91.11 | 0 |
| Asn | 5,693 | T > C | 100 | 0 |
| 5,703 | G > A | 8.89 | 0 | |
| 5,709 | T > C | 80 | 0 | |
| 5,728 | T > C | 86.67 | 1 | |
| Asp | 7,554 | G > A | 91.11 | 1 |
| Cys | 5,814 | T > C | 75.56 | 128 |
| Glu | 14,674 | T > C | 73.33 | 7 |
| 14,687 | A > G | 88.89 | 267 | |
| 14,709 | T > C | 95.56 | 1 | |
| 14,728 | T > C | 91.11 | 0 | |
| 14,739 | G > A | 71.11 | 0 | |
| Gly | 10,010 | T > C | 100 | 0 |
| His | 12,183 | G > A | 71.11 | 1 |
| Ile | 4,267 | A > G | 93.33 | 0 |
| 4,269 | A > G | 86.67 | 0 | |
| 4,274 | T > C | 95.56 | 0 | |
| 4,281 | A > G | 100 | 1 | |
| 4,284 | G > A | 62.22 | 2 | |
| 4,300 | A > G | 93.33 | 0 | |
| 4,302 | A > G | 97.78 | 0 | |
| 4,309 | G > A | 24.44 | 1 | |
| Leu | 3,243 | A > G | 97.78 | 9 |
| 3,244 | G > A | 95.56 | 6 | |
| 3,251 | A > G | 93.33 | 0 | |
| 3,253 | T > C | 84.44 | 6 | |
| 3,271 | T > C | 82.22 | 0 | |
| 3,273 | T > C | 97.78 | 0 | |
| 3,302 | A > G | 91.11 | 0 | |
| Leu2 | 12,276 | G > A | 97.78 | 1 |
| Lys | 8,342 | G > A | 62.22 | 0 |
| 8,344 | A > G | 37.78 | 4 | |
| 8,355 | T > C | 68.89 | 0 | |
| 8,356 | T > C | 26.67 | 0 | |
| Met | 4,403 | G > A | 97.78 | 1 |
| Phe | 582 | T > C | 80 | 0 |
| 583 | G > A | 95.56 | 0 | |
| 602 | C > T | 97.78 | 0 | |
| 617 | G > A | 97.78 | 0 | |
| 622 | G > A | 93.33 | 0 | |
| 642 | T > C | 91.11 | 0 | |
| Pro | 15,967 | G > A | 35.56 | 0 |
| Ser | 7,472 | A > C | 62.22 | 3 |
| 7,497 | G > A | 8.89 | 1 | |
| 7,511 | T > C | 91.11 | 1 | |
| 7,512 | T > C | 31.11 | 0 | |
| Ser2 | 12,261 | T > C | 88.89 | 0 |
| 12,264 | C > T | 71.11 | 0 | |
| Trp | 5,538 | G > A | 86.67 | 0 |
| 5,540 | G > A | 95.56 | 0 | |
| 5,556 | G > A | 93.33 | 0 | |
| Val | 1,606 | G > A | 71.11 | 0 |
| 1,624 | C > T | 97.78 | 0 | |
| 1,630 | A > G | 15.56 | 0 | |
| 1,644 | G > A | 91.11 | 0 |
Derived from the MitoMaster SNV Query tool.
Mutations arising polymorphically in one or more species
| Gene | Mutation | Monomorphic | Polymorphic |
|---|---|---|---|
| Ala | 5628T > C |
|
|
| Ala | 5650G > A |
|
|
| Asn | 5693T > C |
|
|
| Glu | 14739G > A |
|
|
| Glu | 14674T > C |
|
|
| Glu | 14687A > G |
|
|
| Glu | 14709T > C |
|
|
| Glu | 14728T > C |
|
|
| Ile | 4284G > A |
|
|
| Ile | 4309G > A |
|
|
| Leu | 3253T > C |
|
|
| Leu | 3271T > C |
|
|
| Leu | 3243A > G |
|
|
| Leu | 3244G > A |
|
|
| Lys | 8342G > A |
|
|
| Lys | 8356T > C |
|
|
| Lys | 8344A > G |
|
|
| Phe | 642T > C |
|
|
| Phe | 622G > A |
|
|
| Ser | 7472A > C |
|
|
| Ser2 | 12261T > C |
|
|
| Ser2 | 12264C > T |
|
|
| Trp | 5540G > A |
|
|
| Val | 1644G > A |
|
|
Figure 2Phylogenetic analysis and secondary structure modelling of Sus scrofa mt‐tRNA‐Ala. Polymorphic variability in the Sus scrofa sequences divides the alignment into 11 clades. The phylogenetic network demonstrates the clades with and without the m.5605G > A mutation, drawn using NETWORK 4.6.0.6. As mt‐tRNA‐Ala is encoded on the heavy strand of the mitochondrial genome, all sequences and variants are denoted as the complement to the mtRNA molecule. These differences can be seen between the alignment of the clades and the secondary structure models here. Secondary structure analysis demonstrates m.5650G on the Human rCRS and its G > A change in group 5 of Sus scrofa. The adjoining G:U wobble pair in the rCRS and its change to an A:U Watson‐Crick like pair in Sus scrofa is also noted
Figure 3Phylogenetic analysis and secondary structure modelling of Ovis aries mt‐tRNA‐Lys. Nine clades were derived from polymorphic variability within the alignment. The phylogenetic network demonstrates the clades with and without the m.8344A > G mutation. Secondary structure modelling of the rCRS demonstrates m9344A in the T‐loop, the C:G Watson‐Crick like pair at the terminal of the T‐stem and m.8310T in the D‐Loop. Similarly, modelling of the Ovis aries group 2 sequences demonstrates m.8344A > G, a change from C:G to A:U Watson‐Crick like pairing at the terminal of the T‐stem and m.8310T > C
Figure 4Phylogenetic analysis and secondary structure modelling of Pan troglodytes verus mt‐tRNA‐Lys. Four clades were derived from the alignment, based on polymorphic variation. The phylogenetic network demonstrates clades with and without m.8344A > G. Secondary structure modelling of the human rCRS and group 1 Pan troglodytes verus sequences demonstrate the m.8344A > G and m.8310T > C
Figure 5Phylogenetic analysis and secondary structure modelling of Macaca fascicularis mt‐tRNA‐Val. Polymorphic variability within the M. fascicularis alignment subdivided the sequences into 10 clades. The phylogenetic network, created with NETWORK 4.6.1.6, demonstrates clades with and without m.1644G > A. Secondary structure modelling indicates m.1644G and m.1643A in the rCRS and the m.1644G > A and m.1643A > G changes in M. fascicularis, along with the loss of the C:G Watson‐Crick like pairing at the terminal of the AC‐stem