| Literature DB >> 25333882 |
Hojun Song1, Matthew J Moulton2, Michael F Whiting3.
Abstract
Nuclear mitochondrial pseudogenes (numts) are non-functional fragments of mtDNA inserted into the nuclear genome. Numts are prevalent across eukaryotes and a positive correlation is known to exist between the number of numts and the genome size. Most numt surveys have relied on model organisms with fully sequenced nuclear genomes, but such analyses have limited utilities for making a generalization about the patterns of numt accumulation for any given clade. Among insects, the order Orthoptera is known to have the largest nuclear genome and it is also reported to include several species with a large number of numts. In this study, we use Orthoptera as a case study to document the diversity and abundance of numts by generating numts of three mitochondrial loci across 28 orthopteran families, representing the phylogenetic diversity of the order. We discover that numts are rampant in all lineages, but there is no discernable and consistent pattern of numt accumulation among different lineages. Likewise, we do not find any evidence that a certain mitochondrial gene is more prone to nuclear insertion than others. We also find that numt insertion must have occurred continuously and frequently throughout the diversification of Orthoptera. Although most numts are the result of recent nuclear insertion, we find evidence of very ancient numt insertion shared by highly divergent families dating back to the Jurassic period. Finally, we discuss several factors contributing to the extreme prevalence of numts in Orthoptera and highlight the importance of exploring the utility of numts in evolutionary studies.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25333882 PMCID: PMC4204883 DOI: 10.1371/journal.pone.0110508
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A summary of numts of three mitochondrial genes (COI, COII, ND5) generated across 28 orthopteran families.
| Taxonomic Information | COI | COII | ND5 | ||||||||||||
| Suborder | Superfamily | Family | Species | total | # ident.to orth. | # uniquenumt | # numtmutation | total | # ident.to orth. | # uniquenumt | # numtmutation | total | # ident. to orth. | # unique numt | # numtmutation |
| Caelifera | Acridoidea | Acrididae |
| 47 | 35 | 12 | 1 | 43 | 25 | 18 | 0 | 43 | 33 | 10 | 1 |
| Caelifera | Acridoidea | Lentulidae |
| 43 | 28 | 15 | 2 | 42 | 27 | 15 | 7 | 26 | 14 | 12 | 9 |
| Caelifera | Acridoidea | Lithidiidae |
| 42 | 18 | 24 | 8 | 46 | 27 | 19 | 9 | 24 | 8 | 16 | 9 |
| Caelifera | Acridoidea | Pamphagidae |
| 46 | 24 | 22 | 3 | 40 | 19 | 20 | 16 | 42 | 31 | 11 | 4 |
| Caelifera | Acridoidea | Pamphagodidae |
| 46 | 31 | 15 | 1 | 16 | 13 | 3 | 0 | 30 | 3 | 27 | 27 |
| Caelifera | Acridoidea | Pyrgacrididae |
| 23 | 21 | 2 | 2 | 54 | 40 | 14 | 3 | 38 | 27 | 11 | 0 |
| Caelifera | Acridoidea | Romaleidae |
| 45 | 31 | 13 | 3 | 47 | 30 | 14 | 0 | 45 | 27 | 18 | 2 |
| Caelifera | Acridoidea | Tristiridae |
| 43 | 27 | 16 | 2 | 43 | 22 | 21 | 5 | 36 | 17 | 18 | 7 |
| Caelifera | Eumastacoidea | Chorotypidae |
| 36 | 27 | 9 | 1 | 16 | 15 | 1 | 0 | 45 | 41 | 4 | 3 |
| Caelifera | Eumastacoidea | Eumastacidae |
| 40 | 16 | 23 | 5 | 9 | 2 | 7 | 7 | 45 | 27 | 18 | 3 |
| Caelifera | Pneumoroidea | Pneumoridae |
| 44 | 28 | 16 | 3 | - | - | - | - | 24 | 11 | 13 | 10 |
| Caelifera | Proscopioidea | Proscopiidae |
| 44 | 0 | 8 | 0 | 27 | 0 | 8 | 8 | 42 | 31 | 11 | 7 |
| Caelifera | Proscopioidea | Thericleidae |
| 32 | 20 | 12 | 3 | 24 | 13 | 11 | 1 | 41 | 30 | 10 | 0 |
| Caelifera | Pyrgomorphoidea | Pyrgomorphidae |
| 22 | 18 | 4 | 0 | 42 | 27 | 15 | 6 | - | - | - | - |
| Caelifera | Tetrigoidea | Tetrigidae |
| 61 | 2 | 32 | 9 | - | - | - | - | 17 | 12 | 5 | 4 |
| Caelifera | Tridactyloidea | Cylindrachetidae |
| 45 | 26 | 11 | 0 | 41 | 24 | 17 | 3 | 36 | 30 | 6 | 6 |
| Caelifera | Tridactyloidea | Ripipterygidae |
| 45 | 33 | 10 | 1 | 5 | 2 | 3 | 0 | 46 | 39 | 7 | 0 |
| Caelifera | Tridactyloidea | Tridactylidae |
| 45 | 38 | 7 | 2 | 37 | 29 | 8 | 2 | 34 | 1 | 16 | 3 |
| Caelifera | Trigonopterygoidea | Trigonopterygidae |
| 37 | 22 | 15 | 5 | 6 | 2 | 4 | 3 | 32 | 16 | 16 | 9 |
| Ensifera | Gryllacridoidea | Gryllacrididae |
| 45 | 24 | 19 | 4 | 46 | 30 | 16 | 4 | 43 | 27 | 15 | 4 |
| Ensifera | Grylloidea | Gryllotalpidae |
| 33 | 28 | 3 | 0 | - | - | - | - | - | - | - | - |
| Ensifera | Grylloidea | Myrmecophilidae |
| 92 | 57 | 26 | 1 | 41 | 31 | 9 | 4 | - | - | - | - |
| Ensifera | Hagloidea | Prophalangopsidae |
| 37 | 23 | 14 | 1 | 42 | 19 | 22 | 12 | 42 | 21 | 21 | 9 |
| Ensifera | Schizodactyloidea | Schizodactylidae |
| 46 | 38 | 8 | 0 | 9 | 8 | 1 | 0 | 44 | 33 | 11 | 0 |
| Ensifera | Stenopelmatoidea | Anostostomatidae |
| 18 | 14 | 4 | 1 | 13 | 12 | 1 | 0 | - | - | - | - |
| Ensifera | Stenopelmatoidea | Rhaphidophoridae |
| 44 | 25 | 19 | 6 | 46 | 21 | 25 | 3 | 43 | 22 | 21 | 10 |
| Ensifera | Stenopelmatoidea | Stenopelmatidae |
| 44 | 22 | 20 | 5 | 46 | 38 | 8 | 0 | 27 | 21 | 6 | 1 |
| Ensifera | Tettigonioidea | Tettigoniidae |
| 68 | 27 | 41 | 12 | 36 | 17 | 9 | 0 | 29 | 16 | 13 | 0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
“total” indicates the total number of clones sequenced from each PCR amplicon;
“# ident. to orth.” indicates the number of cloned sequences that are identical to the orthologous mtDNA;
“# unique numt” indicates the number of unique cloned sequences that are different from the orthologous mtDNA, thus representing numts;
“# numt mutation” indicates the number of unique numts with characteristic in-frame stop codons or indels.
“-” indicates missing data.
Figure 1Numts of three mitochondrial genes (COI, COII, ND5) are extremely abundant across the phylogenetic diversity within Orthoptera.
The y-axis shows the proportion of numts from the total number of clones generated based on PCR using conserved primers. If nearing 1, most clones generated are numts. If nearing 0, most clones generated are orthologous mtDNA.
Figure 2Characteristics of numts as measured by base composition and uncorrected p-distance from orthologs.
A: COI-like numts; B: COII-like numts; C: ND5-like numts.
Figure 3Co-analysis of numts and mtDNA using COI-like numts of Stenopelmatus fuscus (Ensifera: Stenopelmatidae).
Grey terminals represent orthologous COI across Orthoptera. The maximum likelihood analysis recovered a monophyletic group consisting of COI-like numts and the ortholog from S. fuscus. Paleonumts are denoted by yellow with red outline, and neonumts are denoted by yellow with blue outline. Those numts with premature stop codons, insertions, and deletions, are indicated by orange, green, and blue squares, respectively. Numbers on nodes indicate bootstrap support values.
All orthopteran families have two types of numts.
| Taxonomic Information | COI-like | COII-like | ND5-like | ||||||
| Suborder | Superfamily | Family | Species | paleonumt | neonumt | paleonumt | neonumt | paleonumt | neonumt |
| Caelifera | Acridoidea | Acrididae |
| 5 | 7 | 2 | 16 | 1 | 9 |
| Caelifera | Acridoidea | Lentulidae |
| 3 | 12 | 7 | 8 | 0 | 12 |
| Caelifera | Acridoidea | Lithidiidae |
| 18 | 6 | 12 | 7 | 8 | 8 |
| Caelifera | Acridoidea | Pamphagidae |
| 10 | 12 | 16 | 4 | 5 | 6 |
| Caelifera | Acridoidea | Pamphagodidae |
| 2 | 13 | 1 | 2 | 25 | 2 |
| Caelifera | Acridoidea | Pyrgacrididae |
| 0 | 2 | 5 | 9 | 1 | 10 |
| Caelifera | Acridoidea | Romaleidae |
| 2 | 11 | 1 | 13 | 1 | 17 |
| Caelifera | Acridoidea | Tristiridae |
| 6 | 10 | 6 | 15 | 11 | 7 |
| Caelifera | Eumastacoidea | Chorotypidae |
| 1 | 8 | 0 | 1 | 0 | 4 |
| Caelifera | Eumastacoidea | Eumastacidae |
| 14 | 9 | 0 | 7 | 6 | 12 |
| Caelifera | Eumastacoidea | Thericleidae |
| 5 | 7 | 2 | 9 | 1 | 9 |
| Caelifera | Pneumoroidea | Pneumoridae |
| 5 | 11 | - | - | 8 | 5 |
| Caelifera | Proscopioidea | Proscopiidae |
| 8 | 0 | 0 | 8 | 1 | 10 |
| Caelifera | Pyrgomorphoidea | Pyrgomorphidae |
| 0 | 4 | 1 | 14 | - | - |
| Caelifera | Tetrigoidea | Tetrigidae |
| 13 | 19 | - | - | 0 | 5 |
| Caelifera | Tridactyloidea | Cylindrachetidae |
| 4 | 7 | 1 | 16 | 2 | 4 |
| Caelifera | Tridactyloidea | Ripipterygidae |
| 6 | 4 | 1 | 2 | 0 | 7 |
| Caelifera | Tridactyloidea | Tridactylidae |
| 1 | 6 | 4 | 4 | 15 | 1 |
| Caelifera | Trigonopterygoidea | Trigonopterygidae |
| 9 | 6 | 3 | 1 | 3 | 13 |
| Ensifera | Gryllacridoidea | Gryllacrididae |
| 2 | 17 | 2 | 14 | 3 | 12 |
| Ensifera | Grylloidea | Gryllotalpidae |
| 0 | 3 | - | - | - | - |
| Ensifera | Grylloidea | Myrmecophilidae |
| 1 | 25 | 4 | 5 | - | - |
| Ensifera | Hagloidea | Prophalangopsidae |
| 4 | 10 | 14 | 8 | 3 | 18 |
| Ensifera | Schizodactyloidea | Schizodactylidae |
| 1 | 7 | 0 | 1 | 1 | 10 |
| Ensifera | Stenopelmatoidea | Anostostomatidae |
| 0 | 4 | 0 | 1 | - | - |
| Ensifera | Stenopelmatoidea | Rhaphidophoridae |
| 13 | 6 | 10 | 15 | 15 | 6 |
| Ensifera | Stenopelmatoidea | Stenopelmatidae |
| 10 | 10 | 1 | 7 | 0 | 6 |
| Ensifera | Tettigonioidea | Tettigoniidae |
| 27 | 14 | 2 | 7 | 1 | 12 |
|
|
|
|
|
|
|
| |||
The first type is the ancient numt or “paleonumt” which represents the nuclear insertion in the past before mtDNA took its current form. The second type is the recent numt or “neonumt” which did not have enough time to accumulate many mutations.
Examples of clades formed by paleonumts from different orthopteran species.
| Numt type | Family | Numt name | Ortholog AT% | NumtAT% | p-distance from ortholog | Age of MRCAbased on fossil data |
| COI-like numts | Eumastacidae | OR407_C039 | 0.676 | 0.678 | 0.410 | 145–163.5 MYA |
| Pyrgacrididae | OR317_C179 | 0.603 | 0.602 | 0.483 | (Oldest Eumastacidae) | |
| COII-like numts | Lentulidae | OR295_C020 | 0.742 | 0.521 | 0.490 | 98.7–108 MYA |
| Pamphagidae | OR151_C053 | 0.693 | 0.570 | 0.417 | (Oldest Tridactylidae) | |
| Tridactylidae | OR153_C069 | 0.655 | 0.653 | 0.478 | ||
| COII-like numts | Lentulidae | OR295_C011 | 0.742 | 0.636 | 0.432 | 33.9–38 MYA |
| Pamphagidae | OR151_C069 | 0.693 | 0.629 | 0.386 | (Oldest Acridoidea) | |
| ND5-like numts | Lentulidae | OR295_C207 | 0.781 | 0.756 | 0.434 | 182–201 MYA |
| Pamphagodidae | OR540_C135 | 0.725 | 0.728 | 0.521 | (Oldest Prophalangopsidae) | |
| Pamphagodidae | OR540_C121 | 0.725 | 0.516 | 0.505 | ||
| Pneumoridae | OR293_C021 | 0.757 | 0.674 | 0.440 | ||
| Prophalangopsidae | OR021_C021 | 0.709 | 0.584 | 0.416 | ||
| Prophalangopsidae | OR021_C022 | 0.709 | 0.565 | 0.402 |
The recovery of these clades indicates that the nuclear insertion event probably occurred in the most recent common ancestor (MRCA) of the species forming the clades. Numt name indicates the specific cloned sequence number used in the study, available in Table S3. Ortholog AT% indicates the base composition of the orthologous mtDNA sequence of the corresponding numt. Numt AT% is the base composition of the specific numts below to show how similar or different they are from the ortholog. p-distance from ortholog indicates the uncorrected p-distance of the numt sequence from the corresponding ortholog. In general, these paleonumts are highly divergent from the orthologs. Age of MRCA based on fossil data is determined from the oldest known fossil for particular clades, thus showing the maximum date of nuclear insertion.