| Literature DB >> 22412894 |
Hubert H Rogers1, Sam Griffiths-Jones.
Abstract
Mitochondrial pseudogenes in nuclear chromosomes (numts) have been detected in the genomes of a diverse range of eukaryotic species. However, the numt content of different genomes and their properties is not uniform, and study of these differences provides insight into the mechanisms and dynamics of genome evolution in different organisms. In the genus Drosophila, numts have previously only been identified on a genome-wide scale in the melanogaster subgroup. The present study extends the identification to 11 species of the Drosophila genus. We identify a total of 302 numts and show that the numt complement is highly variable in Drosophilids, ranging from just 4 in D. melanogaster to 67 in D. willistoni, broadly correlating with genome size. Many numts have undergone large-scale rearrangements in the nucleus, including interruptions, inversions, deletions and duplications of sequence of variable size. Estimating the age of the numts in the nucleus by phylogenetic tree reconstruction reveals the vast majority of numts to be recent gains, 90% having arisen on terminal branches of the species tree. By identifying paralogs and counting duplications among the extant numts we estimate that 23% of extant numts arose through post-insertion duplications. We estimate genus average rates of insertion of 0.75 per million years, and a duplication rate of 0.010 duplications per numt per million years.Entities:
Mesh:
Year: 2012 PMID: 22412894 PMCID: PMC3296715 DOI: 10.1371/journal.pone.0032593
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Numbers of numt annotations in 11 Drosophila species.
| Genome size | # numts annotated | Average length (bps) | Total numt content (bps) | |
|
| 118 (5.35) | 4 | 210 | 838 |
|
| 111 (2.73) | 5 | 700 | 3,501 |
|
| 115 (3.67) | 25 | 1,502 | 37,553 |
|
| 127 (12.04) | 9 | 1,497 | 13,471 |
|
| 134 (6.97) | 20 | 1,255 | 25,103 |
|
| 176 (24.93) | 26 | 1,537 | 39,952 |
|
| 138 (8.47) | 54 | 1,335 | 72,107 |
|
| 187 (15.57) | 67 | 900 | 60,284 |
|
| 172 (13.96) | 24 | 3,029 | 72,689 |
|
| 161 (8.92) | 59 | 2,506 | 147,862 |
|
| 138 (2.84) | 9 | 2,075 | 18,673 |
Genome size estimated by assembly size [20].
Repeat content annotated by ReAS [48], excluding scaffolds <200 kb [20].
Average numt length excludes internal duplications and interrupting sequence between merged fragments.
Figure 1Mitochondrial origins of numts.
Locations of origin of numt sequences (black) in the mitochondrial genomes (grey) for 11 Drosophila species are shown. The D. melanogaster mitochondrial assembly is relatively large, because it has a larger portion of the variable control region sequenced.
Number of gross rearrangement events for each Drosophila species.
| Interruptions | Deletions | Inversions | Duplications | |
|
| - | - | - | - |
|
| - | - | - | - |
|
| 2 (2) | 1 (1) | 1 (1) | - |
|
| 4 (2) | 5 (3) | - | 1 (1) |
|
| - | 4 (2) | - | - |
|
| 5 (5) | - | - | - |
|
| - | 5 (4) | 2 (2) | 1 (1) |
|
| 21 (8) | 12 (7) | 1 (1) | 4 (4) |
|
| 12 (6) | 4 (3) | - | 3 (3) |
|
| 15 (8) | 16 (11) | 2 (2) | 9 (5) |
|
| - | 1 (1) | - | - |
|
|
|
|
|
|
|
|
|
|
|
|
Interrupting sequence, deletions, inversions and duplications affecting at least 200 bases of numt sequence in the nuclear genomes after insertion are shown. The number of distinct numts affected is shown in parentheses. Multiple rearrangements of the same type were only counted once for each numt.
Figure 2Age of numt insertions.
Average frequencies (insertions per million years) of numt insertions on each branch of the Drosophila tree are shown. In parentheses is the number of extant numts that have arisen by duplication (left), and the number of distinct paralog sets (right). Divergence times were derived from TimeTree [47] and the tree toplogy from [20].
Insertion and duplication rates for each lineage.
| Branch | Insertion rate (insertions per m.y.) | Duplication rate – one per set(duplications per numt per m.y.) | Duplication rate – total(duplications per numt per m.y. |
|
| 0.12 | - | - |
|
| 0.23 | - | - |
|
| 1.92 | 0.063 | 0.094 |
|
| 0.45 | 0.006 | 0.006 |
|
| 0.75 | 0.039 | 0.064 |
|
| 0.47 | 0.002 | 0.004 |
|
| 0.66 | 0.005 | 0.012 |
|
| 0.80 | 0.003 | 0.006 |
|
| 0.66 | 0.011 | 0.028 |
|
| 1.71 | 0.006 | 0.041 |
|
| 0.14 | 0.009 | 0.009 |
Duplication rates were calculated both including and excluding multiple duplications from the same paralog set, and using only terminal branches of the tree. No duplications were observed in D. melanogaster and D. simulans.