| Literature DB >> 27833437 |
Russell L Minton1, Marco A Martinez Cruz2, Mark L Farman3, Kathryn E Perez2.
Abstract
Helicoidea is a diverse group of land snails with a global distribution. While much is known regarding the relationships of helicoid taxa, comparatively little is known about the evolution of the mitochondrial genome in the superfamily. We sequenced two complete mitochondrial genomes from Praticolella mexicana Perez, 2011 representing the first such data from the helicoid family Polygyridae, and used them in an evolutionary analysis of mitogenomic gene order. We found the mitochondrial genome of Praticolella mexicana to be 14,008 bp in size, possessing the typical 37 metazoan genes. Multiple alternate stop codons are used, as are incomplete stop codons. Mitogenome size and nucleotide content is consistent with other helicoid species. Our analysis of gene order suggested that Helicoidea has undergone four mitochondrial rearrangements in the past. Two rearrangements were limited to tRNA genes only, and two involved protein coding genes.Entities:
Keywords: Gene rearrangement; convergence; homoplasy; mitochondria; phylogeny; tRNA
Year: 2016 PMID: 27833437 PMCID: PMC5096375 DOI: 10.3897/zookeys.626.9633
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Figure 1.Mitochondrial genome of UTRGV and McAllen illustrated with an image of the species holotype (ANSP 426031). Gene order and sizes are shown relative to one another, not including non-coding regions. Genes are color coded by H (black) or L (red) strand. IUPAC single letter codes are used to identify tRNA genes.
Taxonomic list of mitochondrial genomes used in the study.
| Taxonomy | GenBank | Reference |
|---|---|---|
| Clade | ||
| Superfamily | ||
| Family | ||
|
|
|
|
| Clade | ||
| Superfamily | ||
| Family | ||
|
|
|
|
| Superfamily | ||
| Family | ||
|
|
|
|
| Superfamily | ||
| Family | ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Family | ||
|
|
|
|
| Family | ||
|
|
|
|
|
|
|
|
|
|
|
|
| Family | ||
|
|
| this study |
|
|
| this study |
| Superfamily | ||
| Family | ||
|
|
| unpublished |
| Family | ||
|
|
|
|
| Superfamily | ||
| Family | ||
|
|
|
|
|
|
|
|
| Family | ||
|
|
|
|
| Superfamily | ||
| Family | ||
|
|
|
|
Mitochondrial genome annotation for UTRGV.
| Gene | Start | Stop | Length | Strand | Start codon | Stop codon | Anticodon |
|---|---|---|---|---|---|---|---|
|
| 1 | 1525 | 1525 | H | TTG | T | |
|
| 1526 | 1587 | 62 | H | TAC | ||
|
| 1589 | 2579 | 991 | H | |||
|
| 2580 | 2642 | 63 | H | TAG | ||
|
| 2640 | 2706 | 67 | H | TGG | ||
|
| 2706 | 2768 | 63 | H | TGC | ||
|
| 2769 | 3239 | 471 | H | GTG | TAA | |
|
| 3223 | 4893 | 1671 | H | TTG | TAG | |
|
| 4887 | 5768 | 882 | H | ATC | TAG | |
|
| 5768 | 6052 | 285 | H | GTG | TAA | |
|
| 6054 | 7148 | 1095 | H | ATT | TAG | |
|
| 7149 | 7212 | 64 | H | GTC | ||
|
| 7209 | 7265 | 57 | H | GCA | ||
|
| 7270 | 7331 | 62 | H | GAA | ||
|
| 7332 | 8003 | 672 | H | ATG | TAG | |
|
| 8007 | 8068 | 61 | H | TCC | ||
|
| 8063 | 8123 | 61 | H | GTG | ||
|
| 8131 | 8192 | 62 | H | GTA | ||
|
| 8186 | 8248 | 63 | H | TCA | ||
|
| 8338 | 8396 | 59 | L | TTG | ||
|
| 8397 | 8454 | 58 | L | TAA | ||
|
| 8458 | 8608 | 151 | L | ATG | T | |
|
| 8612 | 8670 | 59 | L | GTT | ||
|
| 8671 | 9322 | 652 | L | ATG | T | |
|
| 9323 | 9382 | 60 | L | TCG | ||
|
| 9383 | 9442 | 60 | L | TTC | ||
|
| 9443 | 10186 | 744 | L | |||
|
| 10187 | 10248 | 62 | L | CAT | ||
|
| 10250 | 10597 | 348 | L | TTG | TAA | |
|
| 10598 | 10665 | 68 | L | TGA | ||
|
| 10723 | 10776 | 54 | H | GCT | ||
|
| 10777 | 12100 | 1324 | H | ATG | T | |
|
| 12010 | 12162 | 62 | L | TGT | ||
|
| 12163 | 12958 | 796 | L | ATT | T | |
|
| 12985 | 13044 | 60 | H | GAT | ||
|
| 13048 | 13954 | 907 | H | ATG | T | |
|
| 13955 | 14008 | 61 | H | TTT |
Nucleotide and skew statistics for the mitochondrial genomes used.
| Whole genome composition | ||||||||
|---|---|---|---|---|---|---|---|---|
| Species | Size (bp) | A% | C% | G% | T% | A+T% | AT skew | GC skew |
|
| 15057 | 0,280 | 0,171 | 0,195 | 0,355 | 0,634 | -0,118 | 0,064 |
|
| 14238 | 0,313 | 0,145 | 0,164 | 0,379 | 0,692 | -0,095 | 0,062 |
|
| 14039 | 0,325 | 0,133 | 0,157 | 0,386 | 0,711 | -0,086 | 0,083 |
|
| 14130 | 0,328 | 0,138 | 0,155 | 0,379 | 0,707 | -0,073 | 0,059 |
|
| 13843 | 0,319 | 0,135 | 0,167 | 0,379 | 0,698 | -0,086 | 0,108 |
|
| 14100 | 0,262 | 0,189 | 0,213 | 0,336 | 0,598 | -0,125 | 0,058 |
|
| 15177 | 0,298 | 0,158 | 0,185 | 0,360 | 0,657 | -0,095 | 0,077 |
|
| 14610 | 0,258 | 0,166 | 0,219 | 0,358 | 0,615 | -0,162 | 0,137 |
|
| 14237 | 0,284 | 0,131 | 0,167 | 0,418 | 0,702 | -0,191 | 0,120 |
|
| 14060 | 0,308 | 0,136 | 0,172 | 0,384 | 0,692 | -0,110 | 0,116 |
|
| 14050 | 0,307 | 0,136 | 0,165 | 0,392 | 0,699 | -0,121 | 0,097 |
|
| 14029 | 0,295 | 0,144 | 0,182 | 0,379 | 0,674 | -0,125 | 0,118 |
|
| 15197 | 0,336 | 0,120 | 0,147 | 0,397 | 0,733 | -0,083 | 0,100 |
|
| 14008 | 0,289 | 0,126 | 0,188 | 0,398 | 0,686 | -0,159 | 0,198 |
|
| 14008 | 0,288 | 0,126 | 0,188 | 0,398 | 0,686 | -0,160 | 0,198 |
|
| 14149 | 0,325 | 0,129 | 0,153 | 0,393 | 0,718 | -0,094 | 0,083 |
|
| 14092 | 0,339 | 0,109 | 0,122 | 0,430 | 0,769 | -0,113 | 0,055 |
|
| 14078 | 0,326 | 0,123 | 0,155 | 0,397 | 0,722 | -0,098 | 0,116 |
Figure 2.Maximum likelihood phylogeny of protein coding genes. Analysis in IQTREE yielded a single tree (log likelihood = -89104.188) under the mtZOA+F+I+G4 model. Branch support >50% is shown based on 10,000 ultra-fast bootstrap replicates. , , and were recovered as monophyletic. Nodes A-E refer to rearrangements shown in Figure 4.
Figure 3.Maximum likelihood phylogeny of gene order. Analysis in MLGO yielded a single tree. Branch support >50% is shown based on 100 bootstrap replicates. and were recovered as monophyletic, but was not.
Figure 4.Ancestral gene order reconstructions for . Columns (A–E) correspond to labeled nodes in Figure 2. IUPAC single letter codes are used to identify tRNA genes. Rearrangements in red and blue are unique to . The convergent rearrangement seen in , , and is shown in yellow. The green rearrangement is unique to .