| Literature DB >> 26933490 |
Tara Grosemans1, Krystalynne Morris2, William Kelley Thomas2, Annelien Rigaux3, Tom Moens1, Sofie Derycke4.
Abstract
Species with seemingly identical morphology but with distinct genetic differences are abundant in the marine environment and frequently co-occur in the same habitat. Such cryptic species are typically delineated using a limited number of mitochondrial and/or nuclear marker genes, which do not yield information on gene order and gene content of the genomes under consideration. We used next-generation sequencing to study the composition of the mitochondrial genomes of four sympatrically distributed cryptic species of the Litoditis marina species complex (PmI, PmII, PmIII, and PmIV). The ecology, biology, and natural occurrence of these four species are well known, but the evolutionary processes behind this cryptic speciation remain largely unknown. The gene order of the mitochondrial genomes of the four species was conserved, but differences in genome length, gene length, and codon usage were observed. The atp8 gene was lacking in all four species. Phylogenetic analyses confirm that PmI and PmIV are sister species and that PmIII diverged earliest. The most recent common ancestor of the four cryptic species was estimated to have diverged 16 MYA. Synonymous mutations outnumbered nonsynonymous changes in all protein-encoding genes, with the Complex IV genes (coxI-III) experiencing the strongest purifying selection. Our mitogenomic results show that morphologically similar species can have long evolutionary histories and that PmIII has several differences in genetic makeup compared to the three other species, which may explain why it is better adapted to higher temperatures than the other species.Entities:
Keywords: Adaptation; Litoditis marina; Miocene; Wolbachia; cryptic speciation
Year: 2016 PMID: 26933490 PMCID: PMC4760989 DOI: 10.1002/ece3.1975
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Summary of the Illumina HiSeq data. The four cryptic species of the Litoditis marina complex are indicated as PmI, PmII, PmIII, and PmIV
| PmI | PmII | PmIII | PmIV | |
|---|---|---|---|---|
| N50 | 3117 | 1034 | 1890 | 1661 |
| Total reads | 52,067,680 | 41,399,444 | 52,823,308 | 42,346,932 |
| Matched Reads | 49,663,612 | 37,036,661 | 50,569,428 | 40,103,528 |
| Number of Contigs | 278,752 | 381,176 | 292,112 | 263,052 |
| Maximum Contig Length | 13,33,912 | 1,570,033 | 18,38,899 | 1,001,282 |
| Average length contigs | 1136 | 768 | 961 | 987 |
| GC content (in %) | 47.6 | 44.6 | 42.4 | 42.7 |
mtDNA characteristics of the four cryptic species (PmI, PmII, PmIII, and PmIV). Gene density was calculated by dividing the number of genes by the genome size without the AT‐rich region
| PmI | PmII | PmIII | PmIV | |
|---|---|---|---|---|
| Genome size (bp) | 13,766 | 13,855 | 14,481 | 13,909 |
| AT‐rich region (bp) | 378 | 479 | 953 | 543 |
| Genome size (bp) without AT‐rich region | 13,388 | 13,376 | 13,528 | 13,366 |
| GC content (%) | 21.2 | 20.4 | 20.6 | 21.3 |
| Coding regions | ||||
| Proportion of the genome (%) | 98.5 | 98.2 | 97.5 | 98.7 |
| No. genes | 36 | 36 | 36 | 36 |
| Gene density (gene/kb) | 2.7 | 2.7 | 2.7 | 2.7 |
| Noncoding regions | ||||
| Proportion of the genome (%) | 0.4 | 0.7 | 0.3 | 0.2 |
| Intergenic regions | ||||
| Proportion of the genome (%) | 1.1 | 1.1 | 2.2 | 1.1 |
Figure 1Mitochondrial genome of the four cryptic Litoditis species. All 12 protein‐coding genes are shown in yellow arrows, 2 ribosomal RNAs in red arrows, and the 22 transfer RNAs as red blocks. tRNAs are indicated by their single letter amino acid code. The 2 noncoding regions (AT‐rich region and the intergenic region) are shown in pink.
Length and position of the 12 PCGs, 2 rRNAs, and 22 tRNAs in the mitochondrial genomes of the four cryptic Litoditis marina species
| PmI | PmII | PmIII | PmIV | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Position (nt) | Length (nt) | Start/Stop codon or anticodon | Position (nt) | Length (nt) | Start/Stop codon or anticodon | Position (nt) | Length (nt) | Start/Stop codon or anticodon | Position (nt) | Length (nt) | Start/Stop codon or anticodon | ||||||
| tRNA | Ala | 22 | 77 | 56 | TGC | 1421 | 1476 | 56 | TGC | 834 | 889 | 56 | TGC | 228 | 283 | 56 | TGC |
| Gene | nad5 | 79 | 1662 | 1584 | ATT/TAA | 1477 | 3060 | 1584 | ATT/TAA | 892 | 2478 | 1587 | ATG/TAA | 286 | 1869 | 1584 | ATT/TAA |
| Gene | nad3 | 1663 | 1998 | 336 | ATT/TAA | 3063 | 3398 | 336 | ATT/TAA | 2481 | 2855 | 375 | ATT/TAA | 1870 | 2205 | 336 | ATT/TAA |
| rRNA | rrnL | 2000 | 2958 | 959 | 3400 | 4359 | 960 | 2818 | 3773 | 956 | 2207 | 3165 | 959 | ||||
| tRNA | His | 2959 | 3013 | 55 | GTG | 4360 | 4415 | 56 | GTG | 3774 | 3830 | 57 | GTG | 3166 | 3220 | 55 | GTG |
| Gene | coxII | 3017 | 3712 | 696 | ATT/TAA | 4419 | 5114 | 696 | ATT/TAA | 3834 | 4529 | 696 | ATA/TAA | 3224 | 3919 | 696 | ATA/TAA |
| tRNA | Gly | 3713 | 3767 | 55 | TCC | 5115 | 5170 | 56 | TCC | 4530 | 4584 | 55 | TCC | 3920 | 3974 | 55 | TCC |
| tRNA | Asp | 3768 | 3822 | 55 | GTC | 5171 | 5226 | 56 | GTC | 4585 | 4639 | 55 | GTC | 3975 | 4029 | 55 | GTC |
| Intergenic region | 4640 | 4845 | 206 | ||||||||||||||
| tRNA | Met | 3823 | 3882 | 60 | CAT | 5227 | 5286 | 60 | CAT | 4846 | 4905 | 60 | CAT | 4030 | 4089 | 60 | CAT |
| tRNA | Cys | 3884 | 3941 | 58 | GCA | 5289 | 5345 | 57 | GCA | 4906 | 4962 | 57 | GCA | 4090 | 4147 | 58 | GCA |
| Gene | coxI | 3945 | 5522 | 1578 | ATT/TAA | 5349 | 6926 | 1578 | ATT/TAA | 4969 | 6546 | 1578 | ATT/TAA | 4150 | 5727 | 1578 | ATT/TAA |
| Intergenic region | 5523 | 5669 | 147 | 6927 | 7075 | 149 | 6547 | 6631 | 85 | 5728 | 5873 | 146 | |||||
| Gene | nad4 | 5670 | 6929 | 1260 | ATA/TAA | 7076 | 8335 | 1260 | ATA/TAA | 6632 | 7891 | 1260 | ATA/TAA | 5874 | 7133 | 1260 | ATA/TAA |
| tRNA | Thr | 6900 | 6954 | 55 | TGT | 8306 | 8360 | 55 | TGT | 7862 | 7916 | 55 | TGT | 7104 | 7158 | 55 | TGT |
| Gene | coxIII | 6954 | 7721 | 768 | ATT/TAG | 8361 | 9128 | 768 | ATT/TAA | 7925 | 8695 | 771 | TTG/TAA | 7158 | 7925 | 768 | ATT/TAG |
| tRNA | Leu | 7722 | 7778 | 57 | TAG | 9129 | 9185 | 57 | TAG | 8693 | 8750 | 58 | TAG | 7926 | 7982 | 57 | TAG |
| Gene | cytb | 7785 | 8897 | 1113 | ATT/TAA | 9199 | 10,311 | 1113 | ATT/TAA | 8755 | 9867 | 1113 | ATT/TAA | 7989 | 9101 | 1113 | ATT/TAA |
| tRNA | Phe | 8898 | 8954 | 57 | GAA | 10,312 | 10,368 | 57 | GAA | 9868 | 9924 | 57 | GAA | 9102 | 9158 | 57 | GAA |
| tRNA | Gln | 8961 | 9015 | 55 | TTG | 10,375 | 10,430 | 56 | TTG | 9929 | 9983 | 55 | TTG | 9165 | 9219 | 55 | TTG |
| tRNA | Arg | 9017 | 9071 | 55 | ACG | 10,431 | 10,485 | 55 | ACG | 9984 | 10,038 | 55 | ACG | 9221 | 9275 | 55 | ACG |
| tRNA | Ile | 9072 | 9133 | 62 | GAT | 10,486 | 10,548 | 63 | GAT | 10,039 | 10,102 | 64 | GAT | 9276 | 9337 | 62 | GAT |
| Gene | nad2 | 9133 | 9978 | 846 | TTG/TAA | 10,548 | 11,393 | 846 | TTG/TAA | 10,103 | 10,948 | 846 | TTG/TAA | 9337 | 10,182 | 846 | TTG/TAA |
| tRNA | Ser | 9979 | 10,033 | 55 | TCT | 11,394 | 11,448 | 55 | TCT | 10,949 | 11,003 | 55 | TCT | 10,183 | 10,237 | 55 | TCT |
| tRNA | Leu | 10,034 | 10,088 | 55 | TAA | 11,449 | 11,503 | 55 | TAA | 11,004 | 11,058 | 55 | TAA | 10,238 | 10,292 | 55 | TAA |
| tRNA | Lys | 10,089 | 10,150 | 62 | TTT | 11,504 | 11,565 | 62 | TTT | 11,059 | 11,120 | 62 | TTT | 10,293 | 10,354 | 62 | TTT |
| Gene | atp6 | 10,153 | 10,752 | 600 | ATT/TAA | 11,568 | 12,170 | 603 | ATT/TAA | 11,124 | 11,723 | 600 | ATT/TAA | 10,357 | 10,956 | 600 | ATT/TAA |
| Gene | nad1 | 10,755 | 11,630 | 876 | TTG/TAA | 12,172 | 13,047 | 876 | TTG/TAA | 11,725 | 12,600 | 876 | TTG/TAA | 10,959 | 11,834 | 876 | TTG/TAA |
| tRNA | Tyr | 11,628 | 11,684 | 57 | GTA | 13,045 | 13,101 | 57 | GTA | 12,598 | 12,653 | 56 | GTA | 11,832 | 11,888 | 57 | GTA |
| tRNA | Asn | 11,685 | 11,741 | 57 | GTT | 13,102 | 13,158 | 57 | GTT | 12,654 | 12,710 | 57 | GTT | 11,889 | 11,945 | 57 | GTT |
| tRNA | Ser | 11,742 | 11,795 | 54 | Undef. | 13,161 | 13,216 | 56 | Undef. | 12,711 | 12,778 | 68 | TGA | 11,946 | 11,999 | 54 | Undef. |
| rRNA | rrnS | 11,795 | 12,490 | 696 | 13,217 | 13,855 | 639 | 12,769 | 13,461 | 693 | 11,999 | 12,695 | 697 | ||||
| tRNA | Glu | 12,495 | 12,551 | 57 | TTC | 49 | 105 | 57 | TTC | 13,466 | 13,522 | 57 | TTC | 12,700 | 12,756 | 57 | TTC |
| tRNA | Trp | 12,554 | 12,610 | 57 | TCA | 107 | 163 | 57 | TCA | 13,525 | 13,581 | 57 | TCA | 12,759 | 12,815 | 57 | TCA |
| Gene | nd4L | 12,610 | 12,843 | 234 | ATT/TAA | 163 | 393 | 231 | ATT/TAA | 13,582 | 13,815 | 234 | ATT/TAA | 12,815 | 13,048 | 234 | ATT/TAA |
| Gene | nad6 | 12,845 | 13,279 | 435 | ATA/TAA | 398 | 832 | 435 | ATA/TAA | 13,817 | 14,251 | 435 | ATA/TAA | 13,050 | 13,484 | 435 | ATA/TAA |
| tRNA | Val | 13,280 | 13,334 | 55 | TAC | 833 | 886 | 54 | TAC | 14,242 | 14,305 | 64 | TAC | 13,485 | 13,539 | 55 | TAC |
| tRNA | Pro | 13,335 | 13,388 | 54 | TGG | 887 | 940 | 54 | TGG | 14,306 | 14,360 | 55 | TGG | 13,540 | 13,593 | 54 | TGG |
| AT‐rich region | 13,389 | 13,766 | 378 | 942 | 1420 | 479 | 14,360 | 14,481 | 122 | 13,594 | 13,909 | 316 | |||||
Comparison of codon usage between the four cryptic species. Calculations are based on the 12 protein‐coding genes
| Codon | PM1 | PM2 | PM3 | PM4 | ||||
|---|---|---|---|---|---|---|---|---|
| Count | RSCU | Count | RSCU | Count | RSCU | Count | RSCU | |
| UUU(F) | 433 | 1.92 | 429 | 1.89 | 431 | 1.91 | 411 | 1.82 |
| UUC(F) | 19 | 0.08 | 25 | 0.11 | 20 | 0.09 | 40 | 0.18 |
| UUA(L) | 447 | 5.16 | 474 | 5.46 | 469 | 5.33 | 475 | 5.45 |
| UUG(L) | 40 | 0.46 | 28 | 0.32 | 22 | 0.25 | 15 | 0.17 |
| CUU(L) | 21 | 0.24 | 10 | 0.12 | 20 | 0.23 | 18 | 0.21 |
| CUC(L) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| CUA(L) | 10 | 0.12 | 8 | 0.09 | 17 | 0.19 | 15 | 0.17 |
| CUG(L) | 2 | 0.02 | 1 | 0.01 | 0 | 0 | 0 | 0 |
| AUU(I) | 284 | 1.91 | 281 | 1.92 | 287 | 1.91 | 280 | 1.93 |
| AUC(I) | 13 | 0.09 | 11 | 0.08 | 14 | 0.09 | 10 | 0.07 |
| AUA(M) | 193 | 1.85 | 190 | 1.84 | 204 | 1.83 | 189 | 1.81 |
| AUG(M) | 16 | 0.15 | 16 | 0.16 | 19 | 0.17 | 20 | 0.19 |
| GUU(V) | 121 | 2.09 | 123 | 2.09 | 110 | 1.91 | 136 | 2.31 |
| GUC(V) | 6 | 0.1 | 1 | 0.02 | 3 | 0.05 | 5 | 0.08 |
| GUA(V) | 99 | 1.71 | 105 | 1.79 | 115 | 2 | 89 | 1.51 |
| GUG(V) | 6 | 0.1 | 6 | 0.1 | 2 | 0.03 | 6 | 0.1 |
| UCU(S) | 110 | 2.3 | 90 | 1.9 | 87 | 1.82 | 114 | 2.38 |
| UCC(S) | 1 | 0.02 | 3 | 0.06 | 0 | 0 | 2 | 0.04 |
| UCA(S) | 46 | 0.96 | 57 | 1.2 | 68 | 1.42 | 38 | 0.79 |
| UCG(S) | 0 | 0 | 3 | 0.06 | 2 | 0.04 | 1 | 0.02 |
| CCU(P) | 57 | 2.81 | 63 | 3.11 | 65 | 3.21 | 57 | 2.81 |
| CCC(P) | 10 | 0.49 | 6 | 0.3 | 0 | 0 | 6 | 0.3 |
| CCA(P) | 12 | 0.59 | 12 | 0.59 | 16 | 0.79 | 18 | 0.89 |
| CCG(P) | 2 | 0.1 | 0 | 0 | 0 | 0 | 0 | 0 |
| ACU(T) | 93 | 2.74 | 97 | 2.77 | 72 | 2.15 | 94 | 2.76 |
| ACC(T) | 4 | 0.12 | 0 | 0 | 3 | 0.09 | 3 | 0.09 |
| ACA(T) | 36 | 1.06 | 41 | 1.17 | 56 | 1.67 | 36 | 1.06 |
| ACG(T) | 3 | 0.09 | 2 | 0.06 | 3 | 0.09 | 3 | 0.09 |
| GCU(A) | 82 | 3.12 | 89 | 3.24 | 68 | 2.64 | 84 | 3.11 |
| GCC(A) | 3 | 0.11 | 2 | 0.07 | 7 | 0.27 | 3 | 0.11 |
| GCA(A) | 18 | 0.69 | 18 | 0.65 | 27 | 1.05 | 18 | 0.67 |
| GCG(A) | 2 | 0.08 | 1 | 0.04 | 1 | 0.04 | 3 | 0.11 |
| UAU(Y) | 156 | 1.91 | 154 | 1.93 | 159 | 1.94 | 153 | 1.88 |
| UAC(Y) | 7 | 0.09 | 6 | 0.07 | 5 | 0.06 | 10 | 0.12 |
| UAA(*) | 10 | 1.82 | 11 | 2 | 11 | 2 | 10 | 1.82 |
| UAG(*) | 1 | 0.18 | 0 | 0 | 0 | 0 | 1 | 0.18 |
| CAU(H) | 51 | 1.79 | 55 | 1.86 | 54 | 1.89 | 52 | 1.82 |
| CAC(H) | 6 | 0.21 | 4 | 0.14 | 3 | 0.11 | 5 | 0.18 |
| CAA(Q) | 41 | 1.78 | 35 | 1.52 | 39 | 1.73 | 34 | 1.48 |
| CAG(Q) | 5 | 0.22 | 11 | 0.48 | 6 | 0.27 | 12 | 0.52 |
| AAU(N) | 153 | 1.9 | 157 | 1.94 | 160 | 1.92 | 151 | 1.88 |
| AAC(N) | 8 | 0.1 | 5 | 0.06 | 7 | 0.08 | 10 | 0.12 |
| AAA(K) | 98 | 1.78 | 98 | 1.77 | 94 | 1.68 | 96 | 1.75 |
| AAG(K) | 12 | 0.22 | 13 | 0.23 | 18 | 0.32 | 14 | 0.25 |
| GAU(D) | 64 | 1.97 | 63 | 1.97 | 60 | 1.9 | 63 | 2 |
| GAC(D) | 1 | 0.03 | 1 | 0.03 | 3 | 0.1 | 0 | 0 |
| GAA(E) | 73 | 1.82 | 69 | 1.75 | 67 | 1.72 | 69 | 1.7 |
| GAG(E) | 7 | 0.17 | 10 | 0.25 | 11 | 0.28 | 12 | 0.3 |
| UGU(C) | 43 | 2 | 43 | 2 | 43 | 1.95 | 42 | 1.95 |
| UGC(C) | 0 | 0 | 0 | 0 | 1 | 0.05 | 1 | 0.05 |
| UGA(W) | 73 | 1.97 | 72 | 1.95 | 73 | 1.97 | 72 | 1.95 |
| UGG(W) | 1 | 0.03 | 2 | 0.05 | 1 | 0.03 | 2 | 0.05 |
| CGU(R) | 30 | 3.87 | 31 | 4 | 29 | 3.87 | 30 | 3.87 |
| CGC(R) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| CGA(R) | 1 | 0.13 | 0 | 0 | 1 | 0.13 | 1 | 0.13 |
| CGG(R) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| AGU(S) | 140 | 2.92 | 144 | 3.04 | 107 | 2.23 | 136 | 2.83 |
| AGC(S) | 8 | 0.17 | 3 | 0.06 | 4 | 0.08 | 7 | 0.15 |
| AGA(S) | 72 | 1.5 | 72 | 1.52 | 102 | 2.13 | 76 | 1.58 |
| AGG(S) | 6 | 0.13 | 7 | 0.15 | 13 | 0.27 | 10 | 0.21 |
| GGU(G) | 147 | 3.25 | 154 | 3.42 | 139 | 3.18 | 159 | 3.55 |
| GGC(G) | 1 | 0.02 | 4 | 0.09 | 2 | 0.05 | 1 | 0.02 |
| GGA(G) | 24 | 0.53 | 15 | 0.33 | 28 | 0.64 | 15 | 0.34 |
| GGG(G) | 9 | 0.2 | 7 | 0.16 | 6 | 0.14 | 4 | 0.09 |
RSCU: Relative Synonymous Codon Usage. In brackets, the coding amino acid is given.
P‐distance values. Pairwise distance values were calculated for all protein‐coding genes, and as comparison, values between Caenorhabditis elegans and C. briggsae were calculated as well. Minimum and maximum distance percentage are shown. All ORFs is the value obtained from comparing all the 12 protein‐coding genes in one sequence each species
| Within | Within | Between | |||
|---|---|---|---|---|---|
| Min % | Max % | % | Min % | Max % | |
| atp6 | 0.03 | 0.07 | 0.13 | 0.12 | 0.15 |
| cytb | 0.05 | 0.08 | 0.13 | 0.16 | 0.17 |
| coxI | 0.06 | 0.08 | 0.13 | 0.14 | 0.16 |
| coxII | 0.07 | 0.09 | 0.11 | 0.15 | 0.16 |
| coxIII | 0.07 | 0.10 | 0.13 | 0.15 | 0.19 |
| nad1 | 0.07 | 0.12 | 0.16 | 0.16 | 0.19 |
| nad2 | 0.06 | 0.13 | 0.15 | 0.22 | 0.25 |
| nad3 | 0.06 | 0.14 | 0.19 | 0.16 | 0.22 |
| nad4 | 0.07 | 0.14 | 0.14 | 0.17 | 0.20 |
| nad4L | 0.02 | 0.10 | 0.10 | 0.14 | 0.17 |
| nad5 | 0.07 | 0.12 | 0.13 | 0.17 | 0.20 |
| nad6 | 0.06 | 0.16 | 0.18 | 0.20 | 0.25 |
| All ORFs | 0.06 | 0.11 | 0.14 | 0.17 | 0.18 |
Figure 2Phylogenetic relationship based on neighbor joining of the four cryptic Litoditis species. Nodes show bootstrap values of, from left to right, maximum likelihood, neighbor‐joining and maximum parsimony analysis. Caenorhabditis elegans and C. briggsae are used as outgroup taxa.
Figure 3Timing of the most recent common ancestor. The divergence time of C. elegans and C. briggsae 18 MYA was used as calibration point, based on mutation rates (Cutter 2001).
Figure 4Scatter plots of nonsynonymous (dN) to synonymous (dS) nucleotide substitutions in the protein‐coding genes between all pairwise comparisons of the Litoditis marina species. All PCGs were evolving under negative (purifying) selection (dN/dS < 1). Colors of symbols reflect degree of relatedness (white: recent sister species; gray: intermediate timing of relationship between species; black: oldest relationships between species).