| Literature DB >> 32282807 |
Xin Sun1,2, Daoyuan Yu3, Zhijing Xie2,4, Jie Dong5, Yinhuan Ding5, Haifeng Yao2,4, Penelope Greenslade6,7.
Abstract
Collembola are a basal group of Hexapoda renowned for both unique morphological characters and significant ecological roles. However, a robust and plausible phylogenetic relationship between its deeply divergent lineages has yet to be achieved. We carried out a mitophylogenomic study based on a so far the most comprehensive mitochondrial genome dataset. Our data matrix contained mitogenomes of 31 species from almost all major families of all four orders, with 16 mitogenomes newly sequenced and annotated. We compared the linear arrangements of genes along mitochondria across species. Then we conducted 13 analyses each under a different combination of character coding, partitioning scheme and heterotachy models, and assessed their performance in phylogenetic inference. Several hypothetical tree topologies were also tested. Mitogenomic structure comparison revealed that most species share the same gene order of putative ancestral pancrustacean pattern, while seven species from Onychiuridae, Poduridae and Symphypleona bear different levels of gene rearrangements, indicating phylogenetic signals. Tomoceroidea was robustly recovered for the first time in the presence of all its families and subfamilies. Monophyly of Onychiuroidea was supported using unpartitioned models alleviating LBA. Paronellidae was revealed polyphyletic with two subfamilies inserted independently into Entomobryidae. Although Entomobryomorpha has not been well supported, more than half of the analyses obtained convincing topologies by placing Tomoceroidea within or near remaining Entomobryomorpha. The relationship between elongate-shaped and spherical-shaped collembolans still remained ambiguous, but Neelipleona tend to occupy the basal position in most trees. This study showed that mitochondrial genomes could provide important information for reconstructing the relationships among Collembola when suitable analytical approaches are implemented. Of all the data refining and model selecting schemes used in this study, the combination of nucleotide sequences, partitioning model and exclusion of third codon positions performed better in generating more reliable tree topology and higher node supports than others.Entities:
Year: 2020 PMID: 32282807 PMCID: PMC7153868 DOI: 10.1371/journal.pone.0230827
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information of collembolan species used in this study.
| GenBank nos. | Order | Family | Subfamily | Species | PCGs | Genes | Size (bp) |
|---|---|---|---|---|---|---|---|
| MK014212 | Entomobryomorpha | Entomobryidae | Entomobryinae | 13 | 37 | 14,840 | |
| MK431895 | Entomobryomorpha | Entomobryidae | Heteromurinae | 13 | 37 | 14,883 | |
| MK431900 | Entomobryomorpha | Entomobryidae | Lepidocyrtinae | 13 | 37 | 14,698 | |
| KT985987 | Entomobryomorpha | Entomobryidae | Orchesellinae | 13 | 37 | 15,728 | |
| EU016195 | Entomobryomorpha | Entomobryidae | Orchesellinae | 13 | 37 | 14,924 | |
| MK431896 | Entomobryomorpha | Paronellidae | Paronellinae | 13 | 37 | 14,836 | |
| MK409685 | Entomobryomorpha | Paronellidae | Salininae | 13 | 37 | 14,788 | |
| NC_010533 | Entomobryomorpha | Isotomidae | Anurophorinae | 13 | 37 | 15,297 | |
| KX863671 | Entomobryomorpha | Isotomidae | Anurophorinae | 13 | 37 | 15,352 | |
| KU198392 | Entomobryomorpha | Isotomidae | Anurophorinae | 13 | 37 | 15,147 | |
| NC_024155 | Entomobryomorpha | Isotomidae | Isotominae | 13 | 37 | 15,338 | |
| MK423967 | Entomobryomorpha | Isotomidae | Pachyotominae | 11 | 28 | 9,518 | |
| MK431894 | Entomobryomorpha | Oncopoduridae | 13 | 37 | 14,808 | ||
| MK431898 | Entomobryomorpha | Tomoceridae | Lepidophorellinae | 13 | 36 | 15,518 | |
| MK423966 | Entomobryomorpha | Tomoceridae | Tomocerinae | 13 | 37 | 15,045 | |
| MK409686 | Poduromorpha | Hypogastruridae | 13 | 37 | 15,331 | ||
| AY191995 | Poduromorpha | Hypogastruridae | 13 | 37 | 15,075 | ||
| EU084034 | Poduromorpha | Neanuridae | Neanurinae | 13 | 37 | 16,312 | |
| EU124719 | Poduromorpha | Neanuridae | Frieseinae | 13 | 37 | 15,442 | |
| NC_006074 | Poduromorpha | Onychiuridae | Onychiurinae | 13 | 34 | 12,984 | |
| MK423968 | Poduromorpha | Onychiuridae | Onychiurinae | 13 | 37 | 15,213 | |
| NC_002735 | Poduromorpha | Onychiuridae | Tetrodontophorinae | 13 | 37 | 15,455 | |
| MK431897 | Poduromorpha | Tullbergiidae | Mesaphorurinae | 13 | 37 | 14,833 | |
| NC_006075 | Poduromorpha | Poduridae | 13 | 34 | 13,809 | ||
| MK431893 | Neelipleona | Neelidae | 13 | 34 | 13,858 | ||
| MK423965 | Symphypleona | Dicyrtomidae | Ptenothricinae | 13 | 37 | 15,152 | |
| MK423969 | Symphypleona | Katiannidae | 7 | 20 | 5,459 | ||
| MK423964 | Symphypleona | Sminthurididae | 13 | 35 | 14,161 | ||
| KY618680 | Symphypleona | Bourletiellidae | 13 | 37 | 14,794 | ||
| NC_010536 | Symphypleona | Sminthuridae | Sminthurinae | 13 | 37 | 14,817 | |
| MK431899 | Symphypleona | Sminthuridae | Sphyrothecinae | 13 | 37 | 15,141 |
Fig 1Comparison of linear arrangements of genes on mitochondrial genomes of collembolan species in this study.
A. Pancrustacea (putative ancestral) model in most species. B. Podura aquatica. C. Onychiuridae. D. Lipothrix lubbocki. E. Sminthurus viridis. F. Sminthurinus signatus. The transcriptional direction of mitochondrial genes is from left to right, the asterisk (*) below the genes indicates the opposite direction, the question marks indicates the unknown genes, the rearrangements are marked in red, the inverted triangle indicates the deletion of gene.
Fig 2Maximum likelihood phylogenetic tree inferred from partitioned nucleotide sequences of 15 genes, with third codon position excluded (15fna_13PCGs (site1+2)_2rrn (2partition)).
Bootstrap support values are shown in the nodes.
Support values for the major clades recovered by different datasets and analytical approaches. “N” means this was not supported.
| 13fna | 15fna | 13faa | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GHOST _GTR | CODON5_GHOST_GY | 13partition | 13partition_CODON5 | 13partition_site1+2 | GHOST _GTR | 15partition | 13PCGs(site1+2)_2rrn (2partition) | GHOST_LG | GHOST _mtART | PMSF | PMSF_mtART | 13partition | |
| Poduromorpha | N | N | 83.0 | N | 95.0 | 99.9 | 95.0 | 98.0 | N | 99.2 | 98.9 | N | 51.0 |
| Symphypleona | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 97.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| Entomobryomorpha | N | 49.8 | N | 30.9 | 51.0 | N | N | 50.0 | N | N | N | N | 29.0 |
| Symphypleona | N | N | N | N | N | N | N | N | N | 73.9 | 76.1 | N | N |
| Neelipleona at base | 100.0 | 98.8 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | N | N | 24.0 | 100.0 |
| Tomoceroidea + Poduromorpha | N | N | 72.0 | N | N | N | 68.0 | N | 21.5 | 68.1 | 91.1 | 78.4 | N |
| Tullbergiidae + Onychiuridae | N | N | 82.0 | N | 96.0 | 98.0 | 93.0 | 98.0 | N | 96.9 | 31.4 | N | 52.0 |
| Entomobryoidea + Isotomidae | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 97.0 | 100.0 | 100.0 | 99.9 | 100.0 | 100.0 | 100.0 |
| Symphypleona + (Entomobryoidea+Isotomidae) | 33.1 | N | 84.0 | N | N | 34.5 | 79.0 | N | N | N | N | N | N |
| Poduridae + Neanuridae | 82.0 | 99.6 | 87.0 | 100.0 | 96.0 | 72.2 | 91.0 | 97.0 | 98.5 | 87.0 | 87.9 | 74.0 | N |
| Tomoceridae | N | N | N | 100.0 | 81.0 | N | N | 86.0 | 18.5 | 91.2 | 85.1 | 88.5 | 86.0 |
| Hypogastruridae | N | N | N | 100.0 | N | N | N | N | N | N | N | N | N |
| Paronellidae | N | N | N | N | N | N | N | N | N | N | N | N | N |
| Sminthuridae | 100.0 | 100.0 | 100.0 | N | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| Onychiurinae | N | N | 77.0 | N | N | N | N | N | N | N | N | N | N |
Fig 3Maximum likelihood phylogenetic tree inferred from partitioned nucleotide sequences of 13 PCGs under codon 5 model (13fna_13parition_CODON5).
Bootstrap support values are shown in the nodes. Each coloured line linked a same taxon in current taxonomic system but separated in different branches.
Fig 4Maximum likelihood phylogenetic tree inferred from amino acid sequences of 13 PCGs under GHOST_mtART model (13faa_GHOST_mtART).
Bootstrap support values are shown in the nodes. Each coloured line linked a same taxon in current taxonomic system separated in different branches.
Results of tree topology tests of the analyses 13fna_13parition_CODON5 and 13faa_GHOST_mtART.
Seven hypothetical topologies were (A) best tree without any constrains; (B) Tomoceridae + Oncopoduridae; (C) Neelipleona + Symphypleona s. s.; (D) Tomoceroidea + (Entomobryoidea + Isotomidae); (E) Tomoceroidea + Poduromorpha; (F) Tullbergiidae + Onychiuridae; (G) Symphypleona s. s. + Entomobryomorpha.
| Hypotheses | logL | deltaL | bp-RELL | p-KH | p-SH | p-WKH | p-WSH | c-ELW | p-AU |
|---|---|---|---|---|---|---|---|---|---|
| -226076,295 | 0,000 | 0.9684 + | 0.9710 + | 1.0000 + | 0.9710 + | 0.9997 + | 0.9684 + | 0.9818 + | |
| -226439,654 | 363,359 | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | |
| -226591,485 | 515,191 | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | |
| -226159,904 | 83,610 | 0.0291 - | 0.029 - | 0.2168 + | 0.0290 - | 0.1332 + | 0.0291 - | 0.0295 - | |
| -226325,936 | 249,641 | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | |
| -226685,661 | 609,366 | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | 0.0000 - | |
| -226236,015 | 159,721 | 0.0025 - | 0.0035 - | 0.0216 - | 0.0035 - | 0.0156 - | 0.0025 - | 0.0029 - | |
| -116116,632 | 0,047 | 0.0000 - | 0.4240 + | 0.8651 + | 0.4240 + | 0.9231 + | 0.0294 + | 0.5346 + | |
| -116116,586 | 0,001 | 0.0018 - | 0.4468 + | 0.9604 + | 0.4468 + | 0.9457 + | 0.0307 + | 0.7470 + | |
| -116116,639 | 0,054 | 0.0093 - | 0.4160 + | 0.8695 + | 0.3319 + | 0.9636 + | 0.0292 + | 0.7458 + | |
| -116116,585 | 0,000 | 0.0196 + | 0.5532 + | 1.0000 + | 0.5772 + | 0.9632 + | 0.0307 + | 0.7241 + | |
| -116116,587 | 0,002 | 0.0374 + | 0.3935 + | 0.9447 + | 0.3935 + | 0.9060 + | 0.0307 + | 0.7467 + | |
| -116116,632 | 0,047 | 0.0648 + | 0.4228 + | 0.8619 + | 0.4228 + | 0.9494 + | 0.0294 + | 0.6709 + | |
| -116139,142 | 22,557 | 0.0144 + | 0.1280 + | 0.1313 + | 0.1280 + | 0.3050 + | 0.0226 + | 0.1397 + | |
deltaL: logL difference from the maximal logl in the set. bp-RELL: bootstrap proportion using RELL method [54]. p-KH: p-value of one sided Kishino-Hasegawa test [57]. p-SH: p-value of Shimodaira-Hasegawa test [58]. p-WKH: p-value of weighted KH test. p-WSH: p-value of weighted SH test. c-ELW: Expected Likelihood Weight [56]. p-AU: p-value of approximately unbiased (AU) test [55]. Plus signs denote the 95% confidence sets. Minus signs denote significant exclusion. All tests performed 10000 resamplings using the RELL method.