| Literature DB >> 31192351 |
Dong Zhang1,2,3, Hong Zou1,2, Cong-Jie Hua1,2, Wen-Xiang Li1,2, Shahid Mahboob4,5, Khalid Abdullah Al-Ghanim4, Fahad Al-Misned4, Ivan Jakovlić6, Gui-Tang Wang1,2.
Abstract
The phylogeny of Isopoda, a speciose order of crustaceans, remains unresolved, with different data sets (morphological, nuclear, mitochondrial) often producing starkly incongruent phylogenetic hypotheses. We hypothesized that extreme diversity in their life histories might be causing compositional heterogeneity/heterotachy in their mitochondrial genomes, and compromising the phylogenetic reconstruction. We tested the effects of different data sets (mitochondrial, nuclear, nucleotides, amino acids, concatenated genes, individual genes, gene orders), phylogenetic algorithms (assuming data homogeneity, heterogeneity, and heterotachy), and partitioning; and found that almost all of them produced unique topologies. As we also found that mitogenomes of Asellota and two Cymothoida families (Cymothoidae and Corallanidae) possess inversed base (GC) skew patterns in comparison to other isopods, we concluded that inverted skews cause long-branch attraction phylogenetic artifacts between these taxa. These asymmetrical skews are most likely driven by multiple independent inversions of origin of replication (i.e., nonadaptive mutational pressures). Although the PhyloBayes CAT-GTR algorithm managed to attenuate some of these artifacts (and outperform partitioning), mitochondrial data have limited applicability for reconstructing the phylogeny of Isopoda. Regardless of this, our analyses allowed us to propose solutions to some unresolved phylogenetic debates, and support Asellota are the most likely candidate for the basal isopod branch. As our findings show that architectural rearrangements might produce major compositional biases even on relatively short evolutionary timescales, the implications are that proving the suitability of data via composition skew analyses should be a prerequisite for every study that aims to use mitochondrial data for phylogenetic reconstruction, even among closely related taxa.Entities:
Keywords: Cymothoida; GC skew; base composition skew; compositional heterogeneity; mitochondrial phylogenomics; replication origin inversion
Mesh:
Year: 2019 PMID: 31192351 PMCID: PMC6601869 DOI: 10.1093/gbe/evz121
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Taxonomy, Length (bp), Base Composition (%), and Skews of Mitogenomes Used in the Analysis
| Species | Suborder | Family | Acc. No. | Length | A + T | A | C | AT Skew | GC Skew |
|---|---|---|---|---|---|---|---|---|---|
| Isopoda | |||||||||
|
| Asellota | Asellidae | GU130252 | 13,639 | 61.9 | 31 | 21.3 | 0.002 | −0.122 |
|
| Cymothoida | Cymothoidae | MH396438 | 14,475 | 63.8 | 36 | 26.1 | 0.129 | −0.442 |
|
| Cymothoida | Cymothoidae | MF419233 | 15,440 | 72.7 | 37 | 18.7 | 0.026 | −0.375 |
|
| Cymothoida | Corallanidae | MF419232 | 14,616 | 72.8 | 38 | 18.5 | 0.055 | −0.354 |
|
| Cymothoida | Cirolanidae | GU130253 | 13,055 | 55.9 | 27 | 17.7 | −0.052 | 0.198 |
|
| Cymothoida | Cirolanidae | KU057374 | 14,965 | 58.7 | 27 | 17 | −0.093 | 0.175 |
|
| Cymothoida | Bopyridae | NC_037467 | 14,268 | 59.6 | 27 | 17.8 | −0.093 | 0.118 |
|
| Phreatoicidea | Amphisopidae | NC_013976 | 14,994 | 69.6 | 31 | 11.4 | −0.104 | 0.25 |
|
| Sphaeromatidea | Sphaeromatidae | GU130256 | 13,467 | 54.4 | 25 | 17.8 | −0.069 | 0.219 |
|
| Limnoriidea | Limnoriidae | NC_024054 | 16,515 | 66.3 | 30 | 13.6 | −0.104 | 0.188 |
|
| Valvifera | Idoteidae | DQ442915 | 14,247 | 61 | 28 | 16.3 | −0.076 | 0.163 |
|
| Valvifera | Chaetiliidae | GU130254 | 13,809 | 65.4 | 32 | 16.6 | −0.033 | 0.038 |
|
| Oniscidea | Trachelipodidae | MF187612 | 14,080 | 67.3 | 33 | 13.3 | −0.029 | 0.184 |
|
| Oniscidea | Porcellionidae | KX289582 | 14,103 | 65.6 | 31 | 13.4 | −0.067 | 0.224 |
|
| Oniscidea | Cylisticidae | KR013002 | 14,154 | 67.8 | 33 | 12.9 | −0.035 | 0.194 |
|
| Oniscidea | Porcellionidae | KX289584 | 14,078 | 60.5 | 28 | 14.9 | −0.09 | 0.248 |
|
| Oniscidea | Armadillidiidae | KX289585 | 13,812 | 69.7 | 33 | 12.5 | −0.045 | 0.172 |
|
| Oniscidea | Armadillidiidae | MF187611 | 13,943 | 68.1 | 33 | 13.4 | −0.043 | 0.16 |
|
| Oniscidea | Armadillidiidae | MF187614 | 13,932 | 71.5 | 34 | 11.8 | −0.043 | 0.174 |
|
| Oniscidea | Armadillidiidae | MF187613 | 13,955 | 71.3 | 34 | 11.8 | −0.039 | 0.179 |
|
| Oniscidea | Ligiidae | NC_008412 | 15,289 | 60.9 | 29 | 17 | −0.041 | 0.134 |
|
| |||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |||||||||
| Nonisopod Malacostraca and | |||||||||
|
| Decapoda | Pleocyemata | NC_037201 | 16,180 | 69.3 | 33 | 20.3 | −0.036 | −0.319 |
|
| Decapoda | Dendrobranchiata | NC_009626 | 15,990 | 67.7 | 33 | 19.2 | −0.026 | −0.192 |
|
| Decapoda | Pleocyemata | NC_036335 | 15,865 | 66.2 | 36 | 22.5 | 0.076 | −0.332 |
|
| Stomatopoda | Unipeltata | NC_006081 | 15,994 | 70.2 | 35 | 16.8 | −0.001 | −0.13 |
|
| Stomatopoda | Unipeltata | NC_007443 | 16,325 | 63.9 | 33 | 21.4 | 0.026 | −0.185 |
|
| Amphipoda | Senticaudata | NC_019653 | 14,355 | 76.9 | 38 | 11.7 | −0.025 | −0.014 |
|
| Mysida | NA | NC_027510 | 17,652 | 74.5 | 37 | 13.8 | −0.021 | −0.085 |
|
| Amphipoda | Senticaudata | NC_033360 | 14,370 | 67.6 | 33 | 20.3 | −0.019 | −0.251 |
|
| Euphausiacea | NA | NC_016184 | 16,898 | 72 | 36 | 16 | 0.004 | −0.145 |
|
| Xiphosurida | NA | NC_003057 | 14,985 | 67.6 | 38 | 22.7 | 0.111 | −0.399 |
. 1.—Mitochondrial phylogenomics of Isopoda (suborder information shown) reconstructed using partitioned nucleotide sequences of PCGs and rRNAs (NUC data set) and BI algorithm. A set of nine nonisopod Malacostraca species and Limulus polyphemus were used as outgroups (order information shown). The scale bar corresponds to the estimated number of substitutions per site. Bayesian posterior support values are shown next to corresponding nodes. Star sign indicates a putative origin of replication inversion scenario implied by the topology (see Discussion).
Summary of Some Key Features of Phylogenetic Analyses
| NUC | BI_P | BI_NP | ML_P | ML_NP | GHOST | Parsimony | PB_CAT |
|---|---|---|---|---|---|---|---|
| Basal | C+C | C+C | C+C | C+C | C+C | C+C | C+C |
| Cymothoida | Par | Par | Par | Par | Par | Par | Par |
| Sister-gr. | M+A | M+A | M+A | M+A | M+A | M+A | Malacostraca |
| Cym+Asel | + | + | + | + | + | + | + |
| Limnoriidea | G.o. | G.o. | G.o. | G.o. | G.o. | G.o. | * |
|
| |||||||
|
|
|
|
|
|
|
|
|
|
| |||||||
| Basal | Phreat. | C+C | Phreat. | Phreat. | Phreat. | C+C | Phreat. |
| Cymothoida | Par | Par | Par | Par | Par | Par | Par |
| Sister-gr. | M+A | M+A | Mysida | Mysida | Mysida | M+A | Mysida |
| Cym+Asel | +(s) | +(s) | +(s) | +(s) | +(s) | + | − |
| Limnoriidea | G.o. | G.o. | G.o. | G.o. | G.o. | G.o. | G.o.* |
|
| |||||||
|
|
|
|
|
|
| ||
|
| |||||||
| Basal | Asellota | Asellota | Asellota | Limnoriidea | Asellota | ||
| Cymothoida | Par | Par | Par | Mon | Par | ||
| Sister-gr. | Decapoda | Decapoda | Decapoda | Amphipoda | Amphipoda | ||
| Cym+Asel | − | − | − | − | − | ||
| Limnoriidea | Amphipoda | Amphipoda | Corallanidae | Basal | Cymothoida | ||
Note.—Basal, basal isopod clade; Cymothoida, monophyletic or paraphyletic; Sister-gr., sister-group to the Isopoda; Cym+Asel, does the topology exhibit the artifact of a close relationship of Cymothoidae+Corallanidae and Asellota (isopods with congruent skews), where +(s) indicates a sister-group relationship; Limnoriidea, sister-group to this taxon, where G.o. is Gyge ovalis and * indicates polyphyly; P, partitioned; NP, nonpartitioned data set; C + C, Cymothoidae+Corallanidae; M + A, sister-group Mysida+Amphipoda; Phreat., Phreatoicidea.
Amphipoda clustered within the Isopoda clade (paraphyletic isopods).
. 2.—A phylogram reconstructed using nonpartitioned NUC data set and an algorithm designed to address compositional heterogeneity: CAT-GTR (PB). Posterior Bayesian support values are shown. See figure 1 for other details.
. 3.—A phylogram reconstructed using amino acid data set (AAs; 13 PCGs) in combination with data partitioning strategy and BI algorithm. See figure 1 for other details.
. 4.—A phylogram reconstructed using AAs data set and CAT-GTR algorithm designed for heterogeneous data sets (PB). See figure 1 for other details.
. 5.—A phylogram inferred using the nuclear 18S gene and CAT-GTR algorithm (PB). See figure 1 for other details.
. 6.—Cumulative GC skews of the majority strands of a selected subset of mitogenomes used for phylogenetic analyses.