| Literature DB >> 30221094 |
Artur Burzyński1, Marianna Soroka2.
Abstract
Freshwater bivalves from the family Unionidae usually have two very divergent mitogenomes, inherited according to the doubly uniparental model. The early divergence of these two mitogenomic lineages gives a unique opportunity to use two mitogenomic data sets in a single phylogenetic context. However, the number of complete sequences of the maternally inherited mitogenomes of these animals available in GenBank greatly exceeds that of the paternally inherited mitogenomes. This is a problem for phylogenetic reconstruction because it limits the use of both mitogenomic data sets. Moreover, since long branch attraction phenomenon can bias reconstructions if only a few but highly divergent taxa are considered, the shortage of the faster evolving paternally inherited mitogenome sequences is a real problem. Here we provide, for the first time, complete sequences of the M mitogenomes sampled from Polish populations of two species: native Unio pictorum and invasive Sinanodonta woodiana. It increases the available set of mitogenomic pairs to 18 species per family, and allows unambiguous reconstruction of phylogenetic relationships among them. The reconstructions based on M and F mitogenomes which were separated for many millions of years, and subject to differing evolutionary dynamics, are fully congruent.Entities:
Keywords: DUI; Mitogenomics; Molecular clock; Phylogeny; mtDNA
Year: 2018 PMID: 30221094 PMCID: PMC6138038 DOI: 10.7717/peerj.5573
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Latin names, GenBank accession numbers and literature references for all mitogenomes used in comparative analysis.
Most of the M mitogenomes were referenced in the same publication as the F mitogenome from the same species, as indicated by “F” in the “Reference M” column. Some relevant mitogenomes present in GenBank do not have a formal publication. The two sources of S. woodiana mitogenomes are indicated by country code appended to the latin name.
| Latin name | Accession F | Reference F | Accession M | Reference M |
|---|---|---|---|---|
|
|
| – | ||
|
|
| |||
|
|
| – | ||
|
| – |
| – | |
|
|
| F | ||
|
|
| F | ||
|
|
| F | ||
|
|
| F | ||
|
| – |
| – | |
|
|
| F | ||
|
|
| F | ||
|
|
| – | ||
|
|
| This study | ||
|
|
| – | ||
|
|
| F | ||
|
|
| F | ||
|
|
| F | ||
|
|
| This study | ||
|
|
| F | ||
|
|
| F | ||
|
|
| F |
Figure 1Genetic map of the sequenced M-type mitogenomes.
The mitogenome of U. pictorum (A) is structurally very similar to all M-type mitogenomes from this family published to date. The mitogenome of S. woodiana (B) is more exceptional because of the two additional features labeled in red. All protein coding genes are labeled by the names of the encoded proteins, two rRNA genes are labeled 16S and 12S for large and small subunit, respectively. The trn genes are labeled by the one letter code of the respective amino acid. Direction of transcription is indicated by the position and direction of the arrows, with clockwise transcribed genes on the outside and anticlockwise transcribed genes on the inside of the circle representing the genome. The two inner circles represent local compositional bias, calculated in a 300 bp long window, in 25 bp long steps. The light gray parts of the first circle represent above average AT skew [(A–T)/(A+T)] while the dark gray parts of this circle represent below average AT skews. The inner, black circle represents local GC content. Both indices are calculated with the relation to mitogenome-wide averages which are, for S. woodiana: AT-skew -0.3166, GC content 0.3284, for U. pictorum: AT-skew -0.2828, GC-content 0.3396.
Figure 2Genetic distance between Chinese and Polish M-type mitogenomes of S. woodiana.
Uncorrected nucleotide p-distance calculated in a sliding window of 300 bp along the alignment of the two M-type mitogenomes of S. woodiana, one sampled in Poland, the other in China. Schematic annotation is showed above the plot, with the major genes labeled.
Figure 3Relationships between duplicated m-ORF sequences.
Phylogenetic reconstruction of the relationships between the m-ORFs of the seven closely related sequences derived from four species. The tree is based on Bayesian inference, the nucleotide alignment contained 429 bp. Posterior probability is shown at all nodes, as well as 95% confidence interval of node height (blue bars).
Figure 4Comparison of the new trn. genes.
Alignment of the three trn genes from the two M-type mitogenomes of S. woodiana. The top two lines represent the regular trnASP gene, the two bottom lines represent the typical trnMET gene. The two lines in the middle represent the novel trn gene. Sequences extracted from both M-type mitogenomes are shown, one from Poland (PL), one from China (CH). The position of anticodon sequence is boxed.
Figure 5Phylogenetic reconstruction of relationships within Unionidae.
Phylogenetic reconstruction of relationships within Unionidae based on the concatenated amino acid alignment of all 13 mitochondrially encoded proteins and nucleotide alignment of the two rRNA genes, from both M and F type mitogenomes (A). Six separate reconstructions were performed: three under BI and three under ML framework, separately for the F and M data sets as well as jointly for both data sets. The presented tree is a result of BI under joined data set but the topology of all three trees obtained under BI was the same. All nodes have posterior probability of 1.0. The 95% CI on node heights is shown as blue bars at nodes for the joined M+F analysis, as red bars above nodes for the analysis based on the M data set and as green bars below nodes for the analysis based on the F data set. The topology obtained under ML framework for M and for joined M+F data sets was the same, with 100% bootstrap support for most of the nodes. The nodes with less than 100% bootstrap support are indicated by the actual percentage numbers next to the CI bars, in the color corresponding to the used data set (red for M, green for F, and blue for M+F ). The two “zero” values in green indicate that the topology recovered by ML under the F data set was different, placing A. lanceolata outside the Unioninae clade with poor support, so that these nodes were not present. Relevant fragment of this alternative tree is also shown (B), again only nodes with less than 100% support are labeled. The yellow bar is the 95% CI on the MRCA of the apparent duplication of m-ORF within the S. woodiana lineage, transferred from Fig. 3.