| Literature DB >> 29848319 |
Omri Bronstein1,2, Andreas Kroh3, Elisabeth Haring4,5.
Abstract
BACKGROUND: In Metazoa, mitochondrial markers are the most commonly used targets for inferring species-level molecular phylogenies due to their extremely low rate of recombination, maternal inheritance, ease of use and fast substitution rate in comparison to nuclear DNA. The mitochondrial control region (CR) is the main non-coding area of the mitochondrial genome and contains the mitochondrial origin of replication and transcription. While sequences of the cytochrome oxidase subunit 1 (COI) and 16S rRNA genes are the prime mitochondrial markers in phylogenetic studies, the highly variable CR is typically ignored and not targeted in such analyses. However, the higher substitution rate of the CR can be harnessed to infer the phylogeny of closely related species, and the use of a non-coding region alleviates biases resulting from both directional and purifying selection. Additionally, complete mitochondrial genome assemblies utilizing next generation sequencing (NGS) data often show exceptionally low coverage at specific regions, including the CR. This can only be resolved by targeted sequencing of this region.Entities:
Keywords: Control region; Echinoidea; Mitochondrial markers; Molecular phylogeny; NGS; Sea urchins
Mesh:
Substances:
Year: 2018 PMID: 29848319 PMCID: PMC5977486 DOI: 10.1186/s12862-018-1198-x
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Representation of echinoid complete mitochondrial genomes assembled from NGS data, showing gene annotation and coverage. The annotated genomes are represented by four echinoid species: Hemicentrotus pulcherrimus, Strongylocentrotus fragilis, Mesocentrotus franciscanus, and Strongylocentrotus intermedius, corresponding to GenBank accession numbers: KC898202, KC898198, KC898199, and KC898200, respectively. Annotations are given at the outer margin of the external circle. Concentric circles represent the corresponding coverage for each of the represented species mitogenomes. Data was obtained from Kober and Bernardi [86, 87]. Enlarged segment illustrates the position of the various primers used in the current study. Coverage was calculated in BRIG [88], after read mapping with Bowtie2 [89] (using the predefined alignment threshold “very-sensitive”). Annotations are based on those for H. pulcherrimus (GenBank accession no. NC_023771) and radial plots generated using BRIG
Detailed taxonomic placement and GenBank accession numbers of taxa included in the current study
| Order, Family | Genus | Species |
|
| CRA | Voucher No. |
|---|---|---|---|---|---|---|
| Cidaroida, Cidaridae |
| sp. | – | – | MG198151b,d | MNHN IE-2007-3745 |
| Cidaroida, Cidaridae |
|
| – | – | MG198152b,d | SMNH Ec 25,624 |
| Cidaroida, Cidaridae |
| sp. | – | – | MG198153b,d | MNHN IE-2007-3764 |
| Cidaroida, Cidaridae |
| sp. | – | – | MG198154b,d | MNHN IE-2013-8705 |
| Echinothurioida, Echinothuriidae |
|
| – | – | MG198163b,d | MNHN IE-2007-1143 |
| Echinothurioida, Echinothuriidae |
|
| – | – | MG198164b,d | SMNH Ec 25,628 |
| Diadematoida, Diadematidae |
|
| KX385835c,b | KX385835c,b | KX385835c,b | |
| Diadematoida, Diadematidae |
|
| – | – | MG198159b,d | SMNH Ec 25,437 |
| Diadematoida, Diadematidae |
|
| – | – | MG198160b,d | SMNH Ec 25,625 |
| Diadematoida, Diadematidae |
|
| – | – | MG198161b,d | SMNH Ec 25,626 |
| Diadematoida, Diadematidae |
|
| – | – | MG198162b,d | SMNH Ec 25,627 |
| Diadematoida, Diadematidae |
|
| KX385836c | KX385836c | KX385836c | |
| Micropygoida, Micropygidae |
|
| – | – | MG198165b,d | MNHN IE-2007-1152 |
| Salenioida, Saleniidae |
|
| – | – | MG198166b,d | MNHN IE-2007-3765 |
| Stomopneustoida, Glyptocidaridae |
|
| KX638403c,a | KX638403c,a | KX638403c,a | |
| Arbacioida, Arbaciidae |
|
| X80396c,b | X80396c,b | X80396c,b | |
| Camarodonta, Echinidae |
|
| KJ680295c | KJ680295c | KJ680295c | |
| Camarodonta, Echinidae |
|
| KF214257c,a | KF214257c,a | KF214257c,a | |
| Camarodonta, Echinometridae |
|
| KJ680291c | KJ680291c | KJ680291c | |
| Camarodonta, Echinometridae |
| sp. | – | – | MG198122b,d | SMNH OB ZNZ16 |
| Camarodonta, Echinometridae |
| sp. | – | – | MG198123b,d | SMNH OB ZNZ37 |
| Camarodonta, Echinometridae |
| sp. | – | – | MG198124b,d | SMNH OB ZNZ54 |
| Camarodonta, Echinometridae |
|
| KC479025c | KC479025c | KC479025c | |
| Camarodonta, Echinometridae |
|
| KJ680292c | KJ680292c | KJ680292c | |
| Camarodonta, Echinometridae |
|
| – | – | MG198150b,d | USNM E40502 |
| Camarodonta, Parechinidae |
|
| JX888466c,b | JX888466c,b | JX888466c,b | |
| Camarodonta, Parechinidae |
|
| KC490910c,b | KC490910c,b | KC490910c,b | |
| Camarodonta, Parechinidae |
|
| J04815c,b | J04815c,b | J04815c,b | |
| Camarodonta, Parechinidae |
|
| – | – | MG198127b,d | SMNH Ec 25,622 |
| Camarodonta, Strongylocentrotidae |
|
| KC898202c,a | KC898202c,a | KC898202c,a | |
| Camarodonta, Strongylocentrotidae |
|
| KC490911c,b | KC490911c,b | KC490911c,b | |
| Camarodonta, Strongylocentrotidae |
|
| KJ526170c,a | KJ526170c,a | KJ526170c,a | |
| Camarodonta, Strongylocentrotidae |
|
| JX263663c,b | JX263663c,b | JX263663c,b | |
| Camarodonta, Strongylocentrotidae |
|
| KC898201c,a | KC898201c,a | KC898201c,a | |
| Camarodonta, Strongylocentrotidae |
|
| KC490913c | KC490913c | KC490913c | |
| Camarodonta, Strongylocentrotidae |
|
| EU054306c,a | EU054306c,a | EU054306c,a | |
| Camarodonta, Strongylocentrotidae |
|
| AM900391c,b | AM900391c,b | AM900391c,b | |
| Camarodonta, Strongylocentrotidae |
|
| KC898198c,a | KC898198c,a | KC898198c,a | |
| Camarodonta, Strongylocentrotidae |
|
| KC490912c | KC490912c | KC490912c | |
| Camarodonta, Strongylocentrotidae |
|
| KC898200c,a | KC898200c,a | KC898200c,a | |
| Camarodonta, Strongylocentrotidae |
|
| AM900392c | AM900392c | AM900392c | |
| Camarodonta, Strongylocentrotidae |
|
| X12631c,b | X12631c,b | X12631c,b | |
| Camarodonta, Temnopleuridae |
|
| KJ680293c | KJ680293c | KJ680293c | |
| Camarodonta, Temnopleuridae |
|
| – | – | MG198126b,d | SMNH Ec 25,621 |
| Camarodonta, Temnopleuridae |
|
| – | – | MG198130b,d | SMNH Ec 25,623 |
| Camarodonta, Temnopleuridae |
|
| KU302104c | KU302104c | KU302104c | |
| Camarodonta, Temnopleuridae |
|
| KU302103c | KU302103c | KU302103c | |
| Camarodonta, Temnopleuridae |
|
| KP070768c,b | KP070768c,b | KP070768c,b | |
| Camarodonta, Temnopleuridae |
|
| KU302106c | KU302106c | KU302106c | |
| Camarodonta, Temnopleuridae |
|
| KU302105c | KU302105c | KU302105c | |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198125b,d | SMNH Ec 25,620 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198128b,d | AIM MA 121531.1 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198129b,d | AIM MA 121531.2 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198131b,d | NHMW-DNAtis_26362 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198132b,d | NHMW-DNAtis_26363 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198133b,d | NHMW-DNAtis_26364 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198134b,d | NHMW-DNAtis_26365 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198135b,d | NHMW-DNAtis_26366 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198136b,d | NHMW-DNAtis_26367 |
| Camarodonta, Toxopneustidae |
|
| KY268294c,a | KY268294c,a | KY268294c,a | |
| Camarodonta, Toxopneustidae |
|
| KJ680294c | KJ680294c | KJ680294c | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515261b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515262b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515263b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515264b | |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198149b,d | CAS 187197 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198137b,d | NHMW 2017/0125/0001 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198138b,d | NHMW 2017/0125/0003 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198139b,d | NHMW 2017/0125/0004 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198140b,d | NHMW Ev 20,497 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198141b,d | NHMW Ev 20,498 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198142b,d | NHMW Ev 20,499 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198143b,d | NHMW 2016/0329/0001 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198144b,d | NHMW 2016/0329/0002 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198145b,d | NHMW 2016/0329/0003 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198146b,d | NHMW 2016/0329/0004 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198147b,d | NHMW Ev 20,500 |
| Camarodonta, Toxopneustidae |
|
| – | – | MG198148b,d | NHMW Ev 20,501 |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515254b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515255b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515256b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515257b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515258b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515259b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515260b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515241b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515242b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515243b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515244b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515245b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515246b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515247b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515248b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515249b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515250b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515251b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515240b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515252b | |
| Camarodonta, Toxopneustidae |
|
| – | – | KY515253b | |
| Clypeasteroida, Clypeasteroidae |
| sp. | – | – | MG198155b,d | QM NO1_F4 |
| Clypeasteroida, Clypeasteroidae |
| sp. | – | – | MG198156b,d | QM NO1_F4B |
| Clypeasteroida, Clypeasteroidae |
|
| – | – | MG198157b,d | MNHN IE-2013-8702 |
| Clypeasteroida, Clypeasteroidae |
|
| – | – | MG198158b,d | MNHN IE-2013-8704 |
| Spatangoida, Loveniidae |
|
| FN562581c,b | FN562581c,b | FN562581c,b | |
| Spatangoida, Maretiidae |
|
| KC990834c,b | KC990834c,b | KC990834c,b | |
| Spatangoida, Pericosmidae |
|
| – | – | MG198167b,d | MNHN IE-2007-1138 |
| Spatangoida, Pericosmidae |
|
| – | – | MG198168b,d | MNHN IE-2013-8706 |
| Spatangoida, Pericosmidae |
|
| – | – | MG198169b,d | MNHN IE-2013-8707 |
| Spatangoida, Pericosmidae |
|
| – | – | MG198170b,d | MNHN IE-2013-8708 |
COI - cytochrome c oxidase subunit I, 16S rRNA – ribosomal RNA, CRA -control region area (including the control, adjacent tRNAs and a part of the 12S rRNA genes). Sequence type indicates whether the source sequence was generated by Sanger or next generation sequencing
asequence data generated by NGS
bsequence data generated by Sanger sequencing
cdata retrieved from complete mitochondrial genome sequence
dsequences generated in the current study
AIM Auckland War Memorial Museum, CAS California Academy of Sciences, MNHN Muséum national d'Histoire naturelle, NHMW Natural History Museum Vienna, QM Queensland Museum, SMNH Steinhardt Museum of Natural History, USNM Smithsonian Institution National Museum of Natural History
Sequences summary for the different datasets including best-fitting nucleotide substitution models
|
|
|
|
| |
|---|---|---|---|---|
| Number of unique sequences | 86 | 33 | 33 | 33 |
| MSA length (bp) | 405 | 562 | 558 | 405 |
| %G | 20.4 | 18.9 | 22.3 | 20.9 |
| %A | 30.9 | 26.0 | 31.0 | 30.7 |
| %C | 21.4 | 24.4 | 20.5 | 21.0 |
| %T | 27.3 | 30.8 | 26.2 | 27.5 |
| Pinv | 0.09384 | 0.34741 (codon position 1&2) | 0.36998 | 0.17559 |
| Overall mean K2P/p-distance | 0.16/0.14 | 0.21/0.18 | 0.16/0.143 | 0.16/0.14 |
| Best-fit model – (ML) | GTR + G | GTR + I + G | GTR + G | GTR + G |
| Best-fit model – (BI) | HKY + G | SYM + I + G | GTR + G | HKY + G |
ML Maximum Likelihood, BI Bayesian Inference, CR Control region, COI cytochrome c oxidase subunit 1, 16S 16S ribosomal RNA, MSA multiple sequence alignment, P proportion of invariant sites
Fig. 2Pairwise tree comparisons for phylogenetic trees based on commonly used mitochondrial markers. Trees include the two most commonly used phylogenetic mitochondrial markers: a fragment of the cytochrome c oxidase subunit 1 (a) gene and a fragment of the 16S ribosomal RNA (c) as well as the novel tRNAs and control region (e). To facilitate independent comparisons, the genetically inferred trees were restricted to the 35 publicly available complete echinoid mitochondrial genomes. Genera represented by more than one species were collapsed and are depicted by single branches. Supporting values (> 0.85 posterior probabilities and > 75% ML bootstrap values) are shown next to nodes. Topological comparisons between the genetically inferred trees and current classification (b, d, f) (see text for details) were visualised using Phylo.io [62]. Colour scale for the comparison metric (a variant of the Jaccard Index as implemented in Phylo.io) ranges from 0 (subtrees completely different) to 1 (subtree structure of the respective node is identical)
Phylogenetic tree comparisons using the duplication-aware algorithm TreeKO as implemented in the Python Environment for Tree Exploration (ETE)
| Source tree | Reference tree | Effective tree size | nRF | RF | maxRF | %src_br | %ref_br | Subtrees number | treekoD |
|---|---|---|---|---|---|---|---|---|---|
|
| Syst | 10.0 | 0.60 | 9 | 15 | 0.38 | 0.43 | 1 | 0.60 |
|
| Syst | 11.0 | 0.53 | 9 | 17 | 0.44 | 0.50 | 4 | 0.53 |
| CRA | Syst | 18.5 | 0.61 | 18.5 | 31 | 0.36 | 0.43 | 2 | 0.61 |
Genetically inferred trees based on: COI, 16S, CRA are compared to current echinoid classification (Syst) (see text for details)
RF Robinson-Foulds symmetric distance, nRF normalized RF distance (RF/maxRF); %src_br – frequency of edges in target tree found in the reference; %ref_br – frequency of edges in the reference tree found in the target; Subtrees – number of subtrees used for the comparison; treekoD – average p distance among all possible subtrees in the original target trees to the reference tree (lower treekoD values are indicative of higher similarity between trees)
Fig. 3Substitution saturation plot of the CRA marker based on the CRA-All dataset. The number of transitions (s) and transversions (v) is plotted against F84 genetic distance. A linear correlation is sustained for both transitions and transversions as expected in the absence of saturation
Substitution saturation analysis of the CRA region based on the index of substitution saturation as implemented in DAMBE6
| Marker dataset | Number of OTUa | Issb | Iss.cSymc | dfd | Iss.cAsymf | dfd | ||
|---|---|---|---|---|---|---|---|---|
|
| 4 | 0.240 | 0.789 | 366 | < 0.0001 | 0.757 | 366 | < 0.0001 |
| 8 | 0.259 | 0.743 | 366 | < 0.0001 | 0.631 | 366 | < 0.0001 | |
| 16 | 0.242 | 0.703 | 366 | < 0.0001 | 0.494 | 366 | < 0.0001 | |
| 32 | 0.260 | 0.692 | 366 | < 0.0001 | 0.363 | 366 | 0.0001 |
aNumber of sequences used in the random resampling
bindex of substitution saturation
ccritical value for a symmetrical tree topology
ddegrees of freedom
eprobability that Iss is significantly different from the critical value (Iss.cSym/Iss.cAsym)
fcritical value for an asymmetrical tree topology
Note: two-tailed tests are used
Fig. 4Phylogenetic tree reconstruction of the echinoid control region and adjacent areas (CRA). The BI tree presented is based on 86 unique haplotypes retrieved from a total of 110 sequences, 405 bp long (see Table 1 for details on the sequences used for this tree). Supporting values (> 0.5 posterior probabilities and > 50% ML bootstrap values) are shown above the nodes
Fig. 5Coverage (orange curve) and GC content (black curve; 200 bp sliding window, 10 bp step width) through the mitogenome of Hemicentrotus pulcherrimus (GenBank accession no. KC898202) illustrating moderate (R2 = 0.335), but highly significant correlation (t-test, p < 10− 100) between the two graphs. Note extreme drop of coverage towards the end of the CR (highlighted in grey), which coincides with a slight decrease in GC-content, but shows a much stronger negative excursion than other GC-poor areas in the mitogenome of this species (e.g. at nucleotide positions 4.4, 8.5, or 12.6 kb)