| Literature DB >> 30271544 |
Ying Li1, Yongzhi Yang1, Le Yu1, Xin Du1, Guangpeng Ren1.
Abstract
Poor phylogenetic resolution and inconsistency of gene trees are major complications when attempting to construct trees of life for various groups of organisms. In this study, we addressed these issues in analyses of the genus Carpinus (hornbeams) of the Betulaceae. We assembled and annotated the chloroplast (cp) genomes (plastomes) of nine hornbeams representing main clades previously distinguished in this genus. All nine plastomes are highly conserved, with four regions, and about 158-160 kb long, including 121-123 genes. Phylogenetic analyses of whole plastome sequences, noncoding sequences, and the well-aligned coding genes resulted in high resolution of the sampled species in contrast to the failure based on a few cpDNA markers. Phylogenetic relationships in a few clades based only on the coding genes are slightly inconsistent with those based on the noncoding and total plastome datasets. Moreover, these plastome trees are highly incongruent with those based on bi-parentally inherited internal transcribed spacer (ITS) sequence variations. Such high inconsistencies suggest widespread occurrence of incomplete lineage sorting and hybrid introgression during diversification of these hornbeams.Entities:
Keywords: Carpinus; coding genes; complete chloroplast genome; phylogenetics
Year: 2018 PMID: 30271544 PMCID: PMC6157693 DOI: 10.1002/ece3.4414
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Gene map of the Carpinus betulus plastome, as an example of the nine investigated plastomes. Genes drawn outside of the circle are transcribed clockwise, while those inside the circle are transcribed counterclockwise. The typical small single copy (SSC), large single copy (LSC), and inverted repeats (IRa, IRb) are indicated [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 2Visualization of alignment of the nine Carpinus chloroplast genome sequences. VISTA‐based identity plots showing sequence identity between six sequenced cp genomes of Carpinus. Arrows above the alignment indicate genes with their orientation. A cutoff of 70% identity was used for the plots, the Y‐scale indicates the percent identity (50–100%), and the X‐axis indicates the coordinate in the chloroplast genome. Exons, rRNA/tRNA, and conserved noncoding sequences (CNS) are marked in purple, blue, and pink, respectively [Colour figure can be viewed at http://wileyonlinelibrary.com]
The Chloroplast assembly and annotation information for nine species of Carpinus
| Species | GenBank | Raw base (G) | Clean base (G) | Entire plastid size (bp) | LSC | SSC | IR (two copies) | Overall GC content (%) | No. of genes | No. of PCGs | No. of rRNA genes | No. of tRNA genes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 3.2 | 2.9 | 158865 | 87938 | 18800 | 52127 | 36.5 | 122 | 84 | 8 | 29 |
|
|
| 6.1 | 5.6 | 159222 | 88280 | 18825 | 52117 | 36.5 | 122 | 84 | 8 | 29 |
|
|
| 3.2 | 2.9 | 160583 | 88282 | 17167 | 55134 | 36.4 | 123 | 85 | 8 | 29 |
|
|
| 2.8 | 2.5 | 160151 | 88555 | 18513 | 53083 | 36.3 | 123 | 85 | 8 | 29 |
|
|
| 5.0 | 4.6 | 158626 | 87679 | 18797 | 52150 | 36.5 | 122 | 84 | 8 | 29 |
|
|
| 5.6 | 5.2 | 159640 | 88711 | 18795 | 52134 | 36.4 | 123 | 85 | 8 | 29 |
|
|
| 5.4 | 5.1 | 158681 | 87808 | 18810 | 52134 | 36.5 | 121 | 83 | 8 | 29 |
|
|
| 6.6 | 6.1 | 159505 | 88616 | 18760 | 52129 | 36.4 | 121 | 83 | 8 | 29 |
|
|
| 5.8 | 5.6 | 159484 | 88519 | 18816 | 52149 | 36.4 | 123 | 85 | 8 | 29 |
Note. LSC: long single copy; SSC: small single copy; PCGs: protein‐coding genes.
Summary of length and variability across different data partitions
| Locus | cp genome | LSC | SSC | PCGs | Non‐coding | ITS |
|---|---|---|---|---|---|---|
| Constant sites | 154718 | 84115 | 16575 | 67552 | 68000 | 547 |
| Parsimony informative sites | 262 | 193 | 49 | 66 | 190 | 53 |
| Variable sites | 1865 | 1395 | 321 | 506 | 1308 | 76 |
| Total sites | 156583 | 85510 | 16896 | 68058 | 69308 | 623 |
| % Parsimony informative sites | 0.17 | 0.23 | 0.30 | 0.10 | 0.27 | 8.5 |
Note. LSC: long single copy; SSC: small single copy; PCGs: protein‐coding genes.
Figure 3Phylogenetic trees of nine Carpinus species based on the three plastome datasets and nrITS sequences, with branch lengths based on results of the Bayesian analyses. Bayesian posterior probabilities (left) and maximum‐likelihood bootstrap values (right) are shown on each node. Different taxa of Carpinus and Corylus are marked by different colors. (a) Trees based on the chloroplast genome and noncoding regions; (b) trees based on protein coding genes and ITS sequences. *Indicates inconsistent topology based on ML and Bayesian analyses; vertical bars represent different species in the ITS tree [Colour figure can be viewed at http://wileyonlinelibrary.com]