| Literature DB >> 27446185 |
Yanci Yang1, Tao Zhou1, Dong Duan1, Jia Yang1, Li Feng1, Guifang Zhao1.
Abstract
Quercus is considered economically and ecologically one of the most important genera in the Northern Hemisphere. Oaks are taxonomically perplexing because of shared interspecific morphological traits and intraspecific morphological variation, which are mainly attributed to hybridization. Universal plastid markers cannot provide a sufficient number of variable sites to explore the phylogeny of this genus, and chloroplast genome-scale data have proven to be useful in resolving intractable phylogenetic relationships. In this study, the complete chloroplast genomes of four Quercus species were sequenced, and one published chloroplast genome of Quercus baronii was retrieved for comparative analyses. The five chloroplast genomes ranged from 161,072 bp (Q. baronii) to 161,237 bp (Q. dolicholepis) in length, and their gene organization and order, and GC content, were similar to those of other Fagaceae species. We analyzed nucleotide substitutions, indels, and repeats in the chloroplast genomes, and found 19 relatively highly variable regions that will potentially provide plastid markers for further taxonomic and phylogenetic studies within Quercus. We observed that four genes (ndhA, ndhK, petA, and ycf1) were subject to positive selection. The phylogenetic relationships of the Quercus species inferred from the chloroplast genomes obtained moderate-to-high support, indicating that chloroplast genome data may be useful in resolving relationships in this genus.Entities:
Keywords: Quercus; chloroplast genome; nucleotide substitution; phylogeny; plastid marker; positive selection; repeat
Year: 2016 PMID: 27446185 PMCID: PMC4923075 DOI: 10.3389/fpls.2016.00959
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Assembly information for the five Quercus species.
| Species | Locality | GenBank numbers | Assembly reads | Mean coverage |
|---|---|---|---|---|
| Baoji, Shaanxi, China | KT963087 | 298,797 | 230x | |
| Xi’an, Shaanxi, China | KU240007 | 617,114 | 481x | |
| Maoxian, Sichuan, China | KU240008 | 560,430 | 435x | |
| Xi’an, Shaanxi, China | KU240009 | 611,174 | 475x | |
| Yichang, Hubei, China | KU240010 | 209,578 | 163x |
Characteristics of Quercus chloroplast genomes.
| Size (bp) | 161,072 | 161,150 | 161,153 | 161,077 | 161,237 |
| LSC (bp) | 90,341 | 90,444 | 90,457 | 90,387 | 90,461 |
| SSC (bp) | 19,045 | 19,054 | 19,044 | 19,056 | 19,048 |
| IR (bp) | 51,686 | 51,652 | 51,652 | 51,634 | 51,728 |
| Protein-coding regions (bp) | 80,250 | 80,121 | 80,073 | 80,133 | 80,322 |
| Number of total genes | 134 | 134 | 134 | 134 | 134 |
| Number of protein-coding genes | 86 (7) | 86 (7) | 86 (7) | 86 (7) | 86 (7) |
| Number of tRNA genes | 40 (7) | 40 (7) | 40 (7) | 40 (7) | 40 (7) |
| Number of rRNA genes | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) |
| Overall GC content (%) | 36.81 | 36.83 | 36.83 | 36.78 | 36.80 |
| GC content in LSC (%) | 34.67 | 34.69 | 34.69 | 34.63 | 34.67 |
| GC content in SSC (%) | 30.88 | 30.90 | 30.92 | 30.83 | 30.86 |
| GC content in IR (%) | 42.73 | 42.77 | 42.77 | 42.77 | 42.74 |
Types and number of SSRs in cp genomes.
| Mononucleotide repeats | 84 | 85 | 86 | 87 | 85 |
| Dinucleotide repeats | 14 | 12 | 12 | 15 | 14 |
| Trinucleotide repeats | 4 | 3 | 3 | 3 | 3 |
| Tetranucleotide repeats | 6 | 3 | 4 | 6 | 6 |
| Pentanucleotide repeats | 2 | 2 | 1 | 1 | 2 |
| Hexanucleotide repeats | 1 | 0 | 0 | 0 | 1 |
| All types in complete cp genome | 111 | 105 | 106 | 112 | 111 |
| All types in protein-coding regions | 14 | 14 | 14 | 13 | 14 |
Numbers of nucleotide substitutions and indels in five complete cp genomes.
| / | 716 (337/379) | 715 (339/376) | 245 (109/136) | 190 (77/113) | |
| 98 | / | 16 (0/16) | 720 (351/369) | 677 (326/351) | |
| 97 | 4 | / | 719 (353/366) | 680 (328/352) | |
| 46 | 108 | 107 | / | 222 (106/116) | |
| 26 | 102 | 99 | 43 | / |
Numbers of synonymous and non-synonymous substitutions in CDS and ratios of Ts/Tv in complete cp genomes.
| / | 0.89 | 0.90 | 0.80 | 0.68 | |
| 111/117 | / | 0 | 0.95 | 0.93 | |
| 110/117 | 1/2 | / | 0.96 | 0.93 | |
| 44/46 | 120/125 | 119/126 | / | 0.91 | |
| 25/28 | 113/112 | 112/113 | 47/42 | / |
Model in ML and BI analysis.
| Best fit model | Model in ML | Model in BI | |
|---|---|---|---|
| LSC region | TVM+I+G | GTR+G | TVM+I+G |
| SSC region | TVM+I+G | GTR+G | TVM+I+G |
| IRB region | K81uf+I | GTR+G | K81uf+I |
| LSC + SSC region | TVM+I+G | GTR+G | TVM+I+G |
| LSC + SSC + IRB region | K81uf+I +G | GTR+G | K81uf+I +G |
| Complete cp genome | K81uf+I +G | GTR+G | K81uf+I +G |