| Literature DB >> 31607787 |
Damien Daniel Hinsinger1,2, Joeri Sergej Strijk1,2,3.
Abstract
The genus Quercus L. contains several of the most economically important species for timber production in the Northern Hemisphere. It was one of the first genera described, but genetic diversity at a global scale within and amongst oak species remains unclear, despite numerous regional or species-specific assessments. To evaluate global plastid diversity in oaks, we sequenced the complete chloroplast of Quercus xanthoclada and compared its sequence with those available from other main taxonomic groups in Quercus. We quantify genomic divergence amongst oaks and performed a sliding window analysis to detect the most variable regions amongst members of the various clades, as well as divergent regions occurring in specific pairs of species. We identified private and shared SNPs amongst oaks species and sections and stress the need for a large global assessment of genetic diversity in this economically and ecologically important genus. Damien Daniel Hinsinger, Joeri Sergej Strijk.Entities:
Keywords: Fagaceae ; Quercus ; Diversification; Genomic Diversity; Genomic Resources; Phylogenomics; Plastid genome; SNPs; Sections; Taxonomy
Year: 2019 PMID: 31607787 PMCID: PMC6783484 DOI: 10.3897/phytokeys.132.36365
Source DB: PubMed Journal: PhytoKeys ISSN: 1314-2003 Impact factor: 1.635
Characteristics of the complete chloroplasts used in this study, showing the length, the GC content of each regions and the number of coding, tRNA and rRNA loci. LSC: Large Single Copy region; SSC: Small Single Copy region; IR: Inverted Repeats. Data from Du et al. (2015) (), Lu et al. (2016) (), Yin et al. (2018) (, GenBank data), Alexander and Woeste (2014) ().
| Species | length |
|
|
| GC total | GC | GC | GC | coding | tRNA | rRNA |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 160,988 | 90,353 | 18,955 | 25,840 | 36.9 | 34.8 | 31.1 | 42.8 | 90 | 31 | 8 |
|
| 160,825 | 90,371 | 18,732 | 25,861 | 36.87 | 34.7 | 31.2 | 42.6 | 87 | 29 | 8 |
|
| 160,921 | 90,258 | 18,980 | 25,841 | 36.9 | 34.8 | 31.3 | 42.7 | 89 | 39 | 8 |
|
| 160,415 | 89,856 | 18,935 | 25,812 | 37.0 | 36.6 | 31.2 | 42.8 | 78 | 29 | 8 |
|
| 161,304 | 90,542 | 19,025 | 25,869 | 36.8 | 34.6 | 30.9 | 42.7 | 89 | 41 | 8 |
Figure 1.Circular gene map of the plastid genome of . Genes drawn within the circle are transcribed clockwise, while those drawn outside are transcribed counter clockwise. Genes are colour-coded according to their functional groups.
Figure 2.ML phylogenetic tree of the nine selected plastomes in GenBank, plus the plastome of . The tree is rooted with . Bootstraps (1000 replicates) are shown at the nodes, values below 50% not shown. Scale in substitution per site.
Results of the Tajima relative rate test. 2 test statistic value indicated, p-value in parenthesis. p-values < 0.01 in bold. For significant rate heterogeneity comparisons, the species with slower evolutionary rate is indicated: Qsp: , Qal: , Qaq: , Qru: . For clarity, p-values lower than 0.005 are not indicated.
| Species |
|
|
|
|
|
|---|---|---|---|---|---|
|
| / | 17.37 – Qsp | 30.35 – Qal | 10.37 – Qaq | 32.64 – Qru |
|
| / | 4.17 (0.04) – Qal | 0.02 (0.89) | 3.47 (0.06) | |
|
| / | 4.99 (0.03)-Qal | 0.06 (0.81) | ||
|
| / | 3.30 (0.07) | |||
|
| / |
Estimates of p-distance amongst oak species. The number of base differences per site is shown. All positions containing gaps and missing data were eliminated. There were a total of 158480 positions in the final dataset.
| Species |
|
|
|
|
|
|---|---|---|---|---|---|
|
| / | 0.005 | 0.006 | 0.006 | 0.006 |
|
| / | 0.005 | 0.005 | 0.005 | |
|
| / | 0.005 | 0.004 | ||
|
| / | 0.005 | |||
|
| / |
Figure 3.Sliding window analysis of the whole plastomes of five oak species. (window length: 500 bp, step size: 250 bp). X-axis: position of the mid-point of the window, Y-axis: number of SNPs (solid line) and indels (dashed line) positions of each window in bp. Coding regions and directions of transcription are indicated by arrows, inverted repeats by grey areas. Putative barcode loci are highlighted for SNPs and indels. For readability, only a few major genes are indicated.
Figure 4.mVISTA percent identity plot comparing the four Quercus chloroplast genomes with as a reference. Vertical scale indicates the percentage of identity ranging from 50% to 100%. Coding regions are in blue and non-coding regions are in pink.
Figure 5.Venn diagram showing the private and shared SNPs amongst the five oak species. Each area is coloured according to the relative number of shared SNPs in this area.