| Literature DB >> 30533315 |
Zhenyu Zhao1, Xin Wang1,2, Yi Yu3, Subo Yuan4, Dan Jiang5, Yujun Zhang6, Teng Zhang1, Wenhao Zhong1, Qingjun Yuan1, Luqi Huang1.
Abstract
Dioscorea L., the largest genus of the family Dioscoreaceae with over 600 species, is not only an important food but also a medicinal plant. The identification and classification of Dioscorea L. is a rather difficult task. In this study, we sequenced five Dioscorea chloroplast genomes, and analyzed with four other chloroplast genomes of Dioscorea species from GenBank. The Dioscorea chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted repeats separated by a large single-copy region, and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined, and the rapidly evolving chloroplast genome regions (trnK-trnQ, trnS-trnG, trnC-petN, trnE-trnT, petG-trnW-trnP, ndhF, trnL-rpl32, and ycf1) were detected. Phylogenetic relationships of Dioscorea inferred from chloroplast genomes obtained high support even in shortest internodes. Thus, chloroplast genome sequences provide potential molecular markers and genomic resources for phylogeny and species identification.Entities:
Keywords: Chloroplast genome; Dioscorea; Phylogeny; Single sequence repeats; Variable marker
Year: 2018 PMID: 30533315 PMCID: PMC6284424 DOI: 10.7717/peerj.6032
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Sampling and assembly information for the five Dioscorea species.
| Species | ID | Raw data no. | Mapped read no. | Precent of chloroplast genome reads (%) | Chloroplast gemome coverage (X) | Accession number |
|---|---|---|---|---|---|---|
| LJW01 | 69,648,118 | 428,514 | 0.62% | 838 | ||
| LSC09 | 77,185,326 | 2,467,928 | 3.20% | 4,834 | ||
| LAW08 | 53,889,722 | 1,140,614 | 2.12% | 2,235 | ||
| MHW01 | 81,562,406 | 3,050,140 | 3.74% | 5,944 | ||
| MHW08 | 62,610,816 | 1,119,774 | 1.79% | 2,192 |
Notes:
W, wild.
C, cultivated.
Characteristics of the chloroplast genomes of nine Dioscorea species.
| Genme features | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Size (bp) | 153,337 | 153,161 | 153,075 | 153,946 | 153,243 | 152,609 | 153,919 | 153,970 | 155,418 |
| LSC length (bp) | 83,517 | 83,414 | 83,226 | 83,979 | 83,431 | 82,777 | 83,865 | 83,950 | 85,600 |
| IR length (bp) | 25,478 | 25,464 | 25,499 | 25,529 | 25,489 | 25,513 | 25,576 | 25,491 | 25,484 |
| SSC length (bp) | 18,864 | 18,819 | 18,851 | 18,909 | 18,834 | 18,806 | 18,902 | 19,038 | 18,850 |
| Total genes | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 |
| Protein coding genes | 78 | 78 | 78 | 78 | 78 | 78 | 78 | 78 | 78 |
| tRNA genes | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 |
| rRNA genes | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 |
| Overall GC content (%) | 37.0 | 37.0 | 37.0 | 37.2 | 37 | 37.2 | 37.2 | 37.2 | 37.2 |
| GC content in LSC (%) | 34.8 | 34.8 | 34.8 | 35.0 | 34.8 | 34.9 | 35.0 | 35.1 | 35.2 |
| GC content in SSC (%) | 31.0 | 31.0 | 30.8 | 31.2 | 30.9 | 31.2 | 31.2 | 31.2 | 30.9 |
| GC content in IR (%) | 43.0 | 43.0 | 43.0 | 43.0 | 42.9 | 43.0 | 43.0 | 43.0 | 42.9 |
Figure 1Gene maps of chloroplast genomes of Dioscorea.
Genes on the inside of the large circle are transcribed clockwise and those on the outside are transcribed counter clockwise. The genes are color-coded based on their functions. The dashed area represents the GC composition of the chloroplast genome.
Figure 2Analysis of repeated sequences in nine Dioscorea species.
(A) Number of repeated sequences by length; (B) Number of types repeated three times in the nine chloroplast genomes.
Figure 3Analysis of simple sequence repeats (SSR) in the chloroplast genomes of nine Dioscorea species.
(A) Number of different SSR types detected in the nine genomes; (B) Number of identified SSR motifs in different repeat class types.
Figure 4Phylogenetic tree reconstruction using maximum likelihood, and Bayesian inference methods based on the complete chloroplast genome sequences.
ML topology shown with ML bootstrap support values/Bayesian posterior probability listed at each node.
Figure 5Sliding window analysis of the Dioscorea chloroplast genomes (window length: 800 bp; step size: 200 bp).
(A) Nucleotide diversity of A-clade dataset; (B) Nucleotide diversity of B-clade dataset. X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity of each window.
Figure 6Phylogeny of the nine Dioscorea species constructed using eight regions of highly variable sequences.
Numbers above nodes are support values with ML bootstrap values.