| Literature DB >> 29601491 |
Wencai Wang1, Siyun Chen2, Xianzhi Zhang3.
Abstract
Eucommia ulmoides (E. ulmoides), the sole species of Eucommiaceae with high importance of medicinal and industrial values, is a Tertiary relic plant that is endemic to China. However, the population genetics study of E. ulmoides lags far behind largely due to the scarcity of genomic data. In this study, one complete chloroplast (cp) genome of E. ulmoides was generated via the genome skimming approach and compared to another available E. ulmoides cp genome comprehensively at the genome scale. We found that the structure of the cp genome in E. ulmoides was highly consistent with genome size variation which might result from DNA repeat variations in the two E. ulmoides cp genomes. Heterogeneous sequence divergence patterns were revealed in different regions of the E. ulmoides cp genomes, with most (59 out of 75) of the detected SNPs (single nucleotide polymorphisms) located in the gene regions, whereas most (50 out of 80) of the indels (insertions/deletions) were distributed in the intergenic spacers. In addition, we also found that all the 40 putative coding-region-located SNPs were synonymous mutations. A total of 71 polymorphic cpDNA fragments were further identified, among which 20 loci were selected as potential molecular markers for subsequent population genetics studies of E. ulmoides. Moreover, eight polymorphic cpSSR loci were also developed. The sister relationship between E. ulmoides and Aucuba japonica in Garryales was also confirmed based on the cp phylogenomic analyses. Overall, this study will shed new light on the conservation genomics of this endangered plant in the future.Entities:
Keywords: Eucommia ulmoides; chloroplast genome; heterogeneous divergence; mutation hotspots; whole-genome comparison
Mesh:
Substances:
Year: 2018 PMID: 29601491 PMCID: PMC5979487 DOI: 10.3390/ijms19041037
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Comparison between the newly and previously sequenced chloroplast genomes of Eucommia ulmoides.
| Item | This Study | KU204775 |
|---|---|---|
| Chloroplast genome size (bp) | 163,586 | 163,341 |
| LSC a length (bp) | 86,764 | 86,592 |
| SSC b length (bp) | 14,166 | 14,149 |
| IRa/IRb c length (bp) | 31,328 | 31,300 |
| Number of genes (unique genes) | 136 (115) | 136 (115) |
| Number of protein-coding genes (unique genes) | 89 (80) | 89 (80) |
| Number of tRNA genes (unique genes) | 39 (31) | 39 (31) |
| Number of rRNA genes (unique genes) | 8 (4) | 8 (4) |
| GC d content (%) | 38.33% | 38.34% |
| Protein-coding regions (%) | 51.91% | 51.99% |
a LSC, large single-copy region; b SSC, small single-copy region; c IRa/IRb, two identical inverted repeat regions a/b; d GC, Guanine and Cytosine.
Figure 1Conserved chloroplast genome structure in Eucommia ulmoides. (A) Pairwise chloroplast genome alignments derived from Multiple Alignment using Fast Fourier Transform (MAFFT) program. The sequence identity is indicated on the top. Label KU204775.1 represents the E. ulmoides chloroplast genome retrieved from GenBank, while label E. ulmoides indicates the newly sequenced genome in this study. (B) Pairwise chloroplast genome alignments derived from MAUVE software.
DNA insertions and deletions with more than 10 nucleotides in the chloroplast genomes of Eucommia ulmoides.
| No. | Size (bp) | Start Position | Location | Type |
|---|---|---|---|---|
| 1 | 56 | 6851 | insertion | |
| 2 | 27 | 7006 | insertion | |
| 3 | 45 | 7196 | insertion | |
| 4 | 13 | 12,693 | insertion | |
| 5 | 23 | 12,912 | insertion | |
| 6 | 111 | 13,312 | insertion | |
| 7 | 12 | 13,471 | insertion | |
| 8 | 32 | 24,279 | insertion | |
| 9 | 12 | 26,615 | insertion | |
| 10 | 11 | 51,194 | insertion | |
| 11 | 17 | 52,547 | insertion | |
| 12 | 40 | 57,075 | insertion | |
| 13 | 18 | 64,506 | insertion | |
| 14 | 12 | 73,717 | insertion | |
| 15 | 14 | 127,486 | insertion | |
| 16 | 16 | 4673 | deletion | |
| 17 | 44 | 24,700 | deletion | |
| 18 | 31 | 41,767 | deletion | |
| 19 | 44 | 51,109 | deletion | |
| 20 | 90 | 62,865 | deletion |
Figure 2Mutational events (SNPs and indels) detected across the chloroplast genome of Eucommia ulmoides. SNPs (single nucleotide polymorphisms) indicate nucleotide substitutions and indels represent nucleotide insertions and deletions. The homologous loci are oriented according to their locations in the chloroplast genome.
Figure 3Percentage of variable characters (SNPs and indels) in polymorphic chloroplast loci in Eucommia ulmoides. The homologous loci are oriented according to their locations in the chloroplast genome.
The 20 chloroplast DNA fragments with relative high genetic divergences identified in Eucommia ulmoides.
| Region | Aligned Length (bp) | No. VCs a | Percentage of VCs (%) |
|---|---|---|---|
| 234 | 3 | 1.28 | |
| 343 | 4 | 1.17 | |
| 369 | 3 | 0.81 | |
| 266 | 2 | 0.75 | |
| 210 | 1 | 0.48 | |
| 1062 | 5 | 0.47 | |
| 651 | 3 | 0.46 | |
| 1502 | 6 | 0.40 | |
| 261 | 1 | 0.38 | |
| 531 | 2 | 0.38 | |
| 546 | 2 | 0.37 | |
| 280 | 1 | 0.36 | |
| 1170 | 4 | 0.34 | |
| 881 | 3 | 0.34 | |
| 325 | 1 | 0.31 | |
| 338 | 1 | 0.30 | |
| 357 | 1 | 0.28 | |
| 359 | 1 | 0.28 | |
| 1521 | 4 | 0.27 | |
| 384 | 1 | 0.26 |
a VCs: variable characters, including SNPs and indels.
The polymorphic chloroplast SSRs identified in Eucommia ulmoides.
| No. | SSR Repeat Motif | Length Variation (bp) | Location | Region a |
|---|---|---|---|---|
| 1 | (G) | 10–11 | LSC | |
| 2 | (A) | 12–15 | LSC | |
| 3 | (A) | 12–13 | IRb | |
| 4 | (A) | 13–14 | IRa | |
| 5 | (T) | 10–14 | LSC | |
| 6 | (T) | 10–11 | LSC | |
| 7 | (T) | 12–13 | IRa | |
| 8 | (T) | 14–15 | LSC |
a LSC, large single-copy region; IRa/IRb, two identical inverted repeat regions a/b.
Figure 4Maximum likelihood (ML) tree for 34 taxa based on 80 unique plastid protein-coding genes of Eucommia ulmoides. Values above the branches represent maximum parsimony bootstrap (MPBS)/maximum likelihood bootstrap (MLBS)/Bayesian inference posterior probability (PP). The newly sequenced Eucommia ulmoides chloroplast genome is indicated by red color and the previously published E. ulmoides chloroplast genome is followed by its GenBank accession number KU204775.