| Literature DB >> 28855603 |
Bin Li1,2,3, Furong Lin1,2,3, Ping Huang1,2,3, Wenying Guo1,2,3, Yongqi Zheng4,5,6.
Abstract
Decaisnea insignis is a wild resource plant and is used as an ornamental, medicinal, and fruit plant. High-throughput sequencing of chloroplast genomes has provided insight into the overall evolutionary dynamics of chloroplast genomes and has enhanced our understanding of the evolutionary relationships within plant families. In the present study, we sequenced the complete chloroplast genome of D. insignis and used the data to assess its genomic resources. The D. insignis chloroplast genome is 158,683 bp in length and includes a pair of inverted repeats of 26,167 bp that are separated by small and large single copy regions of 19,162 bp and 87,187 bp, respectively. We identified 83 simple sequence repeats and 18 pairs of large repeats. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited a high A/T content. The D. insignis chloroplast genome bias was skewed towards A/T on the basis of codon usage. A phylogenetic tree based on 82 protein-coding genes of 33 angiosperms showed that D. insignis was clustered with Akebia in Lardizabalaceae. Overall, the results of this study will contribute to better understanding the evolution, molecular biology and genetic improvement of D. insignis.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28855603 PMCID: PMC5577308 DOI: 10.1038/s41598-017-10409-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Chloroplast genome map of D. insignis. The genes drawn outside of the circle are transcribed clockwise, while those inside are counterclockwise. Small single copy (SSC), large single copy (LSC), and inverted repeats (IRa, IRb) are indicated.
Characteristics of the chloroplast genomes in three species.
| Species |
|
|
|
|---|---|---|---|
| Genome size (bp) | 158,339 | 157,817 | 158,683 |
| LSC (bp) | 87,057 | 86,543 | 87,187 |
| IR (bp) | 26,129 | 26,143 | 26,167 |
| SSC (bp) | 19,024 | 18,988 | 19,162 |
| Total number of genes | 113 | 113 | 113 |
| Protein coding genes | 79 | 79 | 79 |
| rRNA | 4 | 4 | 4 |
| tRNA | 30 | 30 | 30 |
| GC% | 38.7 | 38.7 | 38.5 |
Figure 2Codon content of 20 amino acid and stop codon of 82 coding genes of D. insignis chloroplast genome. Color of the histogram is the proportion of codon usage for amino acid and stop codon.
Figure 3The distribution, type and presence of simple sequence repeats(SSRs) in the chloroplast genome of D. insignis. (A) Number of different SSRs types. (B) Proportion of SSRs in the LSC, SSC, and IR regions. (C) Number of identified SSR motifs in different repeat class types.
Distribution and localization of tandem repeats in D. insignis chloroplast genome. F, forward repeat; P, palindrome repeat.
| No. | Size (bp) | Start position1 | Start position2 | Type | Location Region | E-value |
|---|---|---|---|---|---|---|
| 1 | 33 | 8813 | 37730 | F |
| 1.41E-05 |
| 2 | 30 | 8819 | 47631 | P |
| 5.53E-07 |
| 3 | 30 | 16853 | 16853 | P |
| 2.40E-05 |
| 4 | 30 | 37736 | 47631 | P |
| 6.73E-04 |
| 5 | 30 | 39077 | 39077 | P |
| 2.40E-05 |
| 6 | 73 | 40902 | 43126 | F |
| 1.33E-28 |
| 7 | 58 | 40917 | 43141 | F |
| 1.27E-21 |
| 8 | 39 | 45906 | 101569 | F |
| 2.34E-14 |
| 9 | 39 | 45906 | 144262 | P |
| 2.34E-14 |
| 10 | 30 | 66548 | 66595 | P |
| 2.40E-05 |
| 11 | 44 | 77100 | 77100 | P |
| 2.29E-17 |
| 12 | 32 | 85121 | 85121 | P |
| 3.84E-10 |
| 13 | 42 | 92063 | 92084 | F |
| 4.61E-14 |
| 14 | 42 | 92063 | 153744 | P |
| 4.61E-14 |
| 15 | 42 | 92084 | 153765 | P |
| 4.61E-14 |
| 16 | 40 | 117507 | 117507 | P |
| 5.86E-15 |
| 17 | 42 | 153744 | 153765 | F |
| 4.61E-14 |
| 18 | 31 | 153755 | 153776 | F |
| 1.54E-09 |
Figure 4Visualization alignment of chloroplast genome sequences of D. insignis, A. quinata and A. indica. VISTA based similarity graphical information portraying sequence identity of D. insignis with reference A. indica chloroplast genomes. Grey arrows above the alignment indicate the orientation of genes. Purple bars represent exons, blue ones represent introns, and pink ones represent non-coding sequences (CNS). A cut-off of 50% identity was used for the plots. The Y-scale axis represents the percent identity within 50–100%.
Figure 5Comparison of chloroplast genome borders of LSC, SSC, and IRs among three Lardizabalaceae species.
Figure 6Nonsynonymous substitution (dN), synonymous substitution (dS), and dN/dS values for individual genes or gene groups.
Figure 7Phylogenetic tree reconstruction of 33 taxa using maximum likelihood and bayesian inference based on concatenated sequences of 82 genes. ML topology was shown with ML bootstrap support value/Bayesian posterior probability given at each node.