| Literature DB >> 25365514 |
Yang Yang1, Yuanye Dang1, Dang Yuanye, Qing Li2, Li Qing, Jinjian Lu1, Lu Jinjian, Xiwen Li3, Li Xiwen, Yitao Wang1, Wang Yitao.
Abstract
Datura stramonium is a widely used poisonous plant with great medicinal and economic value. Its chloroplast (cp) genome is 155,871 bp in length with a typical quadripartite structure of the large (LSC, 86,302 bp) and small (SSC, 18,367 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,601 bp). The genome contains 113 unique genes, including 80 protein-coding genes, 29 tRNAs and four rRNAs. A total of 11 forward, 9 palindromic and 13 tandem repeats were detected in the D. stramonium cp genome. Most simple sequence repeats (SSR) are AT-rich and are less abundant in coding regions than in non-coding regions. Both SSRs and GC content were unevenly distributed in the entire cp genome. All preferred synonymous codons were found to use A/T ending codons. The difference in GC contents of entire genomes and of the three-codon positions suggests that the D. stramonium cp genome might possess different genomic organization, in part due to different mutational pressures. The five most divergent coding regions and four non-coding regions (trnH-psbA, rps4-trnS, ndhD-ccsA, and ndhI-ndhG) were identified using whole plastome alignment, which can be used to develop molecular markers for phylogenetics and barcoding studies within the Solanaceae. Phylogenetic analysis based on 68 protein-coding genes supported Datura as a sister to Solanum. This study provides valuable information for phylogenetic and cp genetic engineering studies of this poisonous and medicinal plant.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25365514 PMCID: PMC4217734 DOI: 10.1371/journal.pone.0110656
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Base composition in the Datura stramonium chloroplast genome.
| T (U) (%) | C (%) | A (%) | G (%) | Length (bp) | |
| LSC | 32.7 | 18.4 | 31.3 | 17.6 | 86,302 |
| SSC | 34.0 | 16.8 | 33.7 | 15.5 | 18,367 |
| IRa | 28.3 | 20.7 | 28.6 | 22.4 | 25,601 |
| IRb | 28.6 | 22.4 | 28.3 | 20.7 | 25,601 |
| Total | 31.5 | 19.2 | 30.6 | 18.6 | 155,871 |
| CDS | 31.2 | 17.9 | 30.6 | 20.3 | 80,316 |
| 1st position | 23.7 | 18.9 | 30.6 | 26.8 | 26,772 |
| 2nd position | 32.5 | 20.3 | 29.3 | 17.9 | 26,772 |
| 3rd position | 37.6 | 14.3 | 31.8 | 16.3 | 26,772 |
Figure 1Gene map of the Datura stramonium chloroplast genome.
Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray in the inner circle corresponds to GC content, while the lighter gray corresponds to AT content.
The codon–anticodon recognition pattern and relative synonymous codon usage (RSCU) for the Datura stramonium chloroplast genome.
| Amino acid | Codon | Count | RSCU | tRNA |
| Phe | UUU | 955 | 1.27 | |
| Phe | UUC | 551 | 0.73 | trnF-GAA |
| Leu | UUA | 875 | 1.84 | trnL-UAA |
| Leu | UUG | 579 | 1.22 | trnL-CAA |
| Leu | CUU | 612 | 1.29 | |
| Leu | CUC | 207 | 0.44 | |
| Leu | CUA | 378 | 0.80 | trnL-UAG |
| Leu | CUG | 197 | 0.42 | |
| Ile | AUU | 1105 | 1.47 | |
| Ile | AUC | 461 | 0.61 | trnI-GAU |
| Ile | AUA | 684 | 0.91 | trnI-CAU |
| Met | AUG | 622 | 1.00 | trn(f)M-CAU |
| Val | GUU | 526 | 1.46 | |
| Val | GUC | 182 | 0.50 | trnV-GAC |
| Val | GUA | 539 | 1.46 | trnV-UAC |
| Val | GUG | 197 | 0.55 | |
| Ser | UCU | 591 | 1.70 | |
| Ser | UCC | 341 | 0.98 | trnS-GGA |
| Ser | UCA | 407 | 1.17 | trnS-UGA |
| Ser | UCG | 210 | 0.60 | |
| Pro | CCU | 428 | 1.52 | |
| Pro | CCC | 206 | 0.73 | trnP-GGG |
| Pro | CCA | 334 | 1.18 | trnP-UGG |
| Pro | CCG | 162 | 0.57 | |
| Thr | ACU | 540 | 1.58 | |
| Thr | ACC | 267 | 0.78 | trnT-GGU |
| Thr | ACA | 410 | 1.20 | trnT-UGU |
| Thr | ACG | 148 | 0.43 | |
| Ala | GCU | 616 | 1.75 | |
| Ala | GCC | 245 | 0.70 | |
| Ala | GCA | 400 | 1.14 | trnA-UGC |
| Ala | GCG | 143 | 0.41 | |
| Tyr | UAU | 783 | 1.60 | |
| Tyr | UAC | 198 | 0.40 | trnY-GUA |
| Stop | UAA | 44 | 1.52 | |
| Stop | UAG | 25 | 0.86 | |
| His | CAU | 482 | 1.53 | |
| His | CAC | 147 | 0.47 | trnH-GUG |
| Gln | CAA | 705 | 1.49 | trnQ-UUG |
| Gln | CAG | 244 | 0.51 | |
| Asn | AAU | 1003 | 1.52 | |
| Asn | AAC | 317 | 0.48 | trnN-GUU |
| Lys | AAA | 1052 | 1.48 | trnK-UUU |
| Lys | AAG | 371 | 0.52 | |
| Asp | GAU | 860 | 1.60 | |
| Asp | GAC | 217 | 0.40 | trnD-GUC |
| Glu | GAA | 1036 | 1.47 | trnE-UUC |
| Glu | GAG | 370 | 0.53 | |
| Cys | UGU | 223 | 1.46 | |
| Cys | UGC | 82 | 0.54 | trnC-GCA |
| Stop | UGA | 18 | 0.62 | |
| Trp | UGG | 475 | 1.00 | trnW-CCA |
| Arg | CGU | 341 | 1.26 | trnR-ACG |
| Arg | CGC | 100 | 0.37 | |
| Arg | CGA | 394 | 1.46 | |
| Arg | CGG | 126 | 0.47 | |
| Arg | AGA | 487 | 1.80 | trnR-UCU |
| Arg | AGG | 174 | 0.64 | |
| Ser | AGU | 420 | 1.21 | |
| Ser | AGC | 119 | 0.34 | trnS-GCU |
| Gly | GGU | 578 | 1.26 | |
| Gly | GGC | 199 | 0.43 | trnG-GCC |
| Gly | GGA | 740 | 1.61 | trnG-UCC |
| Gly | GGG | 324 | 0.70 |
Figure 2Distribution of SSRs present in the 41 asteridae chloroplast genomes.
Repeated sequences in the Datura stramonium chloroplast genome.
| Repeat number | Size (bp) | Type | Location | Repeat Unit | Region |
| 1 | 50 | F |
|
| LSC |
| 2 | 39 | F |
|
| LSC, IRa |
| 3 | 39 | F |
|
| LSC, SSC |
| 4 | 39 | F | IGS ( |
| IRa, SSC |
| 5 | 40 | F |
|
| LSC |
| 6 | 34 | F | IGS ( |
| SSC |
| 7 | 35 | F |
|
| IRa |
| 8 | 30 | F |
|
| LSC, IRa |
| 9 | 31 | F |
|
| LSC |
| 10 | 31 | F |
|
| LSC |
| 11 | 31 | F | IGS ( |
| LSC |
| 12 | 56 | P |
|
| LSC |
| 13 | 48 | P | IGS ( |
| LSC |
| 14 | 39 | P |
|
| LSC, IRb |
| 15 | 39 | P |
|
| SSC, IRb |
| 16 | 30 | P |
|
| LSC |
| 17 | 34 | P | IGS ( |
| IRa, IRb |
| 18 | 35 | P |
|
| IRa, IRb |
| 19 | 32 | P | IGS ( |
| LSC |
| 20 | 30 | P |
|
| LSC |
| 21 | 44 | T | IGS ( |
| LSC |
| 22 | 40 | T | IGS ( |
| LSC |
| 23 | 33 | T | IGS ( |
| LSC |
| 24 | 45 | T | IGS ( |
| LSC |
| 25 | 35 | T | IGS ( |
| LSC |
| 26 | 50 | T | IGS ( |
| LSC |
| 27 | 41 | T |
|
| LSC |
| 28 | 47 | T |
|
| LSC |
| 29 | 58 | T |
|
| IRa |
| 30 | 62 | T | IGS ( |
| IRa |
| 31 | 66 | T | IGS ( |
| IRa |
| 32 | 39 | T |
|
| SSC |
| 33 | 55 | T |
|
| SSC |
*copy number.
‘F’ is forward, ‘P’ is palindromic, and ‘T’ is Tandem; IGS: Intergenic spacer; CDS: protein-coding regions.
Figure 3Comparison of four chloroplast genomes using mVISTA program.
Grey arrows and thick black lines above the alignment indicate genes with their orientation and the position of the IRs, respectively. A cut-off of 70% identity was used for the plots, and the Y-scale represents the percent identity between 50–100%. Genome regions are color-coded as protein-coding (exon), rRNA, tRNA and conserved noncoding sequences (CNS).
Figure 4The Ka/Ks ratio of Datura stramonium, Ipomoea purpurea, Nicotiana undulate and Solanum tuberosum for comparison with Nicotiana tabacum.
Figure 5Comparison of the borders of LSC, SSC and IR regions among six chloroplast genomes.
The IRb/SSC border extended into the ycf1 genes to create various lengths of ycf1 pseudogenes among six chloroplast genomes. The ycf1 pseudogene and the ndhF gene overlapped in the N. tabacum, S. lycopersicum and D. stramonium cp genomes from 3 bp to 20 bp, respectively. Various lengths of rps19 pseudogenes were created at the IRa/LSC borders except N. tabacum. This figure is not to scale.
Figure 6The ML phylogenetic tree of the asteridae clade based on 68 protein-coding genes.
The ML tree was obtained with the - lnL of 356757.56 using the GTR+I+G nucleotide substitution model. Number above each node are bootstrap support values. Cycas taitungensis was set as outgroup.