| Literature DB >> 31578450 |
Ueric José Borges de Souza1, Rhewter Nunes1, Cíntia Pelegrineti Targueta1, José Alexandre Felizola Diniz-Filho2, Mariana Pires de Campos Telles3,4.
Abstract
Stryphnodendron adstringens is a medicinal plant belonging to the Leguminosae family, and it is commonly found in the southeastern savannas, endemic to the Cerrado biome. The goal of this study was to assemble and annotate the chloroplast genome of S. adstringens and to compare it with previously known genomes of the mimosoid clade within Leguminosae. The chloroplast genome was reconstructed using de novo and referenced-based assembly of paired-end reads generated by shotgun sequencing of total genomic DNA. The size of the S. adstringens chloroplast genome was 162,169 bp. This genome included a large single-copy (LSC) region of 91,045 bp, a small single-copy (SSC) region of 19,014 bp and a pair of inverted repeats (IRa and IRb) of 26,055 bp each. The S. adstringens chloroplast genome contains a total of 111 functional genes, including 77 protein-coding genes, 30 transfer RNA genes, and 4 ribosomal RNA genes. A total of 137 SSRs and 42 repeat structures were identified in S. adstringens chloroplast genome, with the highest proportion in the LSC region. A comparison of the S. adstringens chloroplast genome with those from other mimosoid species indicated that gene content and synteny are highly conserved in the clade. The phylogenetic reconstruction using 73 conserved coding-protein genes from 19 Leguminosae species was supported to be paraphyletic. Furthermore, the noncoding and coding regions with high nucleotide diversity may supply valuable markers for molecular evolutionary and phylogenetic studies at different taxonomic levels in this group.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31578450 PMCID: PMC6775074 DOI: 10.1038/s41598-019-50620-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene map of the S. adstringens chloroplast genome. The genes drawn outside and inside of the circle are transcribed in clockwise and counterclockwise directions, respectively. Genes were colored based on their functional groups. The inner circle shows the quadripartite structure of the chloroplast: small single copy (SSC), large single copy (LSC) and a pair of inverted repeats (IRa and IRb). The gray ring marks the GC content with the inner circle marking a 50% threshold. Asterisks mark genes that have introns.
Chloroplast genome information from sampled Mimosoid species and the newly assembled S. adstringens.
| Species | Size (bp) | LSC (bp) | SSC (bp) | IR (bp) | GC (%) | Protein | RNA |
|---|---|---|---|---|---|---|---|
|
| 159,389 | 88,577 | 18,756 | 26,028 | 36.5% | 77 | 34 |
|
| 161,240 | 90,430 | 18,526 | 26,142 | 35.9% | 77 | 34 |
|
| 164,692 | 93,690 | 18,890 | 26,056 | 35.6% | 77 | 34 |
|
| 161,681 | 91,093 | 18,574 | 26,007 | 35.9% | 77 | 34 |
|
| 162,552 | 91,517 | 18,941 | 26,047 | 35.9% | 77 | 34 |
|
| 162,169 | 91,045 | 19,014 | 26,055 | 35,9% | 77 | 34 |
LSC Large Single Copy, SSC Small Single Copy, IR Inverted Repeat.
List of genes in the chloroplast genome of S. adstringens.
| Category | Gene groups | Name of genes |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA-dependent RNA polymerase | ||
| Ribosomal RNA genes | ||
| Transfer RNA genes | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NADH dehydrogenase | NADH dehydrogenase | |
| Cytochrome b/f complex | ||
| ATP synthase | ||
| RubisCo large subunit |
| |
| Other genes | Maturase K |
|
| Envelope membrane protein |
| |
| Subunit of acetyl-CoAcarboxylase |
| |
| C-type cytochrome synthesis gene |
| |
| Protease |
| |
| Conserved hypothetical chloroplast open reading frames |
1 – Gene with introns.
2 – Gene completely duplicated in the inverted repeat.
Codon usage for S. adstringens chloroplast genome.
| Amino Acid | Codon | Number | Fraction | Amino Acid | Codon | Number | Fraction |
|---|---|---|---|---|---|---|---|
| Ala | GCG | 138 | 0.12 | Leu | CTA | 302 | 0.14 |
| GCA | 321 | 0.27 | CTT | 462 | 0.21 | ||
| GCT | 571 | 0.48 | CTC | 135 | 0.06 | ||
| GCC | 168 | 0.14 | Lys | AAG | 245 | 0.24 | |
| Arg | AGG | 134 | 0.11 | AAA | 759 | 0.76 | |
| AGA | 365 | 0.30 | Met | ATG | 504 | 1 | |
| CGG | 92 | 0.07 | Phe | TTT | 803 | 0.67 | |
| CGA | 282 | 0.23 | TTC | 397 | 0.33 | ||
| CGT | 275 | 0.22 | Pro | CCG | 112 | 0.13 | |
| CGC | 87 | 0.07 | CCA | 249 | 0.29 | ||
| Asn | AAT | 737 | 0.78 | CCT | 341 | 0.39 | |
| AAC | 208 | 0.22 | CCC | 170 | 0.19 | ||
| Asp | GAT | 655 | 0.81 | Ser | AGT | 324 | 0.20 |
| GAC | 149 | 0.19 | AGC | 97 | 0.06 | ||
| Cys | TGT | 186 | 0.74 | TCG | 145 | 0.09 | |
| TGC | 67 | 0.26 | TCA | 312 | 0.19 | ||
| Gln | CAG | 164 | 0.23 | TCT | 458 | 0.29 | |
| CAA | 559 | 0.77 | TCC | 267 | 0.17 | ||
| Glu | GAG | 264 | 0.25 | Thr | ACG | 113 | 0.11 |
| GAA | 784 | 0.75 | ACA | 322 | 0.30 | ||
| Gly | GGG | 238 | 0.16 | ACT | 442 | 0.42 | |
| GGA | 593 | 0.39 | ACC | 188 | 0.18 | ||
| GGT | 531 | 0.35 | Trp | TGG | 370 | 1 | |
| GGC | 152 | 0.10 | Tyr | TAT | 622 | 0.80 | |
| His | CAT | 395 | 0.78 | TAC | 153 | 0.20 | |
| CAC | 112 | 0.22 | Val | GTG | 168 | 0.14 | |
| Ile | ATA | 564 | 0.31 | GTA | 460 | 0.38 | |
| ATT | 933 | 0.51 | GTT | 434 | 0.36 | ||
| ATC | 341 | 0.19 | GTC | 134 | 0.11 | ||
| Leu | TTG | 466 | 0.21 | End | TGA | 29 | 0.28 |
| TTA | 727 | 0.33 | TAG | 21 | 0.21 | ||
| CTG | 138 | 0.06 | TAA | 52 | 0.51 |
Figure 2Number and type of simple sequence repeats in S. adstringens chloroplast genome.
Figure 3Sliding window analysis of Mimosoid chloroplast genomes.
Figure 4The Ka/Ks ratio of Mimosoid chloroplast genomes for individual genes.
Figure 5Visualization of genome alignment of six Mimosoid chloroplast genomes using S. adstringens as reference. The vertical scale indicates the percent of identity, ranging from 50% to 100%. Coding regions are marked in purple, and non-coding as red. The horizontal axis indicates the coordinates within the chloroplast genome.
Figure 6Comparison of the junction sites between the Long Single Copy (LSC, light blue), Short Single Copy (SSC, light green) and Inverted Repeat (IRa and IRb, orange) regions among the six Mimosoid chloroplast genomes. JLB (IRb/LSC), JSB (IRb/SSC) JSA (SSC/IRa) and JLA (IRa/LSC) denote the junction sites between each corresponding regions on the genome.
Figure 7Maximum credibility tree reconstructed based on 73 conserved coding-protein genes from twenty-one species. All nodes of the phylogenetic tree are supported by 1.00 Bayesian inference posterior probability.