Vanessa Santos, Cícero Almeida.
Abstract
This study reports the complete chloroplast genome sequences of three Spondias species. The genomes of Spondias tuberosa, Spondias bahiensis, and Spondias mombin were sequenced with Illumina technology and assembled by combining de novo methods with a reference-guided assembly, using Sapindus mukorossi as the reference. The genomes of S. tuberosa, S. bahiensis, and S. mombin had 162,036, 162,218, and 162,302 bp, respectively. Each genome contained 130 genes, including 34-35 tRNAs and 4 rRNAs. The results revealed synteny among the genomes, with high conservation of gene order, gene content, and GC content. The inverted repeat regions (IRA and IRB) and the large and small single-copy regions were very similar among the three genomes. The phylogenomic analysis recovered a topology similar to that of previous studies based on partial chloroplast sequences, in which S. mombin was the first diverging lineage and S. tuberosa and S. bahiensis were derived, indicating that phylogenetic analyses using partial or complete chloroplast genomes produce similar results. In summary, (1) we present the first complete chloroplast genomes for the genus Spondias, (2) phylogenies inferred from the complete chloroplast genomes revealed a robust phylogenetic topology for Spondias, and (3) gene order, content, and orientation in Spondias are highly conserved.
Year: 2019 PMID: 30856242 PMCID: PMC6428118 DOI: 10.1590/1678-4685-GMB-2017-0265
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Chloroplast genomes sequenced in this study, and others utilized as reference or out-group in phylogenetic analysis.
| Taxon | GenBank | References |
|---|---|---|
| | KU756562 | This study |
| | KU756561 | This study |
| | KY828469 | This study |
| | KY549635 | unpublished |
| | KM454982 | Yang |
| | KY635882 | Rabah |
| | KY635877 | Rabah |
| | KX447140 | Lee |
| | KU756561 | Khan |
Summary of the chloroplast genome characteristics within the Anacardiaceae family: genome size (bp), GC content (%), large single-copy region, LSC (bp), small single-copy region, SSC (bp), inverted repeat, IR (bp), number of protein-coding genes, number of tRNAs, and number of rRNAs.
| Species | Size (bp) | GC (%) | LSC (bp) | SSC (bp) | IR (bp) | Genes | tRNAs | rRNAs |
|---|---|---|---|---|---|---|---|---|
| | 162,039 | 37.7 | 89,453 | 18,369 | 27,139 | 130 | 35 | 4 |
| | 162,218 | 37.7 | 89,606 | 18,381 | 27,156 | 130 | 34 | 4 |
| | 162,302 | 37.6 | 89,938 | 18,094 | 27,135 | 130 | 35 | 4 |
| | 160,674 | 37.9 | 88,236 | 19,086 | 26,676 | 126 | 37 | 4 |
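The region lengths in this table can be cross-checked against the reported genome sizes. A chloroplast genome with the usual quadripartite layout has one LSC, one SSC, and two IR copies, so the size should equal LSC + SSC + 2 × IR. The Python sketch below applies that arithmetic to the values as printed. The row labels and the function name `quadripartite_residual` are placeholders introduced here, because the species names did not survive in this copy of the table.

```python
# Values copied from the table above, in row order:
# (label, genome size, LSC, SSC, IR), all in bp.
# The labels are placeholders; the species column is empty in this copy.
ROWS = [
    ("row 1", 162039, 89453, 18369, 27139),
    ("row 2", 162218, 89606, 18381, 27156),
    ("row 3", 162302, 89938, 18094, 27135),
    ("row 4", 160674, 88236, 19086, 26676),
]


def quadripartite_residual(size, lsc, ssc, ir):
    """Return size - (LSC + SSC + 2*IR); zero means the parts add up."""
    return size - (lsc + ssc + 2 * ir)


residuals = {label: quadripartite_residual(*values) for label, *values in ROWS}

for label, residual in residuals.items():
    print(f"{label}: residual = {residual} bp")
```

With the values as printed, rows 3 and 4 sum exactly, while rows 1 and 2 leave residuals of -61 and -81 bp. The source does not explain the difference, so this is only a consistency check on the table and says nothing about which figures are correct.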
List of genes present in the Spondias tuberosa chloroplast genome, obtained by genome annotation using Sapindus mukorossi as reference.
| Group of genes | Name of genes |
|---|---|
| Transfer RNAs | trnA-UGC (2x), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-UCC, trnH-GUG, trnI-CAU (2x), trnI-GAU (3x), trnK-UUU, trnL-CAA (2x), trnL-UAA, trnL-UAG, trnM-CAU, trnN-GUU (2x), trnP-UGG, trnQ-UUG, trnR-ACG (2x), trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU (2x), trnT-UGU, trnV-GAC (2x), trnV-UAC, trnW-CCA, trnY-GUA |
| Ribosomal RNAs (16S, 23S, 4.5S, 5S) | rrn16 (2x), rrn23 (2x), rrn4.5 (2x), rrn5 (2x) |
| Ribosomal protein small subunit | rps2, rps3, rps4, rps7 (2x), rps8, rps11, rps12, rps14, rps15, rps16, rps18, rps19 (2x) |
| Ribosomal protein large subunit | rpl2 (2x), rpl14, rpl16, rpl20, rpl22, rpl23 (2x), rpl32, rpl33, rpl36 |
| Subunits (α, β, β′, β″) of the DNA-dependent RNA polymerase | rpoA, rpoB, rpoC1, rpoC2 |
| Photosystem I | psaA, psaB, psaC, psaI, psaJ |
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ |
| Cytochrome b/f complex | petA, petB, petD, petG, petL, petN |
| ATP synthase | atpA, atpB, atpE, atpF, atpH, atpI |
| NADH dehydrogenase | ndhA, ndhB (2x), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK |
| Large subunit of RuBisCO | rbcL |
| Acetyl-CoA carboxylase | accD |
| Cytochrome c biogenesis | ccsA |
| Maturase | matK |
| ATP-dependent protease | clpP |
| Inner membrane protein | cemA |
| Conserved hypothetical chloroplast ORFs | ycf1 (2x), ycf2 (2x), ycf3, ycf4, ycf15 (2x) |
Figure 1. Complete gene map of the Spondias chloroplast genomes. Gene annotations are represented in green. The chloroplast genomes are represented in purple (S. tuberosa), red (S. bahiensis), and blue (S. mombin). LSC: large single-copy region; SSC: small single-copy region; IR: inverted repeat. The green ring represents the A+T content and the blue ring indicates the C+G content. The numbers next to S. tuberosa (purple circle) represent nucleotide positions (in kbp).
Figure 2. Comparative analysis of microsatellites in the chloroplast genomes of Spondias. (A) Microsatellite type distribution in the three Spondias species. (B) Venn diagram showing the number of SSRs shared among S. bahiensis, S. tuberosa, and S. mombin.
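For readers unfamiliar with how microsatellites (SSRs) are tallied by type, the Python sketch below shows one common approach: scanning a sequence for perfect tandem repeats with a regular expression and grouping the hits by motif length. It is an illustration only, not the method used in the study. The function name `find_ssrs`, the minimum repeat counts in `MIN_REPEATS`, and the test sequence are all assumptions introduced here.

```python
import re
from collections import Counter

# Hypothetical thresholds: motif length -> minimum number of tandem copies.
# These are illustrative defaults, not values taken from the study.
MIN_REPEATS = {1: 10, 2: 5, 3: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for each perfect tandem repeat in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip homopolymer runs reported as longer motifs (e.g. "AA").
            if motif_len > 1 and len(set(motif)) == 1:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


# Made-up sequence containing one mono-, one di-, and one trinucleotide SSR.
example = "GG" + "A" * 10 + "C" + "AT" * 5 + "G" + "TTC" * 4
ssrs = find_ssrs(example)
type_counts = Counter(len(motif) for _, motif, _ in ssrs)
print(ssrs)
print(dict(type_counts))
```

Running the same function on each species' sequence and comparing the resulting motif sets would give the kind of per-type counts and shared-SSR counts that panels A and B summarize, subject to whatever thresholds are chosen.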
Figure 3. Molecular phylogenetic analysis by the maximum likelihood method, with support values estimated by bootstrap.