| Literature DB >> 32614355 |
Rhewter Nunes1, Ueric José Borges de Souza1, Cintia Pelegrineti Targueta1, Rafael Barbosa Pinto1, Thannya Nascimento Soares1, José Alexandre Felizola Diniz-Filho2, Mariana Pires de Campos Telles1,3.
Abstract
Caryocar brasiliense (Caryocaraceae) is a Neotropical tree species widely distributed in Brazilian Savannas. This species is very popular in central Brazil mainly by the use of its fruits in the local cuisine, and indeed it is one of the candidates, among Brazilian native plants, for fast track incorporation into cropping systems. Here we sequenced the complete chloroplast genome of C. brasiliense and used the data to access its genomic resources using high-throughput sequencing. The chloroplast exhibits a genome length of 165,793 bp and the typical angiosperm quadripartite structure with two copies of an inverted repeat sequence (IRa and IRb) of 34,902 bp each, separating a small single copy (SSC) region of 11,852 bp and a large single copy (LSC) region of 84,137 bp. The annotation analysis identified 136 genes being 87 protein-coding, eight rRNA and 37 tRNA genes. We identified 49 repetitive DNA elements and 85 microsatellites. A bayesian phylogenetic analysis helped to understand previously unresolved relationships in Malpighiales, placing Caryocaraceae as a separated group in the order, with high supported nodes. This study synthetizes valuable information for further studies allowing a better understanding of evolutionary patterns in the group and providing resources for future breeding programs.Entities:
Year: 2020 PMID: 32614355 PMCID: PMC7263422 DOI: 10.1590/1678-4685-GMB-2019-0161
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Figure 1Chloroplast genome map of Caryocar brasiliense. The genes drawn outside and inside of the circle are transcribed in clockwise and counterclockwise directions, respectively. Genes were colored based on their functional groups. The inner circle shows the quadripartite structure of the chloroplast: small single copy (SSC), large single copy (LSC) and a pair of inverted repeats (IRa and IRb). The gray ring marks the GC content with the inner circle marking a 50% threshold. Genes that have introns were marked with “*” and pseudogenes were marked with “#”.
Comparative chloroplast genome structural features in 10 species from Malpighiales order. LSC: Large Single Copy region; SSC: Small Single Copy region; IR: Inverted Repeats regions and bp: base pair.
| Species | Family | Genome size (bp) | LSC (bp) | SSC (bp) | IR (bp) | GC(%) |
|---|---|---|---|---|---|---|
|
| Caryocaraceae | 165,793 | 84,137 | 11,852 | 34,902 | 36.7 |
|
| Clusiaceae | 158,179 | 86,458 | 17,703 | 27,009 | 36.1 |
|
| Chrysobalanaceae | 163,937 | 89,188 | 19,817 | 26,885 | 36.2 |
|
| Erythroxylaceae | 163,937 | 91,384 | 18,137 | 27,208 | 35.9 |
|
| Euphorbiaceae | 161,453 | 89,295 | 18,250 | 26,954 | 35.9 |
|
| Linaceae | 156,721 | 81,767 | 10,974 | 31,990 | 37.5 |
|
| Malpighiaceae | 160,329 | 88,524 | 17,833 | 26,986 | 36.8 |
|
| Passifloraceae | 151,406 | 85,720 | 13,378 | 26,154 | 37.0 |
|
| Salicaceae | 156,067 | 84,367 | 16,670 | 27,509 | 36.8 |
|
| Violaceae | 156,507 | 85,691 | 18,008 | 26,404 | 36.3 |
Figure 2Comparison of the junctions involving Inverted Repeat (IRa and IRb) regions with Large Single Copy (LSC) and Small Single Copy (SSC) regions among 10 chloroplast genomes of Malpighiales. The IR regions are represented in yellow whereas LSC and SSC in blue and orange, respectively. The white boxes represent the genes present in each region. The arrows represent the distance (in base pairs) of genes from the junction site between regions.
Figure 3Phylogenetic tree reconstruction based on 52 taxa using Bayesian inference based on 76 protein-coding chloroplast genes. Numbers represent the Bayesian posterior probability given to each node. The bars on the right represents the botanic families of species.