Alison P A Menezes, Luciana C Resende-Moreira, Renata S O Buzatti, Alison G Nazareno, Monica Carlsen, Francisco P Lobo, Evanguedes Kalapothakis, Maria Bernadete Lovato.
Abstract
Byrsonima is the third largest genus (about 200 species) in the Malpighiaceae family, and one of the most common in Brazilian savannas. However, no molecular phylogeny is available for the genus, and taxonomic uncertainties remain at the generic and family levels. Herein, we sequenced the complete chloroplast genomes of B. coccolobifolia and B. crassifolia, the first described for Malpighiaceae, and performed comparative analyses with sequences previously published for other families in the order Malpighiales. The assembled chloroplast genomes had a similar structure, gene content and organization, even when compared with species from other families. Genome size was 160,212 bp in B. crassifolia and 160,329 bp in B. coccolobifolia, with both genomes containing 115 genes (four ribosomal RNA genes, 28 tRNA genes and 83 protein-coding genes). We also identified highly divergent sequences that might be informative for phylogenetic inference in the order Malpighiales, in the family Malpighiaceae and within the genus Byrsonima. A phylogenetic reconstruction of Malpighiales based on these regions highlighted their utility for phylogenetic studies. The comparative analyses among species in Malpighiales provided insights into chloroplast genome evolution in this order, including the presence/absence of three genes (infA, rpl32 and rps16) and two pseudogenes (ycf1 and rps19).
Year: 2018 PMID: 29396532 PMCID: PMC5797077 DOI: 10.1038/s41598-018-20189-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
General information and comparison of chloroplast genomes of Byrsonima coccolobifolia and B. crassifolia.
| Characteristics | B. coccolobifolia | B. crassifolia |
|---|---|---|
| Size (bp) | 160329 | 160212 |
| LSC length (bp) | 88524 | 88448 |
| SSC length (bp) | 17833 | 17814 |
| IR length (bp) | 26986 | 26975 |
| Number of genes | 139 | 139 |
| Protein-coding genes | 94 | 94 |
| tRNA genes | 37 | 37 |
| rRNA genes | 8 | 8 |
| Genes with intron(s) | 18 | 18 |
| GC content, total (%) | 36.76 | 36.77 |
| GC content, LSC (%) | 34.53 | 34.52 |
| GC content, SSC (%) | 30.66 | 30.76 |
| GC content, IR (%) | 42.4 | 42.4 |
| GC content, CDS (%) | 37.74 | 37.72 |
| GC content, rRNA (%) | 55.42 | 55.42 |
| GC content, tRNA (%) | 53.11 | 53.01 |
| Protein-coding genes (% bp) | 50.2 | 50.2 |
| Noncoding regions (% bp) | 49.8 | 49.8 |
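
As a quick consistency check on the table above, the total genome size should equal LSC + SSC + 2 × IR, because the inverted repeat is present in two copies (IRa and IRb). The Python sketch below applies that check to the tabulated lengths. The dictionary layout and the function name `quadripartite_length` are illustrative assumptions and are not taken from the original study.

```python
# Region lengths (bp) copied from the table above.
GENOMES = {
    "B. coccolobifolia": {"total": 160329, "LSC": 88524, "SSC": 17833, "IR": 26986},
    "B. crassifolia": {"total": 160212, "LSC": 88448, "SSC": 17814, "IR": 26975},
}


def quadripartite_length(regions):
    """Sum LSC + SSC + two copies of the inverted repeat (IRa and IRb)."""
    return regions["LSC"] + regions["SSC"] + 2 * regions["IR"]


for species, regions in GENOMES.items():
    computed = quadripartite_length(regions)
    # Report the computed length and whether it matches the tabulated total.
    print(species, computed, computed == regions["total"])
```

For both species the computed sum matches the reported genome size.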
Figure 1. Chloroplast genome circular map of Byrsonima coccolobifolia Kunth and B. crassifolia (L.) Kunth (Malpighiaceae) with annotated genes. Genes inside the circle are transcribed clockwise, genes outside are transcribed counter-clockwise. Genes are color coded according to functional groups. Boundaries of the small (SSC) and large (LSC) single copy regions and inverted repeat (IRa and IRb) regions are noted in the inner circle for each species. Picture of B. crassifolia was taken by Dr. Daniel L. Nickrent (source: http://www.phytoimages.siu.edu). Picture of B. coccolobifolia was provided by Maurício Mercadante.
Chloroplast genome gene content and functional classification in Byrsonima coccolobifolia Kunth and B. crassifolia (L.) Kunth.
| Gene group | Gene name | |||
|---|---|---|---|---|
| Ribosomal RNA genes |
|
|
| |
| Transfer RNA genes |
|
|
|
|
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
| |||
| Small subunit of ribosome |
|
|
|
|
|
|
|
|
| |
|
|
|
|
| |
|
| ||||
| Large subunit of ribosome |
|
|
|
|
|
|
|
|
| |
|
| ||||
| RNA polymerase subunits |
|
|
|
|
| NADH dehydrogenase |
|
|
|
|
|
|
|
|
| |
|
|
|
| ||
| Photosystem I |
|
|
|
|
|
|
| |||
| Photosystem II |
|
|
|
|
|
|
|
|
| |
|
|
|
|
| |
|
|
|
| ||
| Cytochrome b/f complex |
|
|
|
|
|
|
| |||
| ATP synthase |
|
|
|
|
|
|
| |||
| Large subunit of rubisco |
| |||
| Maturase |
| |||
| Protease |
| |||
| Envelope membrane protein |
| |||
| Subunit of acetyl-CoA-carboxylase |
| |||
| c-type cytochrome synthesis |
| |||
| Component of TIC complex |
|
| ||
| Hypothetical chloroplast reading frames |
| |||
| ORFs |
|
|
|
|
|
| ||||
*Genes containing introns; Ψ, pseudogene; genes in bold are located within the IR and are therefore duplicated.
Figure 2. Comparisons of percentage identity of chloroplast genomes for six species belonging to five different families within the order Malpighiales. Bc: Byrsonima coccolobifolia; Br: Byrsonima crassifolia (Malpighiaceae); Ci: Chrysobalanus icaco (Chrysobalanaceae); Vs: Viola seoulensis (Violaceae); Pa: Populus alba (Salicaceae); Rc: Ricinus communis (Euphorbiaceae). The percentage of identity is shown on the vertical axis, ranging from 50% to 100%, while the horizontal axis shows the position within the chloroplast genome. Each arrow displays an annotated gene and the direction of its transcription in the reference genome (Byrsonima coccolobifolia). Genome regions are color coded as exon, untranslated region (UTR), conserved noncoding sequences (CNS) and mRNA.
Figure 3. Maximum likelihood trees for the order Malpighiales inferred from complete chloroplast genomes of nine species of the order (right; using all putative 1:1 orthologs) and from five highly variable coding sequences identified in this study (left; accD, matK, rpoA, ycf2 and rps7). Bootstrap values are indicated above branches.
Figure 4. Details of boundary positions between inverted repeat regions (IR) and large and small single copy regions (LSC and SSC) among nine chloroplast genomes within the order Malpighiales. Bc: Byrsonima coccolobifolia; Br: B. crassifolia (Malpighiaceae); Ci: Chrysobalanus icaco; Hr: Hirtella racemosa (Chrysobalanaceae); Vs: Viola seoulensis (Violaceae); Sp: Salix purpurea; Pa: Populus alba (Salicaceae); Rc: Ricinus communis; Me: Manihot esculenta (Euphorbiaceae). Both Byrsonima species are represented together at the top of the figure because their boundaries do not differ. The direction of each arrow shows the direction of transcription (right is forward and left is reverse). Ψ indicates a pseudogene. Arrow lengths are illustrative. The number of base pairs (bp) indicates the distance from the boundary to the end of the gene. Complete chloroplast genome sizes are noted on the right-hand side of the panel.
Distribution of repeated sequences in the chloroplast genome of Byrsonima coccolobifolia and B. crassifolia.
| Type | Location | Region | Repeated sequence | Size (bp) |
|---|---|---|---|---|
| F | ycf2 | IRa | ATATCGTCACTATCATCAATATCGTCACTATCATCAATATCGTCACTATCATCAATA | 57 |
| P | ycf2 | IRa/IRb | TATTGATGATAGTGACGATATTGATGATAGTGACGATATTGATGATAGTGACGATAT | 57 |
| P | ycf2 | IRa/IRb | TATTGATGATAGTGACGATATTGATGATAGTGACGATATTGATGATAGTGACGATAT | 57 |
| F | ycf2 | IRb | ATATCGTCACTATCATCAATATCGTCACTATCATCAATATCGTCACTATCATCAATA | 57 |
| P | trnQ-rps16 | LSC | AGAGATCTAATCCCATTGATTGAATTCAATCAATGGGATTAGATCTCT | 48 |
| F | trnS-trnQ* | LSC | TATACTATTAGATACTACTATATACTATTAGTATACTATTAGATACTA | 48 |
| P | petN-trnT* | LSC | AGATAGTATGGTAGAAAGAAATATATATATTTCTTTCTACCATACTAT | 48 |
| P | petA-petL | LSC | CTTTTCGATTTTATACGTATAAATTTATACGTATAAAATCGAAAAG | 46 |
| F | ycf2 | IRa | ATATCGTCACTATCATCAATATCGTCACTATCATCAATA | 39 |
| P | ycf2 | IRa/IRb | TATTGATGATAGTGACGATATTGATGATAGTGACGATAT | 39 |
| P | ycf2 | IRa/IRb | TATTGATGATAGTGACGATATTGATGATAGTGACGATAT | 39 |
| F | ycf2 | IRb | ATATCGTCACTATCATCAATATCGTCACTATCATCAATA | 39 |
| R | rbcL-accD | LSC | AGAATTAAGAGAATTAAAATTAAGAGAATTAAGA | 34 |
| F | psaB and psaA | LSC | ACCGATATTGCACACCATCATTTAGCTATTGCA | 33 |
| P | petN-psbM | LSC | TTTAATTTAAATTGAATTCAATTTAAATTAAA | 32 |
| P | trnR-trnS and ycf2 | LSC/IRa | ATATATGTTTGGAATAGATTCCATTTTGAGA | 31 |
| F | trnR-trnS and ycf2 | LSC/IRa | TCTCAAAATGGAATCTATTCCAAACATATAT | 31 |
| F | psbK-psbI* | LSC | ATACTATTAGATACTACTATATACTATTAG | 30 |
| F | psbK-psbI* | LSC | ATACTATTAGATACTACTATATACTATTAG | 30 |
*Repeats that appear only in B. crassifolia. Types of repeats are F (forward), P (palindrome) and R (reverse).
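
The sequences in the table are consistent with P (palindrome) denoting a match to the reverse complement: the 48-bp trnQ-rps16 repeat equals its own reverse complement, and each ycf2 palindromic repeat is the reverse complement of the corresponding forward repeat. The Python sketch below checks this for two entries copied from the table. The helper names `reverse_complement` and `is_self_palindrome` are illustrative assumptions, and this is not the repeat-detection method used in the study.

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def is_self_palindrome(seq):
    """True if the sequence reads the same as its reverse complement."""
    return seq == reverse_complement(seq)


# Sequences copied from the repeat table above.
trnq_rps16_48 = "AGAGATCTAATCCCATTGATTGAATTCAATCAATGGGATTAGATCTCT"
ycf2_forward_39 = "ATATCGTCACTATCATCAATATCGTCACTATCATCAATA"
ycf2_palindrome_39 = "TATTGATGATAGTGACGATATTGATGATAGTGACGATAT"

# The intergenic 48-bp repeat is its own reverse complement.
print(len(trnq_rps16_48), is_self_palindrome(trnq_rps16_48))
# The ycf2 P repeat is the reverse complement of the ycf2 F repeat.
print(reverse_complement(ycf2_forward_39) == ycf2_palindrome_39)
```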
Twenty most divergent regions of chloroplast genome based on a comparison between Byrsonima coccolobifolia Kunth and B. crassifolia (L.) Kunth.
| Region | Nucleotide diversity (π) | Total number of mutations (η) | Region length (bp) |
|---|---|---|---|
| | 0.065574 | 4 | 61 |
| | 0.040000 | 10 | 250 |
| | 0.029851 | 2 | 80 |
| | 0.015385 | 1 | 65 |
| | 0.014337 | 4 | 279 |
| | 0.011765 | 1 | 85 |
| | 0.011765 | 2 | 172 |
| | 0.011475 | 7 | 625 |
| | 0.011050 | 6 | 712 |
| | 0.009639 | 4 | 417 |
| | 0.008869 | 4 | 453 |
| | 0.007874 | 3 | 381 |
| | 0.007813 | 1 | 128 |
| | 0.007246 | 1 | 139 |
| | 0.006289 | 3 | 518 |
| | 0.005859 | 3 | 555 |
| | 0.005682 | 1 | 176 |
| | 0.005587 | 4 | 716 |
| | 0.005556 | 1 | 184 |
| | 0.005178 | 8 | 1,575 |
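
When only two sequences are compared, nucleotide diversity (π) reduces to the proportion of compared sites that differ. For the first row above, 4 mutations over 61 bp gives 4/61 ≈ 0.065574, which matches the tabulated value. In rows where η divided by the region length is slightly lower than π (for example 2/80 = 0.025 against a tabulated 0.029851), the numbers are consistent with gapped alignment columns being excluded from the site count, although the source does not state this. The Python sketch below illustrates the calculation under that assumption. The function name `pairwise_diversity` and the toy alignment are hypothetical and are not data from the study.

```python
def pairwise_diversity(seq_a, seq_b, gap="-"):
    """Return (pi, mutations, sites) for two aligned sequences.

    Columns where either sequence has a gap are skipped; this gap
    handling is an assumption, not a documented detail of the study.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    sites = 0
    mutations = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # ignore gapped columns
        sites += 1
        if a != b:
            mutations += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    return mutations / sites, mutations, sites


# Hypothetical toy alignment: one gapped column, one substitution.
aligned_a = "ATGC-TTAGC"
aligned_b = "ATGCATTCGC"
pi, eta, sites = pairwise_diversity(aligned_a, aligned_b)
print(pi, eta, sites)

# First row of the table: 4 mutations over 61 bp.
print(round(4 / 61, 6))
```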