| Literature DB >> 34428237 |
Raquel Santos da Silva1, Charles Roland Clement2, Eduardo Balsanelli3,4, Valter Antonio de Baura3, Emanuel Maltempi de Souza3, Hugo Pacheco de Freitas Fraga1, Leila do Nascimento Vieira1.
Abstract
The family Arecaceae is distributed throughout tropical and subtropical regions of the world. Among the five subfamilies, Arecoideae is the most species-rich and still contains some ambiguous inter-generic relationships, such as those within subtribes Attaleinae and Bactridineae. The hypervariable regions of plastid genomes (plastomes) are interesting tools to clarify unresolved phylogenetic relationships. We sequenced and characterized the plastome of Bactris gasipaes (Bactridinae) and compared it with eight species from the three Cocoseae sub-tribes (Attaleinae, Bactridinae, and Elaeidinae) to perform comparative analysis and to identify hypervariable regions. The Bactris gasipaes plastome has 156,646 bp, with 113 unique genes. Among them, four genes have an alternative start codon (cemA, rps19, rpl2, and ndhD). Plastomes are highly conserved within tribe Cocoseae: 97.3% identity, length variation of ~2 kb, and a single ~4.5 kb inversion in Astrocaryum plastomes. The LSC/IR and IR/SSC junctions vary among the subtribes: in Bactridinae and Elaeidinae the rps19 gene is completely contained in the IR region; in the subtribe Attaleinae the rps19 gene is only partially contained in the IRs. The hypervariable regions selected according to sequence variation (SV%) and frequency of parsimony informative sites (PIS%) revealed plastome regions with great potential for molecular analysis. The ten regions with greatest SV% showed higher variation than the plastid molecular markers commonly used for phylogenetic analysis in palms. The phylogenetic trees based on the plastomes and the hypervariable regions (SV%) datasets had well-resolved relationships, with consistent topologies within tribe Cocoseae, and confirm the monophyly of the subtribes Bactridinae and Attaleinae.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34428237 PMCID: PMC8384209 DOI: 10.1371/journal.pone.0256373
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Substitution models selected for the phylogenetic inferences using Maximum Likelihood (ML).
| Region | Models for ML |
|---|---|
| Plastome | K3Pu+F+R2 |
|
| HKY+F |
|
| JC+I |
|
| F81+F+G4 |
|
| F81+F+I |
|
| TPM2+F+I |
|
| F81+F+I |
|
| F81+F+I |
|
| HKY+F+I |
|
| F81+F+I |
|
| F81+F+I |
|
| F81+F+I |
|
| F81+F+I |
|
| F81+F+I |
|
| F81+F+I |
List of genes of Bactris gasipaes plastome organized according to their location.
| Plastome region | Name of genes |
|---|---|
| Large Single Copy (LSC) | |
| Inverted Repeat (IR) | |
| IR / SSC junction | |
| Small Single Copy (SSC) |
* Intron-containing genes.
Plastomes of tribe Cocoseae.
| Subtribe | Species | Plastome (bp) | LSC (bp) | IR (bp) | SSC (bp) |
|---|---|---|---|---|---|
| Elaeidinae |
| 156,973 | 85,192 | 27,071 | 17,639 |
| Bactridinae |
| 156,804 | 85,037 | 27,081 | 17,605 |
|
| 156,801 | 85,017 | 27,081 | 17,622 | |
|
| 156,646 | 85,118 | 27,038 | 17,452 | |
|
| 156,500 | 84,936 | 27,092 | 17,380 | |
| Attaleinae |
| 155,053 | 84,535 | 26,522 | 17,474 |
|
| 154,731 | 84,230 | 26,555 | 17,391 | |
|
| 154,048 | 83,805 | 26,437 | 17,369 |
LSC: large single copy region; IR: inverted repeat; SSC: small single copy region.
Fig 1Plastome rearrangement analysis within tribe Cocoseae.
Locally collinear blocks (LCBs) are identified by colors. The 4.5 kb inversion in Astrocaryum is in green.
Fig 2Comparison of plastome junctions (IRb/LSC; IRb/SSC, SSC/IRa; IRa/LSC) among Cocoseae species.
The numbers indicate sequence length in base pairs.
Fig 3Hypervariable plastome regions compared with commonly used plastid and nuclear markers.
A—The ten plastome regions with greatest sequence variation (SV%); B—The ten plastome regions with greatest frequency of parsimony informative sites (PIS%).
Frequency of substitutions/mutations and insertions/deletions (InDels) among the plastome sequence.
| Plastome | CDS | IGS | Introns | |
|---|---|---|---|---|
| Mutations (%) | 1.53 | 0.80 | 2.30 | 1.12 |
| InDels (%) | 0.30 | 0.01 | 0.58 | 0.34 |
| Mutation / InDel ratio | 5.02 | 79.90 | 3.99 | 3.26 |
CDS–coding sequence; IGS–intergenic spacer.
Fig 4Phylogenetic trees based on maximum likelihood inference.
A–Phylogenetic inference using plastome sequences (one IR removed); B—Phylogenetic inference using the ten plastome regions with greatest sequence variability (SV%) values. The numbers above the branches are maximum likelihood bootstrap values (1000 replicates).