| Literature DB >> 35901127 |
Ana Flávia Francisconi1, Luiz Augusto Cauz-Santos2, Jonathan Andre Morales Marroquín1, Cássio van den Berg3,4, Alessandro Alves-Pereira5, Luciano Delmondes de Alencar1, Doriane Picanço-Rodrigues6, Cesar Augusto Zanello1, Marcones Ferreira Costa1,7, Maria Teresa Gomes Lopes8, Elizabeth Ann Veasey4, Maria Imaculada Zucchi1,9.
Abstract
The Brazilian palm fruits and hearts-of-palm of Euterpe edulis, E. oleracea and E. precatoria are an important source for agro-industrial production, due to overexploitation, conservation strategies are required to maintain genetic diversity. Chloroplast genomes have conserved sequences, which are useful to explore evolutionary questions. Besides the plastid DNA, genome skimming allows the identification of other genomic resources, such as single nucleotide polymorphisms (SNPs), providing information about the genetic diversity of species. We sequenced the chloroplast genome and identified gene content in the three Euterpe species. We performed comparative analyses, described the polymorphisms among the chloroplast genome sequences (repeats, indels and SNPs) and performed a phylogenomic inference based on 55 palm species chloroplast genomes. Finally, using the remaining data from genome skimming, the nuclear and mitochondrial reads, we identified SNPs and estimated the genetic diversity among these Euterpe species. The Euterpe chloroplast genomes varied from 159,232 to 159,275 bp and presented a conserved quadripartite structure with high synteny with other palms. In a pairwise comparison, we found a greater number of insertions/deletions (indels = 93 and 103) and SNPs (284 and 254) between E. edulis/E. oleracea and E. edulis/E. precatoria when compared to E. oleracea/E. precatoria (58 indels and 114 SNPs). Also, the phylogeny indicated a closer relationship between E. oleracea/E. precatoria. The nuclear and mitochondrial genome analyses identified 1,077 SNPs and high divergence among species (FST = 0.77), especially between E. edulis and E. precatoria (FST = 0.86). These results showed that, despite the few structural differences among the chloroplast genomes of these Euterpe palms, a differentiation between E. edulis and the other Euterpe species can be identified by point mutations. This study not only brings new knowledge about the evolution of Euterpe chloroplast genomes, but also these new resources open the way for future phylogenomic inferences and comparative analyses within Arecaceae.Entities:
Mesh:
Year: 2022 PMID: 35901127 PMCID: PMC9333295 DOI: 10.1371/journal.pone.0266304
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Gene map of the Euterpe edulis, E. oleracea and E. precatoria chloroplast genomes.
Genes represented inside the large circle are oriented clockwise and the ones outside are oriented counter clockwise. The different colors represent functional groups, and the darker gray in the inner circle indicates the GC content. The quadripartite structure is also reported as: LSC = Large Single Copy, SSC = Small Single Copy, IRA/IRB = Inverted Repeats A and B.
General features of chloroplast genomes of three Euterpe species.
| Species | Total cpDNA size (bp) | Length of LSC region (bp) | Length of IR region (bp) | Length of SSC region (bp) | GC content (%) | Protein coding genes (bp) | tRNA coding genes (bp) | rRNA coding genes (bp) | Introns (bp) | Intergenic Regions (bp) |
|---|---|---|---|---|---|---|---|---|---|---|
|
| 159,232 | 87,237 | 54,280 | 17,715 | 37.20 | 80,462 | 2,880 | 9,052 | 18,068 | 48,770 |
|
| 159,260 | 87,273 | 54,232 | 17,755 | 37.30 | 80,415 | 2,881 | 9,052 | 18,062 | 48,850 |
|
| 159,275 | 87,282 | 54,234 | 17,759 | 37.20 | 80,411 | 2,881 | 9,052 | 18,064 | 48,867 |
Gene content in Euterpe edulis, E. oleracea and E. precatoria chloroplast genomes according to each respective category.
| Category | Gene | |||||||
|---|---|---|---|---|---|---|---|---|
| Subunits of photosystem I | ||||||||
| Subunits of photosystem II | ||||||||
| Subunits of cytochrome b/f complex | ||||||||
| Subunits of ATP synthase | ||||||||
| Large subunit of rubisco |
| |||||||
| Subunits of NADH-dehydrogenase | ||||||||
| Proteins of large ribosomal subunit | ||||||||
| Proteins of small ribosomal subunit | ||||||||
| Subunits of RNA polymerase | ||||||||
| Maturase |
| |||||||
| Translational initiation factor |
| |||||||
| Protease |
| |||||||
| Envelope membrane protein |
| |||||||
| Subunit of acetyl-CoA carboxylase |
| |||||||
| Cytochrome c biogenesis |
| |||||||
| Conserved hypothetical genes | ||||||||
| Component of TIC complex |
| |||||||
| Component of 2-MD heteromeric AAAATPase complex |
| |||||||
| Ribosomal RNAs | ||||||||
| Transfer RNAs | ||||||||
aIntron-containing gene
bTwo gene copies in the Irs
cGene divided into two independent transcription units.
Fig 2Synteny and divergence in the SSC size detected in Arecaceae chloroplast genomes using the Mauve multiple-genome alignment program.
A sample of nine different chloroplast genomes is shown. Color bars indicate syntenic blocks and the lines indicate the correspondence between them. Blocks on the top row are in the same orientation, while blocks on the bottom row are in inverse orientation.
Fig 3Comparison of the IRA and IRB borders among Brazilian palms from the Arecoideae species.
The numbers indicate the lengths of IGSs, genes, and spacers between IR-LSC and IR-SSC junctions. The ycf1* and rps19* genes have incomplete CDSs.
Fig 4Distribution and classification of SSR and dispersed repeats in the chloroplast genomes of Euterpe edulis, E. oleracea and E. precatoria.
(A) Proportion of coding and non-coding regions containing SSRs; (B) Proportion of different types of SSR present in the chloroplast genomes; (C) Proportion of regions containing repeats; (D) Frequency distribution of different types of repeats: F = Forward, P = Palindrome, R = Reverse and C = Complement. CDS = Coding sequence, IGS = Intergenic spacer.
Fig 5Indels and single nucleotide polymorphisms (SNPs) detected in comparisons between Euterpe chloroplast genomes.
(A) Proportion of indels in different coding and non-coding regions; (B) Comparison of the number of SNPs found in the alignment (C) Proportion of SNPs in different coding and non-coding regions of the chloroplast genomes.
Fig 6Sliding window analysis of the alignment from the chloroplast genome of Euterpe edulis, E. oleracea and E. precatoria chloroplast genomes.
The regions with high nucleotide variability (Pi > 0.02) are indicated. Pi is the nucleotide diversity of each window, and the window length was 200 bp with 50 bp step sizes.
Fig 7Majority-rule consensus tree of 30,000 trees obtained from a Bayesian inference analysis of chloroplast protein coding genes of 55 taxa.
Posterior probabilities (PP) for each are indicated above branches. Co = Coryphoideae, Ar = Arecoideae, Ny = Nypoideae, Ce = Ceroxyloideae, Ca = Calamoideae.
Parameters of genetic diversity using 1,077 SNPs found the nuclear and mitochondrial genome sequences from three Euterpe species.
|
| 1,251 | 1,309 | 1,263 |
|
| 292 | 260 | 421 |
|
| 0.162 (±0.011) | 0.215 (± 0.013) | 0.173 (±0.012) |
|
| 0.081 (±0.006) | 0.108 (±0.006) | 0.086 (±0.006) |
A = number of alleles, Ap = number of private alleles, H = observed heterozygosity, H = expected heterozygosity.