| Literature DB >> 31481967 |
Ana M Bedoya1, Bradley R Ruhfel2, C Thomas Philbrick3, Santiago Madriñán4, Claudia P Bove5, Attila Mesterházy6, Richard G Olmstead1.
Abstract
With the advent of next-generation sequencing technologies, whole-plastome data can be obtained as a byproduct of low-coverage sequencing of the plant genomic DNA. This provides an opportunity to study plastid evolution across groups, as well as testing phylogenetic relationships among taxa. Within the order Malpighiales (∼16,000 spp.), the Podostemaceae (∼300 spp.) stand out for their unique habit, living attached to rocks in fast-flowing aquatic habitats, and displaying highly modified morphologies that confound our understanding of their classification, biology, and evolution. In this study, we used genome skimming data to assemble the full plastid genome of 5 species within Podostemaceae. We analyzed our data in a comparative framework within Malpighiales to determine the structure, gene content, and rearrangements in the plastomes of the family. The Podostemaceae have one of the smallest plastid genomes reported so far for the Malpighiales, possibly due to variation in length of inverted repeat (IR) regions, gene loss, and intergenic region variation. We also detected a major inversion in the large single-copy region unique to the family. The uncommon loss or pseudogenization of ycf1 and ycf2 in angiosperms and in land plants in general is also found to be characteristic of Podostemaceae, but the compensatory mechanisms and implications of this and of the pseudogenization of accD, rpl22, and clpP and loss of rps16 remain to be explained in this group. In addition, we estimated a phylogenetic tree among selected species in Malpighiales. Our findings indicate that the Podostemaceae are a distinct lineage with long branches that suggest faster rates of evolution in the plastome of the group, compared with other taxa in the order. This study lays the foundations for future phylogenomic studies in the family.Entities:
Keywords: Malpighiales; Podostemaceae; genome rearrangements; phylogenomics; plastome
Year: 2019 PMID: 31481967 PMCID: PMC6710714 DOI: 10.3389/fpls.2019.01035
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Provenance, voucher information, and/or GenBank accession numbers of the species in Malpighiales whose plastomes were included in this study.
| Species | Family | Voucher (Herbarium) | GenBank accession no. | Collection locality |
|---|---|---|---|---|
|
| Chrysobalanaceae | — | NC_024060 | |
|
| Clusiaceae | NC_036341 | ||
|
| Euphorbiaceae | — | NC_037192 | |
|
| Passifloraceae | — | NC_034285 | |
|
| Podostemaceae | C.P. Bove 2513 (R) | MN165812 | Brazil, South America |
|
| Podostemaceae | AMB 497 (ANDES) | MN165814 | Colombia, South America |
|
| Podostemaceae | C.P. Bove 2493 (R) | MN165813 | Brazil, South America |
|
| Podostemaceae | C.P. Bove 2524 (R) | MN165815 | Brazil, South America |
|
| Podostemaceae | A. Mesterhazy MLI 128(Z) | MN165816 | Mali, Africa |
|
| Salicaceae | — | NC_026722 | |
|
| Violaceae | — | NC_026986 |
Voucher number and collection locality are provided only for those species whose genome was generated in this study (*).
Figure 1Plastid genomes of the 5 species of Podostemaceae included in this study. Only functional genes are drawn, and GC content graphs are included as dark gray bars toward the center of each diagram. Intron-containing genes are marked with (*).
Structural information of the plastid genomes of Podostemaceae, Clusiaceae, Malpighiaceae, Chrysobalanaceae, Violaceae, Passifloraceae, and Salicaceae. The percentages of the total size of the genome that corresponds to each region are included.
| Species | Family | Plastome genome size (bp) | IRs length (bp) | SSC length (bp) | LSC length (bp) |
|---|---|---|---|---|---|
|
| Podostemaceae | 134,912 | 21,049 × 2 (∼30.1%) | 12,437 (∼8.9%) | 85,377 (∼61%) |
|
| Podostemaceae | 133,944 | 21,026 × 2 (∼31.4) | 12,395 (∼9.3%) | 79,497 (∼59.4%) |
|
| Podostemaceae | 131,951 | 19,945 × 2 (∼30.2%) | 12,283 (∼9.3%) | 79,778 (∼60.5%) |
|
| Podostemaceae | 134,374 | 21,041 × 2 (∼31.3) | 12,302 (∼9.2%) | 79,990 (∼59.5%) |
|
| Podostemaceae | 130,285 | 19,349 × 2 (∼29.7) | 12,662 (∼9.7%) | 78,925 (∼60.6%) |
|
| Clusiaceae | 158,179 | 27,009 × 2 (∼34.1%) | 17,704 (∼11.2%) | 86,457 (∼54.7%) |
|
| Malpighiaceae | 160,212 | 26,975 × 2 (∼33.7%) | 17,814 (∼11.1%) | 88,448 (∼55.2%) |
|
| Chrysobalanaceae | 162,891 | 26,866 × 2 (∼33%) | 19,915 (∼12.2%) | 89,244 (∼54.8%) |
|
| Violaceae | 156,507 | 26,404 × 2 (∼33.7%) | 18,008 (∼11.5%) | 85,691 (∼54.8%) |
|
| Passifloraceae | 151,406 | 26,152 × 2 (∼34.5%) | 13,378 (∼8.8%) | 85,724 (∼56.6%) |
|
| Salicaceae | 155,590 | 27,459 × 2 (∼35.3%) | 16,220 (∼10.4%) | 84,452 (∼54.3%) |
Gene content in all Podostemaceae species included in this study.
| Gene function | Gene group | Gene name |
|---|---|---|
| Self-replication | Ribosomal RNA genes |
|
| Transfer RNA genes |
| |
| Small subunit of ribosome |
| |
| Large subunit of ribosome |
| |
| RNA polymerase subunits |
| |
| Photosynthesis | Subunits of NADH dehydrogenase |
|
| Subunits of photosystem I |
| |
| Subunits of photosystem II |
| |
| Subunits of cytochrome b/f complex |
| |
| Subunits of ATP synthase |
| |
| Large subunit of Rubisco |
| |
| Other | Maturase |
|
| Envelope membrane protein |
| |
| C-type cytochrome synthesis |
| |
| ORFs |
|
Genes in bold correspond to genes that are located in the IRs and hence are duplicated. Genes that contain introns are marked with asterisk (*).
Figure 2Alignment resulted from Mauve showing a large inversion shared by all Podostemaceae. Color bars indicate syntenic blocks, and connecting lines indicate correspondence of blocks across genomes.
Figure 3Comparison of border positions of the 4 plastid regions (LSC, IRb, SSC, IRa) among plastomes of Ar, Apinagia riedelii; Mc, Marathrum capillaceum; Mu, Marathrum utile; MoC, Monostylis capillacea; Tt, Tristicha trifaria; Gm, Garcinia mangostana; Bc, Byrsonima crassifolia; Hr, Hirtella racemosa; Vs, Viola seoulensis; Pe, Passiflora edulis; Sp, Salix purpurea. Functional genes and truncated fragments are shown with the same color. The sizes of fragments in genes that are located in a boundary are shown.
Figure 4Comparison of percentage identity of plastomes in mVista using Garcinia mangostana (Gm) as reference. Ar, Apinagia riedelii; Mc, Marathrum capillaceum; Mu, Marathrum utile; Moc, Monostylis capillacea; Tt, Tristicha trifaria; Bc, Byrsonima crassifolia; Hr, Hirtella racemosa; Vs, Viola seoulensis; Pe, Passiflora edulis; Sp, Salix purpurea. The vertical axis corresponds to the percentage identity (50%–100%), while the horizontal axis shows the position of each region within the locus. Arrows indicate the transcription of annotated genes in the reference genome. Genome regions are color coded.
Figure 5Comparison of percentage identity of plastomes in mVista using Apinagia riedelii (Ar) as reference. Mc, Marathrum capillaceum; Mu, Marathrum utile; Moc, Monostylis capillacea; Tt, Tristicha trifaria; Gm, Garcinia mangostana; Bc, Byrsonima crassifolia; Hr, Hirtella racemosa; Vs, Viola seoulensis; Pe, Passiflora edulis; Sp, Salix purpurea. The vertical axis corresponds to the percentage identity (50%–100%), while the horizontal axis shows the position of each region within the locus. Arrows indicate the transcription of annotated genes in the reference genome. Genome regions are color coded.
Figure 6Maximum likelihood tree obtained with RAxML, using Averrhoa carambola as outgroup for rooting. Bootstrap support is shown above branches.