| Literature DB >> 35161429 |
Salvador Guzmán-Díaz1, Fabián Augusto Aldaba Núñez1, Emily Veltjen2,3, Pieter Asselman2, Isabel Larridon2,4, Marie-Stéphanie Samain1,2.
Abstract
Chloroplast genomes are considered to be highly conserved. Nevertheless, differences in their sequences are an important source of phylogenetically informative data. Chloroplast genomes are increasingly applied in evolutionary studies of angiosperms, including Magnoliaceae. Recent studies have focused on resolving the previously debated classification of the family using a phylogenomic approach and chloroplast genome data. However, most Neotropical clades and recently described species have not yet been included in molecular studies. We performed sequencing, assembly, and annotation of 15 chloroplast genomes from Neotropical Magnoliaceae species. We compared the newly assembled chloroplast genomes with 22 chloroplast genomes from across the family, including representatives from each genus and section. Family-wide, the chloroplast genomes presented a length of about 160 kb. The gene content in all species was constant, with 145 genes. The intergenic regions showed a higher level of nucleotide diversity than the coding regions. Differences were higher among genera than within genera. The phylogenetic analysis in Magnolia showed two main clades and corroborated that the current infrageneric classification does not represent natural groups. Although chloroplast genomes are highly conserved in Magnoliaceae, the high level of diversity of the intergenic regions still resulted in an important source of phylogenetically informative data, even for closely related taxa.Entities:
Keywords: chloroplast assembly; comparative genomics; complete chloroplast genome; phylogenomics; whole genome sequencing
Year: 2022 PMID: 35161429 PMCID: PMC8838774 DOI: 10.3390/plants11030448
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure A1Graphical maps of the 15 newly annotated plastomes generated by OGDRAW.
Plastome sequence length, assembly coverage, GC content, and gene content of the 37 Magnoliaceae plastomes included in the analyses; newly assembled plastomes are highlighted in gray. The classification is according to [51,60]. NA = Not applicable, LSC = large single copy region, SSC = small single copy region, IR = inverted repeat region, CDS = coding DNA sequence, tRNA = transfer RNA, rRNA = ribosomal RNA. 1: Synonym of Magnolia conifera var. chingii, treated as M. glaucifolia in [43]. 2: Synonym of M. vrieseana, treated as M. ovalis in [43].
| Genus | Section | Species | Plastome Sequence Length | Assembly Coverage | GC | Gene Content | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | LSC | SSC | IR | CDS | tRNAs | rRNAs | |||||
|
| NA |
| 159,426 | 87,762 | 18,998 | 26,333 | - | 0.39 | 92 | 45 | 8 |
|
| 159,961 | 88,231 | 18,966 | 26,382 | - | 0.39 | 92 | 45 | 8 | ||
|
|
|
| 160,021 | 88,047 | 18,767 | 26,603 | - | 0.39 | 92 | 45 | 8 |
|
| 159,738 | 87,934 | 18,690 | 26,557 | - | 0.39 | 92 | 45 | 8 | ||
|
| 159,828 | 87,958 | 18,760 | 26,555 | - | 0.39 | 92 | 45 | 8 | ||
|
|
| 160,027 | 88,130 | 18,725 | 26,586 | - | 0.39 | 92 | 45 | 8 | |
|
| 160,085 | 88,170 | 18,745 | 26,585 | - | 0.39 | 92 | 45 | 8 | ||
|
|
| 159,838 | 88,048 | 18,732 | 26,529 | - | 0.39 | 92 | 45 | 8 | |
|
|
| 160,075 | 88,161 | 18,740 | 26,587 | 65.9 | 0.39 | 92 | 45 | 8 | |
|
| 159,880 | 87,968 | 18,736 | 26,588 | 130.2 | 0.39 | 92 | 45 | 8 | ||
|
|
| 159,724 | 87,834 | 18,748 | 26,571 | 78.4 | 0.39 | 92 | 45 | 8 | |
|
| 159,711 | 87,782 | 18,759 | 26,585 | 48.5 | 0.39 | 92 | 45 | 8 | ||
|
| 159,779 | 87,867 | 18,760 | 26,576 | 74.1 | 0.39 | 92 | 45 | 8 | ||
|
|
| 160,134 | 88,213 | 18,799 | 26,561 | - | 0.39 | 92 | 45 | 8 | |
|
| 160,059 | 88,094 | 18,803 | 26,581 | - | 0.39 | 92 | 45 | 8 | ||
|
|
| 160,100 | 88,294 | 18,765 | 26,585 | - | 0.39 | 92 | 45 | 8 | |
|
| 160,144 | 88,179 | 18,781 | 26,592 | - | 0.39 | 92 | 45 | 8 | ||
|
| 159,988 | 88,123 | 18,799 | 26,533 | - | 0.39 | 92 | 45 | 8 | ||
|
| 159,926 | 88,134 | 18,770 | 26,511 | - | 0.39 | 92 | 45 | 8 | ||
| 160,008 | 88,077 | 18,809 | 26,561 | - | 0.39 | 92 | 45 | 8 | |||
|
| 160,177 | 88,240 | 18,785 | 26,576 | - | 0.39 | 92 | 45 | 8 | ||
|
| 160,232 | 88,294 | 18,766 | 26,586 | - | 0.39 | 92 | 45 | 8 | ||
|
| 160,057 | 88,162 | 18,771 | 26,562 | - | 0.39 | 92 | 45 | 8 | ||
|
| 160,136 | 88,163 | 18,829 | 26,572 | - | 0.39 | 92 | 45 | 8 | ||
|
| 159,781 | 88,009 | 18,758 | 26,507 | 37.5 | 0.39 | 92 | 45 | 8 | ||
| 159,927 | 88,009 | 18,736 | 26,541 | 19.8 | 0.39 | 92 | 45 | 8 | |||
|
| 159,905 | 88,034 | 18,787 | 26,542 | 78.4 | 0.39 | 92 | 45 | 8 | ||
|
| 159,250 | 87,640 | 18,702 | 26,454 | 195.0 | 0.39 | 92 | 45 | 8 | ||
|
| 159,121 | 87,541 | 18,732 | 26,424 | 78.0 | 0.39 | 92 | 45 | 8 | ||
|
| 159,810 | 88,008 | 18,720 | 26,541 | 73.2 | 0.39 | 92 | 45 | 8 | ||
|
| 159,849 | 88,044 | 18,749 | 26,528 | 88.3 | 0.39 | 92 | 45 | 8 | ||
|
| 159,774 | 87,977 | 18,751 | 26,523 | 56.5 | 0.39 | 92 | 45 | 8 | ||
|
| 159,836 | 88,026 | 18,750 | 26,530 | 82.0 | 0.39 | 92 | 45 | 8 | ||
|
| 159,790 | 88,001 | 18,757 | 26,516 | 66.4 | 0.39 | 92 | 45 | 8 | ||
|
| 159,810 | 87,837 | 18,769 | 26,602 | - | 0.39 | 92 | 45 | 8 | ||
|
| 159,486 | 87,554 | 18,762 | 26,585 | - | 0.39 | 92 | 45 | 8 | ||
|
| 160,105 | 88,136 | 18,777 | 26,596 | - | 0.39 | 92 | 45 | 8 | ||
| Minimum | 159,121 | 87,541 | 18,690 | 26,333 | 19.8 | 0.39 | 92 | 45 | 8 | ||
| Maximum | 160,232 | 88,294 | 18,998 | 26,603 | 195.0 | 0.39 | 92 | 45 | 8 | ||
| Mean | 159,878 | 88,018 | 18,772 | 26,544 | 78.2 | 0.39 | 92 | 45 | 8 | ||
Genes found in the 37 Magnoliaceae plastomes. Duplicated genes are underlined. Triplicated genes are in bold.
| Category | Gene Group | Gene Names |
|---|---|---|
| Photosynthesis | ATP synthase | |
| Cytochrome complex | ||
| NADH dehydrogenase | ||
| Photosystem I | ||
| Photosystem II | ||
| Rubisco large subunit |
| |
| rRNAs | ||
| Self-replication | Ribosomal proteins (LSU) | |
| Ribosomal proteins (SSU) | ||
| RNA polymerase | ||
| tRNAs | ||
| Other | Conserved open reading frames | |
| Cytochrome c synthesis |
| |
| Membrane protein |
| |
| Protease |
| |
| RNA processing |
| |
| Subunit of acetyl-CoA carboxylase |
| |
| Translational initiation |
|
Figure A2Sequence identity plot produced by Shuffle-LAGAN alignment in mVista comparing 37 Magnolia species; Magnolia coco was used as reference. Grey arrows represent genes with their orientation. Pink areas are conserved non-coding sequences (CNS). Blue areas are exons. The Y-axis represents the percentage of conservation of each species against the reference.
Figure 1Mauve progressive alignment, including all 37 Magnoliaceae plastomes. Blocks of the same color connected by a line represent colinear regions. Blocks below the graphs represent coding regions. Colinear regions appear in the same order in all species, which suggests that no significant rearrangement has been found.
Figure 2Nucleotide diversity (Pi) values resulting from the sliding window analysis of the 37 included Magnoliaceae plastomes. The Pi values ranged from 0 to 0.0236. The most diverse sites corresponded to genes such as PetL, ccsA, and ndhD, while the IR regions presented the lowest diversity. LSC = large single copy region; IR = inverted repeat regions; SSC = small single copy region.
Figure 3Expansion and contraction of 37 Magnoliaceae IR regions, analyzed with IRscope. All four junctions (LSC/IRb, IRb/SSC, SSC/IRa, and IRa/LSC) are shown, as well as their flanking genes.
Figure 4Phylogenetic relationships obtained from the Bayesian inference analysis of the newly assembled plastomes from 15 Neotropical species plus 22 plastomes downloaded from the NCBI; Liriodendron was used as an outgroup. The classification was according to [51,60]; The Neotropical subsections are shown in dotted brackets. The numbers at the nodes represent the ML bootstrap and BI posterior probability support values, respectively; nodes without numbers correspond to 100/1 support values. * = Neotropical groups.
Sampled Neotropical Magnolia species. The classification is according to [51,60]. The collection is either a herbarium, in which case the acronyms are according to [103], or living specimens from the natural reserve “El Refugio” in Dagua, Colombia [104]. NA = Not applicable.
| Section (Subsection) | Species | Country | Collection | Voucher |
|---|---|---|---|---|
|
| Mexico | XAL | M. Mata 1188b | |
| Mexico | XAL | M. Mata 0866a | ||
|
| Mexico | XAL | S. Cházaro B. and M. Rodríguez 8590 | |
| Mexico | IEB, MEXU | M.S. Samain and E. Martínez 2019-019 | ||
| Mexico | IEB, MEXU | F. Aldaba 187 | ||
| Ecuador | ECUAMZ | F. Arroyo and Á.J. Pérez 286 | ||
| Cuba | HAJB | A. Palmarola | ||
| Puerto Rico | GENT | E. Veltjen | ||
| Colombia | “El Refugio” Natural Reserve | NA | ||
| Venezuela | K | J. Steyermark 1191 | ||
| Colombia | “El Refugio” Natural Reserve | NA | ||
| Costa Rica | USJ | J.E. Jiménez 4622 | ||
| Colombia | K | J. Hernández et al. 1001 | ||
| Mexico | IBUG | J.A. Vázquez García | ||
| Cuba | HAJB | B. Falcón HFC88953 |
Sequencing results of the newly assembled Magnolia species. Columns show the number of paired raw reads, the number of paired reads after quality trimming with Trimmomatic, and the NCBI reference number. NA = Not applicable.
| Section (Subsection) | Species | Voucher | Raw Reads | Trimmed Reads | NCBI |
|---|---|---|---|---|---|
|
| M. Mata 1188b | 1,583,270 | 1,369,706 | TBD | |
| M. Mata 0866a | 1,576,433 | 1,435,083 | TBD | ||
|
| S. Cházaro B. and M. Rodríguez 8590 | 2,654,727 | 2,564,393 | TBD | |
| M.S. Samain and E. Martínez 2019-019 | 1,857,563 | 1,792,016 | TBD | ||
| F. Aldaba 187 | 1,713,367 | 1,672,698 | TBD | ||
| E. Veltjen and A. Dahua 2018-005 | 1,678,118 | 1,582,603 | TBD | ||
| NA | 1,772,721 | 1,731,278 | TBD | ||
| A. Palmarola et al. HFC-89195 | 638,499 | 586,472 | TBD | ||
| E. Veltjen et al. 2016-033 | 1,613,856 | 1,549,640 | TDB | ||
| NA | 2,008,282 | 1,910,887 | TBD | ||
| J. Steyermark 1191 | 1,603,135 | 1,514,091 | TBD | ||
| J.E. Jiménez 4622 | 1,889,242 | 1,806,280 | TBD | ||
| Hernández 1001 | 1,089,114 | 716,041 | TBD | ||
| J.A. Vázquez García et al. 9335 | 1,800,168 | 1,671,463 | TBD | ||
| B. Falcón HFC88953 | 1,750,979 | 1,611,208 | TBD |
Magnoliaceae plastomes downloaded from [39]; the classification is according to [51,60]. NA = Not applicable. 1: Synonym of Magnolia conifera var. chingii (Dandy) V.S.Kumar. 2: Synonym of M. vrieseana (Miq.) Baill. ex Pierre.
| Genus | Section | Species | NCBI reference |
|---|---|---|---|
|
| NA | MN990597 | |
| MN990625 | |||
|
|
| MN990599 | |
| MN990612 | |||
| MN990610 | |||
|
| MN990641 | ||
| KF753638 | |||
|
| MN990593 | ||
|
| MN990576 | ||
| MF990565 | |||
|
| MN990584 | ||
| MN990630 | |||
| MN990602 | |||
| MN990570 | |||
| MT269873 | |||
| MN990583 | |||
| MN990621 | |||
| MN990571 | |||
| MN990572 | |||
| MN990595 | |||
| MN990635 | |||
| MN990588 |