| Literature DB >> 30486510 |
Dingxuan He1,2, Andrew W Gichira3,4,5, Zhizhong Li6,7, John M Nzei8,9,10, Youhao Guo11, Qingfeng Wang12,13, Jinming Chen14.
Abstract
The order Nymphaeales, consisting of three families with a record of eight genera, has gained significant interest from botanists, probably due to its position as a basal angiosperm. The phylogenetic relationships within the order have been well studied; however, a few controversial nodes still remain in the Nymphaeaceae. The position of the Nuphar genus and the monophyly of the Nymphaeaceae family remain uncertain. This study adds to the increasing number of the completely sequenced plastid genomes of the Nymphaeales and applies a large chloroplast gene data set in reconstructing the intergeneric relationships within the Nymphaeaceae. Five complete chloroplast genomes were newly generated, including a first for the monotypic Euryale genus. Using a set of 66 protein-coding genes from the chloroplast genomes of 17 taxa, the phylogenetic position of Nuphar was determined and a monophyletic Nymphaeaceae family was obtained with convincing statistical support from both partitioned and unpartitioned data schemes. Although genomic comparative analyses revealed a high degree of synteny among the chloroplast genomes of the ancient angiosperms, key minor variations were evident, particularly in the contraction/expansion of the inverted-repeat regions and in RNA-editing events. Genome structure, and gene content and arrangement were highly conserved among the chloroplast genomes. The intergeneric relationships defined in this study are congruent with those inferred using morphological data.Entities:
Keywords: Nymphaeaceae; Nymphaeales; basal angiosperms; chloroplast; comparative genomics; phylogenomics; water lily
Mesh:
Substances:
Year: 2018 PMID: 30486510 PMCID: PMC6320877 DOI: 10.3390/ijms19123780
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Characteristics of chloroplast genomes of five species of Nymphaeaceae.
| Name of Organism |
| ||||
|---|---|---|---|---|---|
| GenBank accession number | KY392762 | KY392765 | MH050795 | MH050796 | MH050797 |
| Genome size (bp) | 160,051 | 159,930 | 160,858 | 160,737 | 160,645 |
| Large single copy (LSC) length (bp) | 90,026 | 89,677 | 90,375 | 90,610 | 90,551 |
| Small single copy (SSC) length (bp) | 19,325 | 20,201 | 18,811 | 18,865 | 18,830 |
| Inverted repeat (IR) length (bp) | 25,350 | 25,026 | 25,836 | 25,631 | 25,632 |
| Number of genes | 113 | 113 | 113 | 113 | 113 |
| Number of protein-coding genes (duplicated in IR) | 79 (6) | 79 (6) | 79 (6) | 79 (6) | 79 (6) |
| Number of tRNA genes (duplicated in IR) | 30 (7) | 30 (7) | 30 (8) | 30 (7) | 30 (7) |
| Number of rRNA genes (duplicated in IR) | 4 (4) | 4 (4) | 4 (4) | 4 (4) | 4 (4) |
| Number of genes with one intron (two introns) | 15 (3) | 15 (3) | 15 (3) | 15 (3) | 15 (3) |
| Proportion of coding to noncoding regions | 0.68 | 0.68 | 0.69 | 0.68 | 0.69 |
| Average gene density (genes/kb) | 0.82 | 0.82 | 0.83 | 0.82 | 0.82 |
| GC content (%) | 39.1 | 39.1 | 39.1 | 39.1 | 39.1 |
Figure 1Circular gene maps of five chloroplast genomes of Nymphaeaceae. Grey arrows indicate the direction in which genes are transcribed. Color codes indicates the various gene functional groups, and the grey-shaded part in the inner circle shows the GC level of each chloroplast genome.
List of genes encoded in each of the five chloroplast genomes of Nymphaeaceae.
| Category | Gene Type | Gene | ||||||
|---|---|---|---|---|---|---|---|---|
| Self-replication | Ribosomal RNA |
|
|
|
| |||
| Transfer RNA |
|
|
|
| ||||
|
|
|
|
|
| ||||
|
|
|
|
|
| ||||
|
|
|
|
|
| ||||
|
|
|
|
|
| ||||
| Small ribosomal units |
|
|
|
|
| |||
|
|
|
|
|
|
| |||
| Large ribosomal units |
|
|
|
|
|
| ||
|
|
| |||||||
| RNA polymerase subunits |
|
|
| |||||
| translation initiation factor |
| |||||||
| Photosynthesis genes | NADH dehydrogenase |
|
|
|
| |||
|
|
|
|
|
| ||||
| photosystem I |
|
|
|
|
|
| ||
| photosystem II |
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| ||
|
| ||||||||
| cytochrome b/f complex |
|
|
|
|
|
| ||
| ATP synthase |
|
|
|
|
| |||
| Large subunit of rubisco |
| |||||||
| Other genes | Maturase |
| ||||||
| Protease | ||||||||
| Acetyl-CoA-carboxylase sub-unit |
| |||||||
| Envelope membrane protein |
| |||||||
| Component of TIC complex |
| |||||||
| c-type cytochrome synthesis |
| |||||||
| Unknown | hypothetical genes reading frames |
| ||||||
Notes: the * and ** symbols indicate genes with one and two intron(s) respectively.
Figure 2Details of codon preferences (bar) and relative synonymous codon usage values (line) of 12 chloroplast genomes of Nymphaeaceae.
List of protein-coding genes affected by RNA editing in each of the 12 chloroplast genomes of Nymphaeaceae.
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
| ||||
|
| |||||||||||
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||||||||||
|
|
|
|
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Note: the superscript number indicates the number of edited sites in each of the 19 protein-coding genes common in all the genomes.
Figure 3Number of RNA-editing sites in each of the transcripts of 19 common genes in all analyzed chloroplast genomes.
Figure 4Comparison of the border positions of the large single copy, small single copy, and the inverted-repeat regions among chloroplast genomes of twelve species of Nymphaeaceae. Complete genes and portions of genes adjacent to the junctions are depicted by differently colored blocks.
Figure 5Mauve software alignment of the whole chloroplast genome of 12 species of Nymphaeaceae. Local collinear blocks representing identical gene clusters are depicted by the same color and are connected by lines.
Figure 6Phylogenetic relationships among the species of Nymphaeaceae, Cabombaceae, and Hydatellaceae (outgroup). The Maximum Likelihood (ML) and Bayesian Inference (BI) phylogenetic tree was based on 66 protein codon genes. The numbers indicate ML bootstrap support (100) and BI posterior probabilities (1.0) values. The - symbol indicates maximum support. The first two values and the last two are for unpartitioned data and partitioned data respectively.