| Literature DB >> 30104517 |
Qixiang Lu1, Wenqing Ye2, Ruisen Lu3, Wuqin Xu4, Yingxiong Qiu5.
Abstract
The monocot genus Croomia (Stemonaceae) comprises three herbaceous perennial species that exhibit EA (Eastern Asian)⁻ENA (Eastern North American) disjunct distribution. However, due to the lack of effective genomic resources, its evolutionary history is still weakly resolved. In the present study, we conducted comparative analysis of the complete chloroplast (cp) genomes of three Croomia species and two Stemona species. These five cp genomes proved highly similar in overall size (154,407⁻155,261 bp), structure, gene order and content. All five cp genomes contained the same 114 unique genes consisting of 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Gene content, gene order, AT content and IR/SC boundary structures were almost the same among the five Stemonaceae cp genomes, except that the Stemona cp genome was found to contain an inversion in cemA and petA. The lengths of five genomes varied due to contraction/expansion of the IR/SC borders. A/T mononucleotides were the richest Simple Sequence Repeats (SSRs). A total of 46, 48, 47, 61 and 60 repeats were identified in C. japonica, C. heterosepala, C. pauciflora, S. japonica and S. mairei, respectively. A comparison of pairwise sequence divergence values across all introns and intergenic spacers revealed that the ndhF⁻rpl32, psbM⁻trnD and trnS⁻trnG regions are the fastest-evolving regions. These regions are therefore likely to be the best choices for molecular evolutionary and systematic studies at low taxonomic levels in Stemonaceae. Phylogenetic analyses of the complete cp genomes and 78 protein-coding genes strongly supported the monophyly of Croomia. Two Asian species were identified as sisters that likely diverged in the Early Pleistocene (1.62 Mya, 95% HPD: 1.125⁻2.251 Mya), whereas the divergence of C. pauciflora dated back to the Late Miocene (4.77 Mya, 95% HPD: 3.626⁻6.162 Mya). The availability of these cp genomes will provide valuable genetic resources for further population genetics and phylogeographic studies on Croomia.Entities:
Keywords: Croomia; Stemona; biogeography; chloroplast genome; comparative genomics; phylogeny
Mesh:
Year: 2018 PMID: 30104517 PMCID: PMC6122011 DOI: 10.3390/ijms19082383
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Gene maps of Croomia and Stemona chloroplast genomes. (A) Croomia japonica; (B) Stemona japonica.
List of genes in Stemonaceae chloroplast genomes.
| Category of Genes | Groups of Genes | Names of Genes |
|---|---|---|
| Self-replication | rRNA genes | |
| tRNA genes | ||
| Small subunit of ribosome | ||
| Large subunit of ribosome | ||
| DNA-dependent RNA polymerase | ||
| Genes for photosynthesis | Subunit of NADH-dehydrogenase | |
| Subunit of Photosystem 1 | ||
| Subunit of Photosystem 2 | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Large subunit of rubisco |
| |
| Other genes | Maturase | |
| Protease | ||
| Envelope membrane protein |
| |
| Subunit of Acetyl-CoA-carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Translation initiation factor IF-1 |
| |
| Genes of unknown function | Open reading frames (ORF, ycf) |
* Gene with one intron, ** gene with two introns; (×2) indicates genes duplicated in the IR region.
Figure 2Comparison of LSC, IR and SSC junction positions among five Stemonaceae chloroplast genomes.
Figure 3Sequence identity plots among five Stemonaceae chloroplast genomes, with Stemona japonica as a reference. CNS: conserved non-coding sequences; UTR: untranslated region.
Figure 4The nucleotide variability (Pi) values were compared among five Stemonaceae species.
Figure A1PIC values of the top ten most variable noncoding regions in Stemonaceae.
Figure 5Analysis of repeated sequences in five Stemonaceae chloroplast genomes. (A) Frequency of repeats by length; (B) frequency of repeat types.
Figure 6Simple Sequence Repeats (SSRs) in five Stemonaceae chloroplast genomes. (A) Numbers of SSRs by length; (B) distribution of SSR loci. IGS: intergenic spacer region; CDS: protein-coding regions.
Figure 7Phylogenetic relationships of three Croomia species inferred from Maximum Likelihood (ML) and Bayesian Inference (BI) and divergence time of three Croomia species estimated using Bayesian Evolutionary Analysis Sampling Trees (BEAST) analysis. Numbers above the lines represent ML bootstrap values and BI posterior probability. Blue bars indicate the 95% highest posterior density (HPD) credibility intervals for node ages (Mya). Numbers at the node represent divergence time (Mya) and 95% highest posterior density intervals. The phylogenetic tree based on 74 protein-coding genes is completely consistent with this topology.
Figure A2The Bayesian Binary MCMC (BBM) analysis of ancestral area reconstruction.