| Literature DB >> 31332227 |
Xiaoxue Li1, Wei Tan2, Jiqi Sun1, Junhua Du3, Chenguang Zheng1, Xiaoxuan Tian2, Min Zheng1, Beibei Xiang4, Yong Wang5.
Abstract
High-throughput sequencing of chloroplast genomes has been used to gain insight into the evolutionary relationships of plant species. In this study, we sequenced the complete chloroplast genomes of four species in the Meconopsis genus: M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. These plants grow in the wild and are recognized as having important medicinal and ornamental applications. The sequencing results showed that the size of the Meconopsis chloroplast genome ranges from 151864 to 153816 bp. A total of 127 genes comprising 90 protein-coding genes, 37 tRNA genes and 8 rRNA genes were observed in all four chloroplast genomes. Comparative analysis of the four chloroplast genomes revealed five hotspot regions (matK, rpoC2, petA, ndhF, and ycf1), which could potentially be used as unique molecular markers for species identification. In addition, the ycf1 gene may also be used as an effective molecular marker to distinguish Papaveraceae and determine the evolutionary relationships among plant species in the Papaveraceae family. Futhermore, these four genomes can provide valuable genetic information for other related studies.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31332227 PMCID: PMC6646306 DOI: 10.1038/s41598-019-47008-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Chloroplast genome maps of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. Genes inside circles are transcribed clockwise, genes outside circles are transcribed counterclockwise. The light gray inner circle corresponds to the AT content, and the dark gray circle corresponds to the GC content. Genes belonging to different functional groups are shown in different colors.
Summary of assembly data for the Meconopsis chloroplast genome.
| Species |
|
|
| |
|---|---|---|---|---|
| Genome size (bp) | 153816 | 151864 | 153785 | 153259 |
| IR (bp) | 51988 | 51306 | 51988 | 51548 |
| LSC (bp) | 83930 | 82809 | 83899 | 83982 |
| SSC (bp) | 17898 | 17749 | 17898 | 17729 |
| Total number of genes | 127 | 127 | 127 | 127 |
| rRNA | 8 | 8 | 8 | 8 |
| tRNA | 37 | 37 | 37 | 37 |
| Protein-coding genes | 90 | 90 | 90 | 90 |
| A % | 30.4 | 30.4 | 30.4 | 30.5 |
| C % | 19.8 | 19.8 | 19.8 | 19.7 |
| G % | 18.9 | 19 | 18.9 | 18.8 |
| T % | 30.9 | 30.8 | 30.9 | 31 |
| G C% | 38.7 | 38.8 | 38.8 | 38.5 |
Chloroplast genome gene content and functional classification in M.
| Category | Group | Genes |
|---|---|---|
| Self-replication | Large subunit of ribosome (LSU) | |
| Small subunit of ribosome (SSU) | ||
| DNA dependent RNA polymerase | ||
| Ribosome RNA | ||
| Transfer RNAs (tRNA) | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NADH dehydrogenase | ||
| Cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Large subunit of rubisco |
| |
| Other genes | Translational initiation factor |
|
| ATP-dependent protease subunit p gene |
| |
| Maturase |
| |
| Envelop membrane protein |
| |
| Unknown function | Subunit of acetyl-CoA-carboxylase |
|
| C-type cytochrome synthesis gene |
| |
| Hypothetical chloroplast reading frames |
racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea. aGenes containing introns; bTwo gene copies in IR.
Figure 2Codon content of twenty amino acids and stop codons in all protein-coding genes of the chloroplast genomes of four Meconopsis species.
Types and numbers of SSRs in the chloroplast genomes of M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea.
| SSR type | Repeat unit | Species | |||
|---|---|---|---|---|---|
|
|
|
| |||
| Mono | A/T | 24 | 22 | 23 | 19 |
| Di | AG/CT | 1 | 1 | 1 | 1 |
| AC/GT | 0 | 0 | 0 | 1 | |
| AT/AT | 7 | 4 | 7 | 6 | |
| Tri | AAT/ATT | 2 | 2 | 2 | 2 |
| Tetra | AAAT/ATTT | 3 | 2 | 3 | 2 |
| AACC/GGTT | 1 | 1 | 1 | 1 | |
| AGAT/ATCT | 1 | 1 | 1 | 0 | |
| ATCC/ATGG | 0 | 0 | 0 | 1 | |
| Hexa | AATGAT/ATCATT | 0 | 0 | 0 | 1 |
| AAAAT/ATTTT | 1 | 0 | 0 | 0 | |
Figure 3Frequency of repeat sequences of the M. racemosa, M. integrifolia (Maxim.) Franch, M. horridula and M. punicea chloroplast genomes determined by REPuter.
Figure 4Nucleotide variability (%) values between pairs of the four Meconopsis species. (a) Using four species whole genomes; (b) Using four species coding regions.
Figure 5Sequence identity plot comparing the eight chloroplast genomes with Macleaya microcarpa as a reference by using mVista. Pink bars represent noncoding sequences (CNS), and white peaks represent genomic differences. The y-axis represents the percentage identity (shown: 50–100%).
Figure 6Phylogenetic tree reconstruction of the 42 species inferred from Bayesian inference (BI) and maximum likelihood (ML) based on 90 protein-coding genes. Numbers above the lines represent BI/ML posterior probabilities.
Figure 7Phylogenetic tree reconstruction of the 13 species inferred from maximum likelihood (ML), based on the ycf1 genes.