| Literature DB >> 30941367 |
Xinlian Chen1, Yingxian Cui1, Liping Nie1, Haoyu Hu2, Zhichao Xu1, Wei Sun2, Ting Gao3, Jingyuan Song1, Hui Yao1.
Abstract
Ephedrae Herba and Ephedrae Radix et Rhizoma (Mahuang) have been used as Chinese herbal medicines. Ephedra plants mainly live in deserts and have good governance of desertification. Despite their important medicinal and environmental protection value, dietary supplements containing ephedrine from Ephedra species may threaten the health of people. Morphological resemblance amongst species causes difficulty in identifying the original species of Ephedra herbs. Chloroplast (CP) genome shows good prospects in identification and phylogenetic analysis. This study introduced the structures of the CP genomes of three Ephedra species and analysed their phylogenetic relationships. Three complete CP genomes of Ephedra showed four-part annular structures, namely, two single-copy regions and two inverted repeat regions. The entire CP genomes of three Ephedra species in terms of size were 109,550 bp (E. sinica), 109,667 bp (E. intermedia), and 109,558 bp (E. equisetina). Each CP genome of the three Ephedra species encoded 118 genes, including 73 protein-coding genes, 37 tRNA genes and 8 ribosomal RNA genes. Eleven high-variation regions were screened through mVISTA to be potential specific DNA barcodes for identifying Ephedra species. Maximum likelihood and maximum parsimony trees showed that CP genomes could be used to identify Ephedra species. The Ephedra species had a close phylogenetic relationship with Gnetum species and Welwitschia mirabilis. This research provided valuable information for the identification and phylogenetic analysis of gymnosperms and drug safety of Ephedra.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30941367 PMCID: PMC6420972 DOI: 10.1155/2019/5921725
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Gene maps of the complete CP genomes of the three Ephedra species. Genes on the inside of the circle are transcribed clockwise, while those outside are transcribed counter clockwise. The darker gray in the inner circle corresponds to GC content, whereas the lighter gray corresponds to AT content.
Statistics for assembly of the three CP genomes and two published of Ephedra species.
| Latin name |
|
|
|
|
|
|---|---|---|---|---|---|
| Gene size (bp) | 109,667 | 109,550 | 109,558 | 109,518 | 109,584 |
| LSC (bp) | 59,936 | 59,961 | 59,976 | 59,906 | 60,027 |
| SSC (bp) | 8,247 | 8,103 | 8,078 | 8,104 | 8,079 |
| IR (bp) | 20,742 | 20,743 | 20,752 | 20,754 | 20,739 |
| GC Content (%) | 36.6 | 36.7 | 36.6 | 36.7 | 36.7 |
List of genes found in the three CP genomes of Ephedra species.
| No. | Group of genes | Gene names | Amount |
|---|---|---|---|
| 1 | Photosystem I |
| 5 |
| 2 | Photosystem II |
| 15 |
| 3 | Cytochrome b/f complex |
| 6 |
| 4 | ATP synthase |
| 6 |
| 5 | RubisCO large subunit |
| 1 |
| 6 | RNA polymerase |
| 4 |
| 7 | Ribosomal proteins (SSU) |
| 14 |
| 8 | Ribosomal proteins (LSU) | rpl2 | 7 |
| 9 | Photochlorophyllide reductase subunit B/L/N |
| 5 |
| 10 | Proteins of unknown function |
| 5 |
| 11 | Transfer RNAs | 37 | 37 |
| 12 | Ribosomal RNAs |
| 8 |
| 13 | Other genes |
| 5 |
Gene contains one intron; gene contains two introns; (×2) indicates the number of the repeat unit is 2.
Figure 2Codon content of 20 amino acid and stop codons in all protein-coding genes of the CP genomes of three Ephedra species. The order of every three columns is E. intermedia, E. sinica, and E. equisetina, respectively.
Figure 3Repeat sequences in three CP genomes. REPuter was used to identify repeat sequences with length ≥ 30 bp and sequences identified ≥ 90% in the CP genomes. F, P, R, and C indicate the repeat types F (forward), P (palindrome), R (reverse), and C (complement). Repeats with different lengths are indicated in different colors.
Figure 4Sequence identity plot comparing five CP genomes with E. intermedia as a reference using mVISTA. Gray arrows and thick black lines above the alignment indicate genes with their orientation and the position of the IRs, respectively. A cutoff of 70% identity was used for the plots, and the Y-scale represents the percent identity ranging from 50 to 100%.
Figure 5Comparison of the borders of LSC, SSC, and IR regions amongst three CP genomes. Number above the gene features means the distance between the ends of genes and the borders sites. These features are not to scale.
Figure 6Sliding window analysis of the whole CP genomes. Window length: 800 bp; step size: 200 bp. X-axis: position of the midpoint of a window. Y-axis: nucleotide diversity of each window. Pi amongst three Ephedra species. Mutational hotspots and highly divergent loci were marked.
Figure 7Phylogenetic trees constructed using ML and MP methods based on common protein-coding genes of three Ephedra and other 16 species. Numbers (MP/ML) above the branches are bootstrap support values. Only one support value means the same value.