| Literature DB >> 33790948 |
Shaojuan Qian1, Yonghong Zhang1, Shiou Yih Lee2.
Abstract
The complete chloroplast genomes of three species of Edgeworthia namely, Edgeworthia albiflora, Edgeworthia chrysantha, and Edgeworthia gardneri (Thymelaeaceae), are reported and characterized. The chloroplast genomes displayed a typical quadripartite structure with conserved genome arrangement and specific divergence. The genomes ranged in length from 172,708 to 173,621 bp and displayed similar GC content of 36.5-36.7%. A total of 138-139 genes were predicted, including 92-93 protein-coding, 38 tRNAs and eight rRNAs genes. Variation in the number of short simple repeats and inverted region boundaries of the three cp genomes were observed. A mutational hotspot was detected along the nucleotide sequence from the ndhF to the trnL-UAG genes. The chloroplast genome-based and internal transcribed spacer (ITS)-based phylogenetic analyses using maximum-likelihood (ML) and Bayesian inference (BI) revealed that E. albiflora diverged before E. chrysantha and E. gardneri and placed the Edgeworthia clade at the base of the Eurasian Daphne group with strong bootstrap support. With an effective taxonomic treatment of the species of Edgeworthia, further molecular analyses of their intra- and interspecific genetic variation are inclined to support the treatment of E. albiflora and E. gardneri as two natural groups. The genetic information obtained from this study will provide valuable genomic resources for the identification of additional species and for deducing the phylogenetic evolution of Edgeworthia.Entities:
Keywords: Edgeworthia; Thymelaeaceae; chloroplast genome; comparative analysis; internal transcribed spacers; phylogenetic relationship
Year: 2021 PMID: 33790948 PMCID: PMC8006312 DOI: 10.3389/fgene.2021.643552
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Basic characteristics of chloroplast genomes in three species of Edgeworthia.
| Sample location | Miyi, Sichuan, China | Kunming Botanical Garden, Yunnan, China | Motuo, Tibet, China |
| Collector and collection number | Zhang Y.; ZYH11 | Zhang Y.; RXK26 | Zhang Y.; RXK53 |
| Chloroplast genome GenBank accession number | |||
| ITS GenBank accession number | |||
| LSC (bp) | 86,862 | 85,824 | 86,388 |
| SSC (bp) | 2,681 | 2,816 | 3,017 |
| IR (bp) | 42,039 | 42,034 | 41,952 |
| Total (bp) | 173,621 | 172,708 | 173,309 |
| Protein-coding genes | 93 | 93 | 92 |
| tRNA genes | 38 | 38 | 38 |
| rRNA genes | 8 | 8 | 8 |
| Total | 139 | 139 | 138 |
| GC content (%) | 36.5 | 36.7 | 36.5 |
FIGURE 1Gene map for chloroplast genomes of three species of Edgeworthia. Annotated genes are colored according to functional categories. Genes placed outside the outer circle were transcribed clockwise; genes placed inside the circle were transcribed counterclockwise. Dark gray in inner circle represents GC content whereas light gray corresponds to AT content.
Gene contents in three Edgeworthia species chloroplast genome.
| Large ribosomal subunits | |
| Small ribosomal subunits | |
| DNA dependent RNA polymerase | |
| Ribosomal RNAs | |
| Transfer RNAs | |
| Photosystem I | |
| Photosystem II | |
| NAD(P)H dehydrogenase complex | |
| F-type ATP synthase | |
| Cytochrome b6/f complex | |
| Rubisco | |
| Envelope membrane protein | |
| Maturase | |
| Cytochrome c biogenesis protein | |
| Subunit of acetyl-CoA-carboxylase | |
| Conserved open reading frames | |
Locations and sizes of the intron-containing genes recorded in the chloroplast genome of the three Edgeworthia species.
| LSC | 144 | 864 | 411 | na | na | ||
| IR | 558 | 1165 | 540 | na | na | ||
| IR | 777 | 686 | 756 | na | na | ||
| LSC | 6 | 770 | 698 | na | na | ||
| LSC | 8 | 786 | 475 | na | na | ||
| IR | 393 | 724 | 391 | na | na | ||
| LSC | 9 | 1080 | 399 | na | na | ||
| IR | 38 | 812 | 35 | na | na | ||
| LSC | 23 | 720 | 49 | na | na | ||
| IR | 37 | 955 | 35 | na | na | ||
| LSC | 37 | 2559 | 35 | na | na | ||
| LSC | 34 | 546 | 50 | na | na | ||
| LSC | 38 | 615 | 35 | na | na | ||
| LSC | 126 | 732 | 228 | 761 | 153 | ||
| LSC | 144 | 867 | 411 | na | na | ||
| IR | 558 | 1163 | 540 | na | na | ||
| IR | 777 | 690 | 756 | na | na | ||
| LSC | 6 | 759 | 698 | na | na | ||
| LSC | 8 | 790 | 475 | na | na | ||
| IR | 391 | 683 | 434 | na | na | ||
| LSC | 9 | 1066 | 399 | na | na | ||
| LSC | 432 | 767 | 1617 | na | na | ||
| LSC | 40 | 934 | 206 | na | na | ||
| IR | 38 | 811 | 35 | na | na | ||
| LSC | 23 | 718 | 49 | na | na | ||
| IR | 37 | 955 | 35 | na | na | ||
| LSC | 37 | 2555 | 35 | na | na | ||
| LSC | 34 | 551 | 50 | na | na | ||
| LSC | 38 | 621 | 35 | na | na | ||
| LSC | 126 | 732 | 228 | 761 | 153 | ||
| LSC | 144 | 856 | 411 | na | na | ||
| IR | 558 | 1162 | 540 | na | na | ||
| IR | 777 | 690 | 756 | na | na | ||
| LSC | 6 | 766 | 698 | na | na | ||
| LSC | 8 | 781 | 475 | na | na | ||
| IR | 393 | 681 | 434 | na | na | ||
| LSC | 9 | 1076 | 399 | na | na | ||
| LSC | 432 | 757 | 1617 | na | na | ||
| LSC | 40 | 946 | 200 | na | na | ||
| IR | 38 | 812 | 35 | na | na | ||
| LSC | 23 | 725 | 49 | na | na | ||
| IR | 37 | 955 | 35 | na | na | ||
| LSC | 37 | 2531 | 35 | na | na | ||
| LSC | 34 | 547 | 50 | na | na | ||
| LSC | 38 | 620 | 35 | na | na | ||
| LSC | 126 | 727 | 228 | 762 | 153 |
FIGURE 2Distribution of small sequence repeats (SSRs) in the chloroplast genomes of three species of Edgeworthia. (A) Number of different SSR types detected in chloroplast genomes of three species of Edgeworthia; (B) Frequencies of identified SSRs in large single-copy (LSC), small single-copy (SSC), and inverted repeats (IRs) regions; (C) Frequencies of identified SSRs in the gene coding and intergenic region. (D) Frequencies of four different large repeat types (complement, forward, palindromic, and reverse) in the chloroplast genomes of three species of Edgeworthia; (E) Frequencies of large repeats based on different sequence length (bp) groups; (F) Frequencies of large repeats in the LSC, SSC, and IRs regions.
FIGURE 3Comparison of borders between large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions across ten chloroplast genomes of Thymelaeaceae.
FIGURE 4Complete chloroplast genome comparison of three species of Edgeworthia using mVISTA alignment program. Genome regions are color-coded and described in the figure legend.
FIGURE 5Sliding window analysis of the complete chloroplast genome sequences among three species of Edgeworthia (Window length: 1000 bp; step size: 500 bp).
FIGURE 6Maximum-likelihood (ML) and MrBayes (BI) tree analyses were based on 1,000 bootstrap replicates. Bootstrap support values and Bayesian posterior probabilities are indicated at each branch nodes. Three species of Edgeworthia used in this study are highlighted in bold. (A) Combined phylogenetic tree of Edgeworthia and allied genera based on the chloroplast genome sequences of 15 taxa from Thymelaeaceae. Two related species, Hibiscus hamabo (Malvaceae; KR259988) and Eugenia uniflora (Myrtaceae; KR867678) were included as outgroups. (B) Phylogenetic analyses of Thymelaeaceae based on nuclear ribosomal DNA internal transcribed spacer (ITS) gene sequences. A total of 23 ITS sequences from the members of Thymelaeaceae, representing 21 taxa from eight genera in the Daphne group of tribe Daphneae, two taxa from tribe Aquilarieae and one taxon from subfamily Octolepidoideae.