| Literature DB >> 35588136 |
Ting Zhou1, Kun Ning2, Zhenghai Mo1, Fan Zhang1, Yanwei Zhou1, Xinran Chong1, Donglin Zhang3, Yousry A El-Kassaby4, Jian Bian5, Hong Chen1.
Abstract
Ilex dabieshanensis K. Yao & M. B. Deng is not only a highly valued tree species for landscaping, it is also a good material for making kuding tea due to its anti-inflammatory and lipid-lowering medicinal properties. Utilizing next-generation and long-read sequencing technologies, we assembled the whole chloroplast genome of I. dabieshanensis. The genome was 157,218 bp in length, exhibiting a typical quadripartite structure with a large single copy (LSC: 86,607 bp), a small single copy (SSC: 18,427 bp) and a pair of inverted repeat regions (IRA and IRB: each of 26,092 bp). A total of 121 predicted genes were encoded, including 113 distinctive (79 protein-coding genes, 30 tRNAs, and 4 rRNAs) and 8 duplicated (8 protein-coding genes) located in the IR regions. Overall, 132 SSRs and 43 long repeats were detected and could be used as potential molecular markers. Comparative analyses of four traditional Ilex tea species (I. dabieshanensis, I. paraguariensis, I. latifolia and I. cornuta) revealed seven divergent regions: matK-rps16, trnS-psbZ, trnT-trnL, atpB-rbcL, petB-petD, rpl14-rpl16, and rpl32-trnL. These variations might be applicable for distinguishing different species within the genus Ilex. Phylogenetic reconstruction strongly suggested that I. dabieshanensis formed a sister clade to I. cornuta and also showed a close relationship to I. latifolia. The generated chloroplast genome information in our study is significant for Ilex tea germplasm identification, phylogeny and genetic improvement.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35588136 PMCID: PMC9119449 DOI: 10.1371/journal.pone.0268679
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Circular gene map of I. dabieshanensis chloroplast genome.
The gray arrowheads indicate gene directions. Genes inside and outside the circle represent transcribed clockwisely and counterclockwisely. Different functional genes are color coded. The innermost darker gray conforms to GC content, whereas the lighter gray conforms to AT content. LSC, large single copy region; SSC, small single copy region; IR, inverted repeat.
Summary of I. dabieshanensis chloroplast genome.
| Genome Features | |
|---|---|
| Genome size (bp) / GC content (%) | 157,218 / 37.69 |
| LSC size (bp) / GC content (%) | 86,607 / 35.75 |
| SSC size (bp) / GC content (%) | 18,427 / 31.93 |
| IR size (bp) / GC content (%) | 26,092 / 42.94 |
| Total gene number | 121 |
| Unique gene number | 113 |
| Protein-coding gene | 79 |
| tRNAs | 30 |
| rRNAs | 4 |
| Genes duplicated in IRs | 8 |
LSC, large single copy region; SSC, small single copy region; IR, inverted repeat.
List of genes in the chloroplast genome of I. dabieshanensis.
| Category | Group of Genes/Function | Name of Genes |
|---|---|---|
| Photosynthesis | Subunits_of_photosystem_I | |
| Subunits_of_photosystem_II | ||
| Subunits_of_NADH_dehydrogenase | ||
| Subunits_of_cytochrome_b/f_complex | ||
| Subunits_of_ATP_synthase | ||
| Large_subunit_of_Rubisco |
| |
| Self-replication | Large_subunits_of_ribosome | |
| Small_subunits_of_ribosome | ||
| DNA-dependent_RNA_polymerase | ||
| Ribosomal_RNAs | ||
| Transfer_RNAs | ||
| Other genes | Maturase |
|
| Protease |
| |
| Envelope_membrane_protein |
| |
| Acetyl-CoA_carboxylase |
| |
| C-type_cytochrome_synthesis_gene |
| |
| Translation_initiation_factor |
| |
| Unknown genes | Proteins_of_unknown_function |
Two gene copies in IRS
b Genes containing introns
c Genes divided into two independent transcription units.
Fig 2Distribution of SSR Types in I. dabieshanensis chloroplast genome.
A) Number of SSRs detected; B) Number of identified SSRs in different repeat class types; C) Number of identified SSRs in different genomic regions; and D) Number of identified SSRs in intergenic regions, introns, and coding regions.
Fig 3Analysis of identified long repeat sequences in I. dabieshanensis chloroplast genome.
A) Number of long repeats in four identified types; B) Number of palindromic repeats by length; and C) Number of forward repeats by length.
Codon usage in I. dabieshanensis.
| Amino Acids | Codon | No. | RSCU | Amino Acids | Codon | No. | RSCU |
|---|---|---|---|---|---|---|---|
| Ala | GCA | 426 | 1.2136 | Pro | CCA | 326 | 1.156 |
| Ala | GCC | 212 | 0.6039 | Pro | CCC | 209 | 0.7411 |
| Ala | GCG | 125 | 0.3561 | Pro | CCG | 156 | 0.5531 |
| Ala | GCT | 641 | 1.8262 | Pro | CCT | 437 | 1.5496 |
| Cys | TGC | 69 | 0.4584 | Glu | CAA | 734 | 1.5244 |
| Cys | TGT | 232 | 1.5415 | Glu | CAG | 229 | 0.4755 |
| Asp | GAC | 209 | 0.3745 | Arg | AGA | 505 | 1.8486 |
| Asp | GAT | 907 | 1.6254 | Arg | AGG | 163 | 0.5967 |
| Glu | GAA | 1051 | 1.5155 | Arg | CGA | 380 | 1.391 |
| Glu | GAG | 336 | 0.4844 | Arg | CGC | 107 | 0.3917 |
| Phe | TTC | 543 | 0.7249 | Arg | CGG | 135 | 0.4942 |
| Phe | TTT | 955 | 1.275 | Arg | CGT | 349 | 1.2776 |
| Gly | GGA | 759 | 1.6517 | Ser | AGC | 113 | 0.3242 |
| Gly | GGC | 180 | 0.3917 | Ser | AGT | 422 | 1.2109 |
| Gly | GGG | 320 | 0.6964 | Ser | TCA | 424 | 1.2166 |
| Gly | GGT | 579 | 1.26 | Ser | TCC | 334 | 0.9583 |
| His | CAC | 150 | 0.4643 | Ser | TCG | 187 | 0.5365 |
| His | CAT | 496 | 1.5356 | Ser | TCT | 611 | 1.7532 |
| Ile | ATA | 705 | 0.9288 | STOP | TAA | 43 | 1.4659 |
| Ile | ATC | 464 | 0.6113 | STOP | TAG | 24 | 0.8181 |
| Ile | ATT | 1108 | 1.4598 | STOP | TGA | 21 | 0.7159 |
| Lys | AAA | 1057 | 1.4783 | Thr | ACA | 434 | 1.2534 |
| Lys | AAG | 373 | 0.5216 | Thr | ACC | 248 | 0.7162 |
| Leu | CTA | 389 | 0.8244 | Thr | ACG | 142 | 0.4101 |
| Leu | CTC | 201 | 0.4259 | Thr | ACT | 561 | 1.6202 |
| Leu | CTG | 190 | 0.4026 | Val | GTA | 539 | 1.4706 |
| Leu | CTT | 602 | 1.2758 | Val | GTC | 203 | 0.5538 |
| Leu | TTA | 869 | 1.8417 | Val | GTG | 209 | 0.5702 |
| Leu | TTG | 580 | 1.2292 | Val | GTT | 515 | 1.4051 |
| Met | ATG | 647 | 1 | Trp | TGG | 476 | 1 |
| Asn | AAC | 303 | 0.459 | Tyr | TAC | 194 | 0.3876 |
| Asn | AAT | 1017 | 1.5409 | Tyr | TAT | 807 | 1.6123 |
Fig 4Chloroplast genome comparisons among four Ilex species by mVISTA.
The chloroplast genome of I. dabieshanensis was used as a reference. Gray arrows above the alignment represent gene orientation. Different genome regions are color-coded as exons (purple bars), rRNA or tRNA (sky-blue bars), and non-coding sequences (CNS, pink bars). The horizontal axis indicates the coordinates within the chloroplast genome and vertical scale indicates the percentage of identity ranging from 50 to 100%. Regions with sequence variation among the four Ilex species were marked with white peaks.
Fig 5Comparisions of the SSC/IRs and LSC/IRs junctions among four Ilex chloroplast genomes.
Colored boxes above the strip with scaled sequence length represent the denoted genes. Between boxed genes and boundaries are the gaps in base length (bp). JLB, JSB, JSA, and JLA indicate IRB/LSC, IRB/SSC, IRA/SSC, and IRA/LSC junctions, respectively.
Fig 6Phylogenetic tree for 19 Ilex species in the family Aquifoliaceae using Maximum Likelihood (ML) method, based on alignments of complete chloroplast genomes.
The genomes of Helwingia chinensis and H. himalaica were set as outgroups. Numbers next to the branches indicated bootstrap values from 1,000 replicates.