| Literature DB >> 35873491 |
Linjun Yan1, Huanli Wang1, Xi Huang1, Yingchao Li2, Yuanhao Yue1, Zhongwei Wang1, Shijie Tang1.
Abstract
Tilia is a complex genus in the family Malvaceae that has high ecological and economical values. Owing to the lack of sufficient distinguishable morphological and molecular characteristics, interspecific relationships in this genus are not clear. Chloroplast (cp) genomes are small, meanwhile most angiosperms usually undergo matrilineal inheritance. Consequently, they can be used in molecular systematics and phylogenetic analyses. Here, we sequenced and assembled cp genomes from T. endochrysea, T. tomentosa, T. miqueliana, T. americana and T. cordata, and compared them with those of seven previously reported Tilia species. Similar gene contents, gene orders and GC contents existed among the 12 cp genomes, which ranged from 162,564 to 162,855 bp and encoded 113 unique genes. Abundant simple sequence repeats (119-127) and dispersed repeats (97-135) were detected in Tilia cp genomes. In total, 11 hypervariable regions were identified that could be suitable for species identification and phylogenetic studies. A phylogenetic analysis of Malvaceae based on 5 hypervariable genes (matK + ndhF + rpoB + rpoC2+ycf1) revealed that all eight subfamilies were monophyletic groups. Additionally, the genus Tilia was divided into three groups on the basis of all 521 molecular variation loci. The current study provides valuable insights into the genomic evolution of the genus Tilia.Entities:
Keywords: Malvaceae; chloroplast genome; comparison analysis; hypervariable regions; phylogenetic analysis
Year: 2022 PMID: 35873491 PMCID: PMC9305825 DOI: 10.3389/fgene.2022.925726
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Circular gene map of the chloroplast genomes of T. endochrysea, T. tomentosa, T. miqueliana, T. americana and T. cordata. Genes drawn within the circle are transcribed clockwise, whereas those drawn outside are transcribed counterclockwise. Genes are color-coded in accordance with their functional groups. The inner circle represents the GC content.
Characteristics of complete chloroplast genomes of Tilia species.
| Specimens | Length (bp)/GC% | LSC (bp)/GC% | IR (bp)/GC% | SSC (bp)/GC% |
|---|---|---|---|---|
|
| 162,838/36.5% | 91,264/34.1% | 25,580/42.9% | 20,414/31.0% |
|
| 162,746/36.5% | 91,055/34.1% | 25,655/42.8% | 20,381/31.0% |
|
| 162,753/36.5% | 91,159/34.1% | 25,578/42.9% | 20,438/30.9% |
|
| 162,715/36.5% | 91,205/34.1% | 25,571/42.9% | 20,368/31.0% |
|
| 162,855/36.5% | 91,164/34.1% | 25,655/42.8% | 20,381/31.0% |
|
| 162,796/36.5% | 91,127/34.1% | 25,649/42.8% | 20,371/31.0% |
|
| 162,653/36.5% | 91,139/34.1% | 25,567/42.9% | 20,380/31.0% |
|
| 162,803/36.5% | 91,114/34.1% | 25,655/42.8% | 20,379/31.0% |
|
| 162,734/36.5% | 91,095/34.1% | 25,629/42.9% | 20,381/31.0% |
|
| 162,564/36.5% | 91,100/34.1% | 25,491/42.9% | 20,478/31.0% |
|
| 162,715/36.5% | 91,124/34.1% | 25,597/42.9% | 20,397/31.0% |
|
| 162,804/36.5% | 91,255/34.1% | 25,597/42.9% | 20,355/31.0% |
Genes encoded in the chloroplast genomes of five Tilia species.
| Category | Gene groups | Gene name |
|---|---|---|
| Photosynthesis | Subunits of photosystem I |
|
| Subunits of photosystem II |
| |
| Subunits of NADH dehydrogenase |
| |
| Subunits of cytochrome b/f complex |
| |
| Subunits of ATP synthase |
| |
| Large subunit of Rubisco |
| |
| Self-replication | Large subunits of ribosome |
|
| Small subunits of ribosome |
| |
| DNA-dependent RNA polymerase |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs |
| |
| Other genes | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Acetyl-CoA carboxylase |
| |
| C-type cytochrome synthesis gene |
| |
| Translation initiation factor |
| |
| Genes of unknown | Proteins of unknown function |
|
*/** genes containing one/two introns. (×2) two gene copies in IRs.
FIGURE 2Distribution of SSRs and dispersed repeats in the chloroplast genomes of Tilia. (A) Numbers of different SSR types detected in the cp genomes; (B) Numbers of different SSR class types; (C) Numbers of the four dispersed repeat types in Tilia; (D) Numbers of dispersed repeat types having a given length interval (30 to 39, 40 to 49, 50 to 59, 60 to 69 and ≥ 70).
FIGURE 3Comparison of the boundaries of large single-copy (LSC), small single-copy (SSC) an inverted repeat (IR) regions of 12 chloroplast genomes of Tilia.
FIGURE 4Single nucleotide polymorphic sites in 12 chloroplast genomes of Tilia. Track A represents the encoding genes. Tracks B–L represent variant sites in T. tomentosa, T. americana, T. miqueliana, T. cordata, T. mandshurica, T. paucicostata, T. taishanensis, T. oliveri, T. insularis, T. amurensis and T. mongolica compared with T. endorchrysea. The red, blue, yellow and green lines on each track indicate the kinds of variant, A, T, C and G nucleotides, respectively. Purple lines indicate InDel regions. The red rectangles represent the locations of the 11 selected hypervariable regions.
FIGURE 5Maximum-likelihood phylogenetic inference of 38 Malvaceae species based on matK + ndhF + rpoB + rpoC2 + ycf1 genes. The numbers associated with the nodes are bootstrap support and posterior probability values. All the nodes with bootstrapping equal to 100 was not mentioned. 1000 bootstrap replicates.
FIGURE 6Maximum-likelihood treeof 12 Tilia species based on all variation sites. The numbers associated with the nodes are bootstrap support and posterior probability values. All the nodes with bootstrapping equal to 100 was not mentioned. 1000 bootstrap replicates.