| Literature DB >> 36140840 |
Zhuyifu Chen1,2, Xiaolei Yu1,2, Yujiao Yang1,2, Pei Wei1,2, Wencai Zhang3, Xinzhong Li3, Chenlai Liu1,2, Shuqi Zhao1,2, Xiaoyan Li4, Xing Liu1,2,3.
Abstract
Saxifraga species are widely distributed in alpine and arctic regions in the Northern hemisphere. Highly morphological diversity within this genus brings great difficulties for species identification, and their typical highland living properties make it interesting how they adapt to the extreme environment. Here, we newly generated the chloroplast (cp) genomes of two Saxifraga species and compared them with another five Saxifraga cp genomes to understand the characteristics of cp genomes and their potential roles in highland adaptation. The genome size, structure, gene content, GC content, and codon usage pattern were found to be highly similar. Cp genomes ranged from 146,549 bp to 151,066 bp in length, most of which comprised 130 predicted genes. Yet, due to the expansion of IR regions, the second copy of rps19 in Saxifraga stolonifera was uniquely kept. Through sequence divergence analysis, we identified seven hypervariable regions and detected some signatures of regularity associated with genetic distance. We also identified 52 to 89 SSRs and some long repeats among seven Saxifraga species. Both ML and BI phylogenetic analyses confirmed that seven Saxifraga species formed a monophyletic clade in the Saxifragaceae family, and their intragenus relationship was also well supported. Additionally, the ndhI and ycf1 genes were considered under positive selection in species inhabiting relatively high altitudes. Given the conditions of intense light and low CO2 concentration in the highland, the products of these two genes might participate in the adaptation to the extreme environment.Entities:
Keywords: Saxifraga; chloroplast genome; comparative analysis; high-elevation adaptation; hypervariable regions
Mesh:
Substances:
Year: 2022 PMID: 36140840 PMCID: PMC9498722 DOI: 10.3390/genes13091673
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1cp genome map of two newly sequenced Saxifraga species, S. saginoides and S. sessiliflora. The outer circle shows the transcription direction; genes outside are transcribed clockwise while genes inside are transcribed counterclockwise. LSC/SSC/IR regions are exhibited in the inner circle. Genes of different functions are presented in different colors.
Gene annotation of the S. saginoides and S. sessiliflora chloroplast genome.
| Category | Group | Genes |
|---|---|---|
| Photosynthesis related genes | Rubisco |
|
| Photosystem I | ||
| Photosystem II | ||
| ATP synthase | ||
| Cytochrome b/f complex | ||
| Cytochrome C synthesis |
| |
| NADPH dehydrogenase | ||
| Transcription and translation related genes | Transcription | |
| Ribosomal proteins | ||
| Translation initiation factor |
| |
| RNA genes | Ribosomal RNA | |
| Transfer RNA | ||
| Other genes | RNA processing |
|
| Carbon metabolism |
| |
| Fatty acid synthesis |
| |
| Proteolysis |
| |
| Conserved ORFs |
genes with one intron, genes with two introns, two gene copies in IRs.
Summary statistics of chloroplast genomes for Saxifraga species.
| Genome Feature |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Genome size (bp) | 147,716 | 149,569 | 147,240 | 146,549 | 147,442 | 149,775 | 151,066 |
| LSC size (bp) | 79,912 | 81,306 | 79,310 | 79,317 | 79,985 | 81,998 | 82,738 |
| SSC size (bp) | 16,830 | 16,961 | 16,874 | 16,390 | 16,537 | 16,895 | 17,504 |
| IR size (bp) | 25,487 | 25,651 | 25,528 | 25,421 | 25,460 | 25,441 | 25,412 |
| Number of genes | 130 | 130 | 130 | 130 | 130 | 130 | 131 |
| Protein genes | 85 | 85 | 85 | 85 | 85 | 85 | 86 |
| tRNA genes | 37 | 37 | 37 | 37 | 37 | 37 | 37 |
| rRNA genes | 8 | 8 | 8 | 8 | 8 | 8 | 8 |
| Duplicated genes in IRs | 17 | 17 | 17 | 17 | 17 | 17 | 18 |
| GC content (%) | 38% | 37.9% | 38% | 38.1% | 38.1% | 37.8% | 37.8% |
| GC content in LSC (%) | 36.2% | 36% | 36.2% | 36.2% | 36.2% | 35.8% | 35.9% |
| GC content in SSC (%) | 32.1% | 32% | 32% | 32.4% | 32.4% | 31.9% | 32.2% |
| GC content in IRs (%) | 42.9% | 42.8% | 42.9% | 42.8% | 42.8% | 42.9% | 43% |
Figure 2A plot displaying the boundaries of the LSC, SSC, and IR regions and genes around the junction sites for each Saxifraga cp genome. JLB, JSB, JSA, and JLA denote the junction sites of LSC and IRb, IRb and SSC, SSC and IRa, and IRa and LSC, respectively.
Figure 3The alignment and comparative analysis of the whole cp genome for seven Saxifraga species. Among them, S. saginoides was set as reference. The horizontal axis represents the coordinates of cp genomes in the alignment result. Exons, introns, and conserved noncoding sequences (CNSs) were marked as different colors.
Figure 4Comparative analysis of the nucleotide polymorphism (Pi) values among the cp genomes of Saxifraga. (A) Pi values calculated as all the seven Saxifraga species were taken into account. (B) Pi values calculated as five species in Sect. Ciliatae was selected separately.
Figure 5Usage preference of amino acids (AAs) and codons for PCGs. (A) AA usage of all the PCGs in each Saxifraga species. (B) RSCU for every AA in S. saginoides. For each amino acid, a color represents a unique codon. (C) ENC-GC3 plot for S. saginoides; every gene was displayed as a dot, and different colors mean genes in distinct functional groups.
Figure 6Repeats analysis among cp genomes of Saxifraga. (A) Distribution of all repeat units for SSRs in each species. (B) The number of different types for SSRs in each species. (C) Distribution of SSRs, respectively, in LSC, SSC, and IR regions. (D) The number of different types for long repeats.
Figure 7Maximum likelihood (ML) phylogenetic tree of 47 species in the Saxifragaceae family, reconstructed with 79 PCGs. Three non-Saxifragaceae species, Myriophyllum spicatum, Ribes nevadense, and Ribes roezlii, were set as outgroups.
Figure 8Heatmap representing pairwise Ka/Ks ratios of PCGs among Saxifraga species. The color bias toward red indicates that there is a higher Ka/Ks ratio between genes.