| Literature DB >> 35860470 |
Xien Wu1, Dengli Luo1, Yingmin Zhang1, Congwei Yang1, M James C Crabbe2,3,4, Ticao Zhang1, Guodong Li1.
Abstract
The hawthorns (Crataegus spp.) are widely distributed and famous for their edible and medicinal values. There are ∼18 species and seven varieties of hawthorn in China distributed throughout the country. We now report the chloroplast genome sequences from C. scabrifolia, C. chungtienensis and C. oresbia, from the southwest of China and compare them with the previously released six species in Crataegus and four species in Rosaceae. The chloroplast genome structure of Crataegus is typical and can be divided into four parts. The genome sizes are between 159,654 and 159,898bp. The three newly sequenced chloroplast genomes encode 132 genes, including 85 protein-coding genes, 37 tRNA genes, and eight rRNA genes. Comparative analysis of the chloroplast genomes revealed six divergent hotspot regions, including ndhA, rps16-trnQ-UUG, ndhF-rpl32, rps16-psbK, trnR-UCU-atpA and rpl32-trnL-UAG. According to the correlation and co-occurrence analysis of repeats with indels and SNPs, the relationship between them cannot be ignored. The phylogenetic tree constructed based on the complete chloroplast genome and intergenic region sequences indicated that C. scabrifolia has a different origin from C. chungtienensis and C. oresbia. We support the placement of C. hupehensis, C. cuneata, C. scabrifolia in C. subg. Crataegus and C. kansuensis, C. oresbia, C. kansuensis in C. subg. Sanguineae. In addition, based on the morphology, geographic distribution and phylogenetic relationships of C. chungtienensis and C. oresbia, we speculate that these two species may be the same species. In conclusion, this study has enriched the chloroplast genome resources of Crataegus and provided valuable information for the phylogeny and species identification of this genus.Entities:
Keywords: Crataegus spp.; chloroplast genome; comparative analysis; hawthorn; phylogenetic analysis
Year: 2022 PMID: 35860470 PMCID: PMC9289535 DOI: 10.3389/fgene.2022.900357
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Chloroplast genome maps of Crataegus (C. oresbia, C. scabrifolia, C. chungtienesis). In the diagram, different colors indicate genes with different functions. The genes inside circles are transcribed clockwise and genes outside circles are transcribed counterclockwise. Two inverted repeats (IRa and IRb), a large single copy region (LSC) and small single copy region (SSC) regions are shown in the inner circles. The light gray inner circles indicate A/T content and the dark gray circles indicate G/C content.
Comparison of chloroplast genome features of the nine species of Crataegus and four other genera of Rosaceae (Eriobotrya, Malus, Rubus, and Spiraea).
| Species | Size (bp) | LSC (bp) | SSC (bp) | IR (bp) | Genes number | GC (%) | Reference |
|---|---|---|---|---|---|---|---|
|
| 159,851 | 87,819 | 19,263 | 26,384 | 132 | 36.6 | This study |
|
| 159,742 | 87,819 | 19,218 | 26,383 | 132 | 36.6 | This study |
|
| 159,847 | 87,814 | 19,263 | 26,384 | 132 | 36.6 | This study |
|
| 159,660 | 87,712 | 19,231 | 26,358 | 132 | 36.6 |
|
|
| 159,898 | 87,599 | 19,218 | 26,540 | 129 | 36.6 |
|
|
| 159,730 | 87,778 | 19,183 | 26.384 | 129 | 36.6 | |
|
| 159,865 | 87,815 | 19,231 | 26,384 | 132 | 36.6 |
|
|
| 159,766 | 87,852 | 19,143 | 26,385 | 132 | 36.6 |
|
|
| 159,654 | 87,747 | 19,138 | 26,384 | 131 | 36.6 |
|
|
| 159,488 | 87,380 | 19,290 | 26,384 | 132 | 36.7 |
|
|
| 159,834 | 87,950 | 19,176 | 26,409 | 129 | 36.6 |
|
|
| 155,949 | 84,375 | 18,894 | 26,354 | 131 | 36.7 |
|
|
| 155,144 | 84,818 | 18,580 | 25,937 | 130 | 37.3 |
|
Genes in the chloroplast genome of Crataegus chungtienesis.
| Gene Group | Gene name |
|---|---|
| Photosystem I |
|
| Photosystem II |
|
| NAD (P) H oxidoreductase |
|
| Cytochrome b6/f complex |
|
| ATP synthase |
|
| Rubisco |
|
| Large subunit of ribosomal |
|
| Small subunit of ribosomal |
|
| DNA dependent RNA polymerase |
|
| rRNA genes |
|
| tRNA genea |
|
| Maturase |
|
| Protease |
|
| Envelop membrane protein |
|
| Subunit of acetyl-CoA |
|
| c-type cytochrome synthesis gene |
|
| Translational |
|
| Conserved hypothetical chloroplast ORF |
|
Genea, Gene with one intron; Geneb, Gene with two introns; Genec, Number of copies of multi-copy genes.
FIGURE 2Heatmap analysis of relative synonymous codon usage (RSCU) values among the 9 species of Crataegus.
FIGURE 3Complete chloroplast genome comparison of 13 species of Rosaceae. Gray arrows indicate the direction of the gene. The dark blue regions represent exons. Pink regions represent noncoding sequences (CNS), and white peaks represent genomic differences. The Y-axis represents the percentage, from 50 to 100%.
FIGURE 4Sliding window analysis based on the cp genomes of 9 Crataegus species.
FIGURE 5Comparison of the borders of the LSC, SSC, and IR regions among 9 Crataegus chloroplast genomes and four species of Rosaceae.
FIGURE 6Phylogenetic tree of Crataegus within the Rosaceae. The entire genome data set was analyzed using maximum likelihood (ML) and Bayesian information (BI). Different colors represent different clades (A,B).
FIGURE 7Phylogenetic tree of Crataegus based on the sequences of five intergenic regions. The different colors represent the different evolutionary branches clades (clade A and clade B) in the phylogenetic tree constructed from the complete chloroplast genome. (A) Phylogenetic tree constructed by maximum likelihood (ML). (B) Phylogenetic tree constructed by maximum Bayesian method (BI).
Correlation values of indels with SNPs, indels with repeats and SNPs with repeats.
| Indels and SNPs | Indels and SSRs | Indels and Oligonucleotide repeats | Indels and Repeats | SNPs and SSRs | SNPs and Oligonucleotide repeats | SNPs and Repeats | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Rho |
| Rho |
| Rho |
| Rho |
| Rho |
| Rho |
| Rho |
| |
|
| 0.161 | <0.001 | 0.332 | <0.001 | 0.093 | 0.002 | 0.343 | <0.001 | 0.068 | 0.026 | 0.007 | 0.819 | 0.065 | 0.034 |
|
| 0.211 | <0.001 | 0.335 | <0.001 | 0.053 | 0.081 | 0.323 | <0.001 | 0.088 | 0.004 | −0.015 | 0.613 | 0.07 | 0.023 |
|
| 0.202 | <0.001 | 0.358 | <0.001 | 0.071 | 0.02 | 0.345 | <0.001 | 0.057 | 0.060 | −0.023 | 0.455 | 0.037 | 0.228 |
|
| 0.256 | <0.001 | 0.188 | <0.001 | 0.003 | 0.916 | 0.156 | <0.001 | −0.020 | 0.470 | −0.117 | <0.001 | −0.08 | 0.009 |
|
| 0.286 | <0.001 | 0.047 | 0.125 | −0.024 | 0.431 | 0.032 | 0.290 | −0.059 | 0.129 | −0.114 | <0.001 | −0.095 | 0.002 |