| Literature DB >> 32576920 |
Wei Wang1, Tao Yang2, Hui-Ling Wang3, Zhen-Jian Li1, Jian-Wei Ni1, Shang Su1, Xin-Qiao Xu4.
Abstract
As a source of genetic variation, almond germplasm resources are of great significance in breeding. To better reveal the mutation characteristics and evolution patterns of the almond chloroplast (cp) genome, the complete cp genomes from six almond species were analyzed. The lengths of the chloroplast genome of the six almond species ranged from 157,783 bp to 158,073 bp. For repeat sequence analysis, 53 pairs of repeats (30 bp or longer) were identified. A total of 117 SSR loci were observed, including 96 polymorphic SSR loci. Nine highly variable regions with a nucleotide variability (Pi) higher than 0.08, including rps16, rps16-psbK, atpF-atpH, rpoB, ycf3-rps4, rps4-ndhJ, accD-psaI and rps7-orf42 (two highly variable regions) were located. Based on the chloroplast genome evolution analysis, three species (P. tenella, P. pedunculata and P. triloba) and wild cherry (P. tomentosa) were grouped into clade I. Clade II consisted of two species (P. mongolica and P. tangutica) and wild peach (P. davidiana). Clade III included the common almond (P. dulcis), cultivated peach (P. persica) and GanSu peach (P. kansuensis). This result expands the researchers' vision of almond plant diversity and promotes an understanding of the evolutionary relationship among almond species. In brief, this study provides abundant resources for the study of the almond chloroplast genome, and has an important reference value for study of the evolution and species identification of almond.Entities:
Mesh:
Year: 2020 PMID: 32576920 PMCID: PMC7311419 DOI: 10.1038/s41598-020-67264-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Description of the six almond species.
| Species | Origin | Collection site | Latitude | Longitude |
|---|---|---|---|---|
| Cultivar | Kashi, Xinjiang, China | 38.22 | 77.18 | |
| Wild | Aba, Sichuan, China | 32.3 | 103.38 | |
| Wild | Yulin, Shanxi, China | 38.51 | 109.52 | |
| Wild | Alashan, Neimenggu, China | 39.46 | 105.3 | |
| Wild | Aertai, Xinjiang, China | 48.18 | 86.34 | |
| Wild | Chicheng, Hebei, China | 41.11 | 116.04 |
Comparison of the complete chloroplast genome contents of the six almond species.
| P. dulcis | P. pedunculata | P. triloba | P. mongolica | P. tangutica | P. tenella | |
|---|---|---|---|---|---|---|
| GeneBank Number | MH727486 | MG869261 | MH748555 | MH727485 | MH744156 | MH727487 |
| Total Sequence Length | 157,783 | 157,873 | 157,816 | 158,074 | 158,049 | 158,066 |
| Large Single Copy (LSC) | 85,921 | 86,074 | 86,021 | 86,316 | 86,226 | 86,260 |
| Small Single Copy (SSC) | 19,100 | 19,029 | 19,023 | 18,992 | 19,063 | 19,056 |
| Inverted Repeat Region (IR) | 26,380 | 26,384 | 26,385 | 26,382 | 26,379 | 26,374 |
| Total Number of Gene | 136 | 136 | 136 | 136 | 136 | 136 |
| Protein-Coding Genes | 91 | 91 | 91 | 91 | 91 | 91 |
| tRNA | 37 | 37 | 37 | 37 | 37 | 37 |
| rRNA | 8 | 8 | 8 | 8 | 8 | 8 |
| GC% | 36.77 | 36.78 | 36.79 | 36.76 | 36.76 | 36.73 |
| Reference | This article | Wang | This article | This article | This article | This article |
Figure 1Gene map of the five almond chloroplast genomes. Genes drawn inside the circle are transcribed clockwise, while genes outside the circle are transcribed counterclockwise. The thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into large single-copy (LSC) and small single-copy (SSC) regions. The gene functional groups are color-coded.
List of genes in the chloroplast genomes of the six almond species.
| Category | Gene group | Gene name |
|---|---|---|
| Rubisco | ||
| Photosystem I | ||
| Assembly/stability of photosystem I | ||
| Photosystem II | ||
| ATP synthase | ||
| Cytochrome b/f complex | ||
| Cytochrome c synthesis | ||
| NADPH dehydrogenase | ||
| Transcription and translation related genes | Transcription | |
| Ribosomal proteins | ||
| RNA genes | Ribosomal RNA | |
| Transfer RNA | a | |
| Other genes | RNA processing | |
| Carbon metabolism | ||
| Fatty acid synthesis | ||
| Proteolysis | ||
| Genes of unknown function | Conserved reading frames | |
| Hypothetical chloroplast protein | ||
| Pseudogenes | Ψ |
a gene containing a single intron, b gene containing two introns, c gene divided into two independent transcription units, (×2), gene with two copies; a pseudogene is represented by Ψ.
Figure 2Comparisons of the borders of the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions among the P. pedunculata, P. triloba, P. mongolica, P. tenella, P. tangutica, and P. dulcis chloroplast genomes. The colored boxes above and below the main line indicate the adjacent border genes, where yellow indicates rps19, blue indicates rpl2, purple indicates Ѱycf1 (Ѱ indicates a pseudogene), brown indicates ndhF, pink indicates ycf1, green indicates rpl2, and light green indicates trnH.
Figure 3Visualized alignments of the six almond species cp genomes. Sequence identity plots among the five almond species chloroplast genomes were made by using mVISTA, with P. pedunculata as a reference. The y-axis represents identity ranging from 50% to 100%.
Figure 4Analyses of repeated sequences in the six almond species chloroplast genomes. (A) Frequency of repeat types. (B) Frequency of repeat sequences by length.
Simple sequence repeats (SSRs) in the six almond cp genomes.
| Species | SSR loci no. | PolyM. loci no. | PolyM. loci (100%) | mono- | di- | tri- | tetra- | penta- | Location | Region | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IGS | Intron | CDS | LSC | IR | SSC | |||||||||
| 80 | 60 | 75.00 | 56 | 14 | 1 | 8 | 1 | 51 | 16 | 13 | 71 | 2 | 7 | |
| 81 | 60 | 74.07 | 58 | 13 | 1 | 8 | 1 | 53 | 16 | 12 | 71 | 2 | 8 | |
| 76 | 55 | 72.37 | 52 | 17 | 0 | 5 | 2 | 49 | 15 | 12 | 69 | 3 | 4 | |
| 80 | 57 | 71.25 | 53 | 19 | 0 | 6 | 2 | 53 | 16 | 11 | 70 | 2 | 8 | |
| 85 | 63 | 74.12 | 63 | 13 | 0 | 7 | 2 | 56 | 16 | 13 | 73 | 4 | 8 | |
| 75 | 52 | 69.33 | 51 | 15 | 0 | 6 | 3 | 50 | 12 | 13 | 65 | 2 | 8 | |
| Total Loci | 117 | 96 | 82.05 | 84 | 19 | 2 | 8 | 4 | 80 | 25 | 12 | 100 | 5 | 12 |
Figure 5Sliding window analyses of the six almond species chloroplast genomes sequences. The X-axis represents the position of the midpoint of a window, while the Y-axis represents nucleotide variability (Pi) of each window (window length: 600 bp, step size: 200 bp).
Figure 6Phylogenetic relationships of 39 species inferred from maximum likelihood (ML) analysis based on the complete chloroplast genome sequences. The numbers above the nodes are the support values with maximum likelihood (ML) analyses, and the symbol * indicates that the support rate of the branch is 100%. Clade I (orange): P. tenella, P. pedunculata, P. triloba and P. tomentosa; clade II (yellow): P. mongolica, P. tangutica and P. davidiana; clade III (light blue): P. dulcis, P. persica and P. kansuensis.