| Literature DB >> 35456447 |
Lei Wang1,2, Yan Wang1,2, Jing Zhang1,2, Yan Feng1,2, Qing Chen1, Zhen-Shan Liu1,2, Cong-Li Liu3, Wen He1,2, Hao Wang1,2, Shao-Feng Yang1,2, Yong Zhang1, Ya Luo1, Hao-Ru Tang1,2, Xiao-Rong Wang1,2.
Abstract
The subgenus Cerasus and its relatives include many crucial economic drupe fruits and ornamental plants. Repetitive elements make up a large part of complex genomes, and some of them play an important role in gene regulation that can affect phenotypic variation. However, the variation in their genomes remains poorly understood. This work conducted a comprehensive repetitive sequence identification across the draft genomes of eight taxa of the genus Prunus, including four of the Prunus subgenus Cerasus (Prunus pseudocerasus, P. avium, P. yedoensis and P. × yedoensis) as well as congeneric species (Prunus salicina, P. armeniaca, P. dulcis and P. persica). Annotation results showed high proportions of transposable elements in their genomes, ranging from 52.28% (P. armeniaca) to 61.86% (P. pseudocerasus). The most notable differences in the contents of long terminal repeat retrotransposons (LTR-RTs) and tandem repeats (TRs) were confirmed with de novo identification based on the structure of each genome, which significantly contributed to their genome size variation, especially in P. avium and P.salicina. Sequence comparisons showed many similar LTR-RTs closely related to their phylogenetic relationships, and a highly similar monomer unit of the TR sequence was conserved among species. Additionally, the predicted centromere-associated sequence was located in centromeric regions with FISH in the 12 taxa of Prunus. It presented significantly different signal intensities, even within the diverse interindividual phenotypes for Prunus tomentosa. This study provides insight into the LTR-RT and TR variation within Prunus and increases our knowledge about its role in genome evolution.Entities:
Keywords: Oligo-FISH; Prunus (Cerasus); centromere-associated repetitive sequence; long terminal repeat retrotransposons (LTR-RTs); tandem repeats (TR)
Mesh:
Substances:
Year: 2022 PMID: 35456447 PMCID: PMC9028240 DOI: 10.3390/genes13040641
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Reference genomes and sample information used for comparative genomics and molecular cytogenetics in this study.
| Taxa | Reference Genomes | Origin and Chromosome Number of Samples Used in Molecular Cytogenetic Analyses | ||
|---|---|---|---|---|
| Code | Origin | Ploidy Level | ||
| subg. | ||||
|
| ‘Luoyang Guying’ | HC | Miyi, Sichuan, China | 2 |
| XC1 | Xichang, Sichuan, China | |||
|
| ‘Tieton’ (v2.0) [ | Van | ZFI, CAAS, China | 2 |
| Mazzard | ||||
|
| ‘Pyn-Jeju2′ (v1.0) [ | Pyedoensis | Chengdu, Sichuan, China | 2 |
| ‘Somei-yoshino’ a (v3.1) [ | - | - | 2 | |
|
| - | Pcampan | Chengdu, Sichuan, China | 2 |
| subg. | ||||
|
| - | red_fruit | ZFI, CAAS, China | 2 |
| white_fruit | 2 | |||
|
| - | Phumilis | Suqian, Jiangsu, China | 2 |
|
| ‘Sanyueli’ (v1.0) [ | Cuihongli | Chengdu, Sichuan, China | 2 |
| subg. | ||||
|
| ‘Chuanzhihong’ (v1.0) [ | Diaogan | Akesu, Xinjiang, China | 2 |
| subg. | ||||
|
| ‘Texas’ (v2.0) [ | Pdulcis | Luntai, Xinjiang, China | 2 |
|
| ‘Lovell’ (v2.0) [ | Ppersica | Chengdu, Sichuan, China | 2 |
Note: ZFI, CAAS: Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences. a: Two haplotype-phased genome sequences were assembled and named after CYEspachiana_r3.0 (Cerasus × yedoensis_spa) and CYEspeciosa_r3.0 (Cerasus × yedoensis_spe).
Proportions of TEs and TRs in eleven haploid genomes of eight taxa in the Prunus subgenus Cerasus and its relatives.
| Genome |
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| LINE | 1.87 | 1.56 | 1.88 | 1.75/1.81 | 1.69 | 1.71 | 1.51 | 1.5 |
| SINE | 0.3 | 0.23 | 0.31 | 0.3/0.3 | 0.29 | 0.33 | 0.3 | 0.29 |
| LTR | 27.66 | 29.03 | 24.01 | 24.87/23.79 | 31.38 | 21.16 | 24.83 | 23.23 |
| Copia | 10.76 | 8.19 | 9.5 | 9.73/9.11 | 9.32 | 7.48 | 9.06 | 9.43 |
| Gypsy | 16.9 | 20.84 | 14.51 | 15.14/14.68 | 22.06 | 13.68 | 15.77 | 13.8 |
| nLTR | 0.21 | 0.17 | 0.2 | 0.23/0.23 | 0.19 | 0.24 | 0.16 | 0.16 |
| DIRS | 0.03 | 0.03 | 0.03 | 0.03/0.03 | 0.03 | 0.03 | 0.02 | 0.02 |
| PLE | 0.18 | 0.14 | 0.17 | 0.2/0.2 | 0.16 | 0.21 | 0.14 | 0.14 |
| Subclass_1 | 27.82 | 22.06 | 25.76 | 26.93/27.91 | 20.77 | 24.97 | 22.94 | 25.61 |
| TIR/CACTA | 8.12 | 4.5 | 6.58 | 6.58/7.03 | 4.25 | 4.43 | 5.07 | 7.41 |
| TIR/MuDR | 7.9 | 5.68 | 7.83 | 7.93/7.78 | 6.1 | 8.14 | 7.16 | 7.56 |
| TIR/PIF-Harbinger | 3.25 | 5.56 | 3.24 | 3.95/5.14 | 3.06 | 3.73 | 3.65 | 3.47 |
| TIR/Tc1-Mariner | 2.73 | 2.12 | 2.69 | 2.77/2.6 | 2.37 | 2.82 | 2.3 | 2.38 |
| TIR/hAT | 5.82 | 4.2 | 5.42 | 5.7/5.36 | 4.99 | 5.85 | 4.76 | 4.79 |
| Subclass_2 | 4.01 | 2.77 | 3.98 | 3.95/3.93 | 3.71 | 3.85 | 3.56 | 3.63 |
| Helitron | 3.37 | 2.26 | 3.31 | 3.3/3.3 | 3.12 | 3.22 | 2.95 | 3.04 |
| MITE | 0.64 | 0.51 | 0.67 | 0.65/0.63 | 0.59 | 0.63 | 0.61 | 0.59 |
| Total | 61.86 | 55.81 | 56.14 | 58.03/57.94 | 58.05 | 52.28 | 53.31 | 54.41 |
Figure 1Analyses of LTR-RTs in nine haploid genomes. (A) The copy numbers (N) of full-length LTR-RTs and proportions (P) of TLR-RT content in genomes. (B) Copy number differences of full-length LTR-RTs in different clusters among genomes. (C) UpSetR of the LTR-RT clusters shared by the Prunus subgenus Cerasus and its relatives. Intersection sizes on the vertical bars represent the numbers of LTR-RT clusters for a given pattern. The horizontal bars on the left show the whole clusters of LTR-RTs detected in each species. Datasets appearing in intersections are shown with spots.
Figure 2Distribution of the insertion ages of LTR-RTs. I represents insertion age; P represents the percentage (%). Insertion time was split into bins of 0.05 MYA.
Figure 3Analyses of TR sequences in nine haploid genomes of eight taxa. (A) Length distribution of TRs in the genomes; (B) TR contents in the genomes.
Figure 4Chromosomal distribution of centromeres and telomeres among P. pseudocerasus and nine relatives. Green and red signals represent the distributions of centromeres and telomeres, respectively. (A–M) Chromosomal distribution of centromeres and telomeres of different species. a–m: Ideigram and karyotype formula of different species. (A(a),B(b)) P. pseudocerasus, HC and XC1; (C(c),D(d)) P. avium, ‘Mazzard’ and ‘Van’; (E(e)) P. campanulate, Pcampan; (F(f)) P. yedoensis, Pyedoensis; (G(g)) P. humilis, Phumilis; (H(h),I(i)) P. tomentosa, red and white fruit; (J(j)) P. salicina, ‘Cuihongli’; (K(k)) P. armeniaca, ‘diaogan’; (L(l)) P. dulcis, Pdulcis; and (M(m)) P. persica, Ppersica.
Figure 5Distribution of centromere-associated and telomeric repetitive sequences on pseudochromosomes. Green and red bars represent the centromere and telomere sequences, respectively. The Y-axis indicates the percentage. (bin width: 100 k, step size: 50 k). (A) P. pseudocerasus; (B) P. avium; (C-1) P. × yedoensis_spa; (C-2) P. × yedoensis_spe; (D) P. salicina; (E) P. armeniaca; (F) P. dulcis; and (G) P. persica. The contents and proportions of the centromere sequences are shown on the right side for each haploid genome. C represents contents (Mb); P represents percentages (%).
Figure 6Phylogenetic tree based on the amino acid sequences of CENH3. NJ phylogenetic trees were reconstructed using MEGAX with the Jones–Taylor–Thornton (JTT) evolutionary model. CENH3 from Ziziphus jujuba was used as an outgroup. The numbers on the nodes correspond to bootstrap values based on 1000 tests. Only values higher than 30% are shown.