Literature DB >> 36196069

The identification of the Rosa S-locus provides new insights into the breeding and wild origins of continuous-flowering roses.

Koji Kawamura1, Yoshihiro Ueda2,3, Shogo Matsumoto4, Takanori Horibe4,5, Shungo Otagaki4, Li Wang6, Guoliang Wang7,8, Laurence Hibrand-Saint Oyant9, Fabrice Foucher9, Marcus Linde10, Thomas Debener10.   

Abstract

This study aims to: (i) identify the Rosa S-locus controlling self-incompatibility (SI); (ii) test the genetic linkage of the S-locus with other loci controlling important ornamental traits, such as the continuous-flowering (CF) characteristic; (iii) identify the S-alleles (SC ) of old Chinese CF cultivars (e.g, Old Blush, Slater's Crimson China) and examine the changes in the frequency of cultivars with Sc through the history of breeding; (iv) identify wild species carrying the Sc-alleles to infer wild origins of CF cultivars. We identified a new S-RNase (SC2 ) of Rosa chinensis in a contig from a genome database that has not been integrated into one of the seven chromosomes yet. Genetic mapping indicated that SC2 is allelic to the previously-identified S-RNase (SC1 ) in chromosome 3. Pollination experiments with half-compatible pairs of roses confirmed that they are the pistil-determinant of SI. The segregation analysis of an F1 -population indicated genetic linkage between the S-locus and the floral repressor gene KSN. The non-functional allele ksn is responsible for the CF characteristic. A total of five S-alleles (SC1-5 ) were identified from old CF cultivars. The frequency of cultivars with SC dramatically increased after the introgression of ksn from Chinese to European cultivars and remains high (80%) in modern cultivars, suggesting that S-genotyping is helpful for effective breeding. Wild individuals carrying SC were found in Rosa multiflora (SC1 ), Rosa chinensis var. spontanea (SC3 ), and Rosa gigantea (SC2 , SC4 ), supporting the hypothesis of hybrid origins of CF cultivars and providing a new evidence for the involvement of Rosa multiflora.
© The Author(s) 2022. Published by Oxford University Press on behalf of Nanjing Agricultural University.

Entities:  

Year:  2022        PMID: 36196069      PMCID: PMC9527601          DOI: 10.1093/hr/uhac155

Source DB:  PubMed          Journal:  Hortic Res        ISSN: 2052-7276            Impact factor:   7.291


Introduction

The rose is one of the globally most popular ornamental plants, with a long history of breeding and cultivation. It is important not only economically, but also culturally. More than 30 000 rose cultivars have been developed [1] mainly by cross breeding. Patterns of inheritance are quite difficult to predict for most traits, as roses are predominantly outcrossing and highly heterozygous plants [2]. As a consequence, the success of cross breeding in the rose has largely depended on chance and the experience of breeders, requiring enormous efforts to make new cultivars with desirable traits [3]. Advances in scientific knowledge on rose genetics have been much awaited. Researchers have developed molecular markers to construct genetic linkage maps of the rose, clarifying the underlying genetic mechanisms controlling ornamental traits [4] and, genome sequencing finished in 2018 [5-7]. Key genetic factors controlling important ornamental traits, such as scent production [8, 9], continuous-flowering (CF) [10], and double-flower (DF) [5, 11] have been identified. Although identification of the S-locus controlling self-incompatibility (SI) in roses is essential for improving breeding at the diploid level, it has not been fully elucidated. The gametophytic SI is controlled by a single S-locus with multiple alleles, and when one of the two S-alleles of the pistil matches those of pollen, the pollen is recognized as self and is rejected [12]. The pistil S gene of Rosaceae encodes an extracellular ribonuclease called S-RNase, which acts as a cytotoxin in self-pollen tubes. The pollen S gene is involved in detoxifying non-self-S-RNases and encodes an F-box gene [12]. The “collaborative non-self-recognition” model was proposed for SI in the Solanaceous plant, Petunia [13], where multiple F-box genes are involved in pollen specificity, and each targets a subset of non-self-S-RNases for detoxification. The SI of Rosaceae may adopt this “non-self-recognition by a multiple factors” system [12], except for the self-recognition by a single F-box gene system in Prunus [14]. In the genome databases of R. chinensis “Old Blush”, two different regions have been proposed as the S-locus [5, 15], both of which are located on the same chromosome 3 within a few Mbp of each other. Vieira et al. [15] concluded that their candidate region was the true S-locus because its S-RNase (hereafter called as S) has a stronger similarity to the Prunus S-RNase than the S-RNase36 identified by Hibrand-Saint Oyant et al. [5]. Chen et al. [16] identified an S-locus-like region in the genome of Rosa rugosa, whose the S-RNase is orthologous to the S of Old Blush. Du et al. [17] identified the S-RNase controlling SI in Fragaria, which is also more similar to the S rather than S-RNase36. These results indicate that the S is the true S gene controlling SI in Old Blush. However, validation of the Rosa S-locus by pollination experiments has been conducted with only a few individuals [15]. Pollination tests with many individuals are required to confirm the result. Furthermore, Vieira et al. [15] identified the S of Old Blush from the genome database of Raymond et al. [6] but failed to identify its allelic gene in the other genome database of Hibrand-Saint Oyant et al. [5]. The S-RNase36 identified by Hibrand-Saint Oyant et al. [5] has a single intron and weak homology to the S, which has two introns. As the two genome databases of Old Blush represent different haplotypes of chromosome 3 [18], there should be unidentified S-RNase in the genome database of Hibrand-Saint Oyant et al. [5]. Thus, the first objective of this study is to (i) identify the other allele of S-RNases in Old Blush and to confirm the Rosa S-locus with strong evidence from a number of pollination experiments. The second objective of this study is to (ii) test the genetic linkages of the S-locus with other loci controlling important ornamental traits. In the chromosome 3 where the Rosa S-locus is assumed to be located, there are other important loci controlling valuable traits for ornamental plants, such as CF [19], double flower [20], thornlessness [21], and resistance against black spot disease [22]. These previous studies found strong skewness in the segregation of these traits and hypothesized that their underlying genes are genetically linked with the S-locus. The genetic linkage of the S-locus and other loci may have important consequences for rose breeding. For example, the CF characteristic is controlled by a single recessive locus on the chromosome 3, and the underlying gene is the non-functional allele (ksn) of RoKSN [5, 10], a floral repressor gene in roses [23]. Roses homozygous at the non-functional allele ksn will be targets in the cross breeding of CF roses. But, if the ksn alleles of parental roses are strongly linked with the same S-allele, breeding of CF roses with the ksn-homozygote will be hampered by SI, where the pistil rejects the pollen with the same S-allele linked to ksn. The third objective of this study is to (iii) determine the S-allele (S) of old Chinese CF cultivars and to clarify the changes in the frequency of rose cultivars with S during the history of rose breeding. The ksn allele conferring the CF characteristic in modern rose cultivars originated from old Chinese cultivars around 200 years ago [10, 24]. The original Chinese cultivars introducing the CF characteristic into modern roses are thought to be the four China roses, i.e. Slater’s Crimson China, Parsons’ Pink China (Old Blush), Hume’s Blush Tea-scented China, and Parks’ Yellow Tea-scented China [25]. After the introgression of ksn from China to Europe, the frequency of rose cultivars carrying the Sc should increase but may progressively decrease during the 200 years’ history of rose breeding due to the hybridizations with European cultivars. However, if Sc is genetically linked with ksn, the strong artificial selection on ksn [24] might result in a ksn-associated increase in the frequency of Sc during the history of rose breeding. The fourth objective of this study is to (iv) identify candidate genotypes for wild ancestors of old Chinese CF cultivars by using S as markers. The introduction of Chinese CF cultivars is one of the most revolutionary events throughout the history of rose breeding [24, 25], and the Chinese cultivars have profound effects on the genetic bases of modern rose cultivars [26]. However, the wild ancestral origin of the old Chinese CF cultivars have not been completely elucidated. Molecular phylogenetic studies have indicated their complex hybrid origins [27-32]. The whole genome sequencing will play a critical role in the elucidation of the hybrid origin of old Chinese cultivars but is too complicated to analyze the massive data for many candidate species. There is also a large intraspecific genetic variation within the species [33, 34], and there might be also natural hybrids between the species. Due to the high genetic divergences of the plant S-locus [35], we expect that the S-allele specific marker is useful to pinpoint the candidate wild ancestors of old cultivars. Genome-wide identification and genetic analyses of candidate S-RNase genes in the rose. (a) Distribution of the candidate genes in R. chinensis “Old Blush” genome. Numbers and genomic positions are shown based on the genome database of Raymond et al. [6]. Pistil-expressed genes are shown in red. (b) Pistil and stamen expression levels of the candidate genes in roses. FPKM values were calculated from RNA-seq data of the pistil and stamen. Averages and standard errors of three individuals are shown for R. chinensis “Old Blush”. From the transcriptome data of eight R. multiflora plants, genes orthologous to the reference genes of Old Blush were identified, and their averages and standard errors of the FPKM values are shown for R. multiflora. (c) Molecular phylogenetic tree of the pistil-expressed, candidate genes, including Prunus S-RNases as references. The tree was rooted with T2-RNase of Arabidopsis thaliana RNS2 (NM129536) [36]. Bold branches have FastTree support values [37, 38] above 0.95 based on 1000 resamples.

Results

Genome-wide identification of candidate S-RNase

We performed a genome-wide search for candidate S-RNase genes in the genome databases of Old Blush [5, 6], R. multiflora [7], and R. rugosa [16, 39] (Supplementary data Table D2) and identified 16 genes located on four chromosomes (Fig. 1a). Of these, seven genes were expressed in the pistil according to the pistil transcriptomes of Old Blush, R. multiflora, and R. rugosa (Fig. 1b). These candidate genes were named by chromosome number and letter, e.g. 3D and 6A (Fig. 1a). The specific genes of Old Blush, R. multiflora, and R. rugosa were named with Rc, Rm, and Rg, respectively (Fig. 1c). The putative amino acid sequences were obtained from the cDNA sequences, and a molecular phylogenetic tree of the pistil-expressed, S-RNase-like genes was constructed, including the Prunus S-RNase (Fig. 1c). The results show that the 3D and 0A genes are the primary candidates for S-RNase controlling SI in the rose due to their (i) high expression levels in the pistil (Fig. 1b), (ii) similarity to Prunus S-RNase (Fig. 1c), and (iii) high genetic divergences (Fig. 1c).
Figure 1

Genome-wide identification and genetic analyses of candidate S-RNase genes in the rose. (a) Distribution of the candidate genes in R. chinensis “Old Blush” genome. Numbers and genomic positions are shown based on the genome database of Raymond et al. [6]. Pistil-expressed genes are shown in red. (b) Pistil and stamen expression levels of the candidate genes in roses. FPKM values were calculated from RNA-seq data of the pistil and stamen. Averages and standard errors of three individuals are shown for R. chinensis “Old Blush”. From the transcriptome data of eight R. multiflora plants, genes orthologous to the reference genes of Old Blush were identified, and their averages and standard errors of the FPKM values are shown for R. multiflora. (c) Molecular phylogenetic tree of the pistil-expressed, candidate genes, including Prunus S-RNases as references. The tree was rooted with T2-RNase of Arabidopsis thaliana RNS2 (NM129536) [36]. Bold branches have FastTree support values [37, 38] above 0.95 based on 1000 resamples.

The 3D S-RNase gene in Old Blush named as “Rc3D_S” (Fig. 1c) is the same gene previously identified by Vieira et al. [15] as the candidate S-RNase in roses. We identified nine additional 3D S-RNase-like genes from the genome databases and pistil transcriptomes of R. multiflora and R. rugosa (Fig. 1c). One of them, named as “Rg3D_S”, is the same gene previously reported as S-RNase (Chr4.718) in the S-locus of R. rugosa [16]. The high genetic divergence (64.8%) in the 3D S-RNase gene agrees well with the assumption that this gene encodes the true S-RNase controlling SI in roses (Table 1). One of the 3D genes of R. multiflora, named as “Rm3D_S”, is identical to the Rc3D_S of Old Blush, indicating that the R. multiflora and Old Blush share the same S-alleles (S).
Table 1

The identification of candidate genes for the Rosa S-RNase. According to the phylogenetic tree (Fig. 1c), four genes (3D, 0A, 3A, 6C) with high similarities to Prunus S-RNases are considered to be candidate S-RNase genes

Candidate S-RNase genesa
SpeciesData sourceb3D0A3A6C
R. chinensis “Old Blush”Genome database [5] × Rc0A_SC2 Rc3A Rc6C_a
Genome database [6] Rc3D_SC1 × Rc3A Rc6C_a
R. rugosa Genome database [16] Rg3D_S18 × × Rg6C_GD1
Genome database [39] Rg3D_S20 × × ×
PT of the plant Rg46 Rg3D_S18 Rg0A_S19 × ×
R. multiflora Genome database [7] Rm3D_S15 × Rm3A Rm6C
Rm3D_S16
PT of the plant Rm08 Rm3D_S7 Rm0A_S6 Rm3A_Rm08 ×
PT of the plant Rm09 Rm3D_S7 Rm0A_S8 Rm3A_Rm09 Rm6C_Rm09
PT of the plant Rm27 Rm3D_S9 × Rm3A_Rm27 Rm6C_Rm27
Rm3D_S11
PT of the plant Rm28 Rm3D_S13 Rm0A_S10 × Rm6C_Rm28
PT of the plant Rm33 Rm3D_S9 Rm0A_S12 × Rm6C_Rm33
PT of the plant Rm3(tetraploid) Rm3D_SC1 × × ×
Rm3D_S15
PT of the plant Rm2 Rm3D_SC1 × × ×
Rm3D_S15
PT of the plant Rm1 Rm3D_SC1 × × ×
Rm3D_S14
Average pairwise identity of protein sequences 64.8% 62.9% 97.8%94.9%
Expression levels in pistilc *** *** * **

Genes identified in genome databases and pistil transcriptomes. × = unidentified.

PT = Pistil transcriptome (RNAseq).

The average FPKM values of R. multiflora (Fig. 1b): ***, >100; **, 50–100; *10–50.

The identification of candidate genes for the Rosa S-RNase. According to the phylogenetic tree (Fig. 1c), four genes (3D, 0A, 3A, 6C) with high similarities to Prunus S-RNases are considered to be candidate S-RNase genes Genes identified in genome databases and pistil transcriptomes. × = unidentified. PT = Pistil transcriptome (RNAseq). The average FPKM values of R. multiflora (Fig. 1b): ***, >100; **, 50–100; *10–50. In the same clade of the Prunus S-RNase and the Rosa 3D S-RNase, there were three other genes (6C, 3A, 0A; Fig. 1c). The 6C and 3A genes are located in different genomic positions from the 3D S-RNase (Fig. 1a). Therefore, they are not allelic genes to 3D genes. The 6C and 3A genes have low or undetectable expression levels in pistils (Fig. 1b) and low genetic divergences between alleles (>94% identity; Table 1). One allele of the 6C gene of Old Blush [5] and that of R. rugosa [39] are pseudogenes due to frameshift mutations (Supplementary Data Table D2). The 3A gene is homozygous in Old Blush and cannot be identified in two R. rugosa genomes. These results indicate that the 6C and 3A genes are not functional S-RNase genes. Genetic mapping of candidate S-RNase genes identified from the genome database of R. chinensis “Old Blush”. (a) Genetic linkage maps of a diploid population FW [19]: The chromosome 3 of the female parent map TF3, the male parent map RW3, and their integrated map IT3. (b)-(c) Estimations of genomic positions of the S gene from the map cM of IT3. By using regression lines made by surrounding markers (green), the genomic positions of the S gene (red) were estimated in the genome database of (b) Hibrand-Saint Oyant et al. [5] and (c) Raymond et al. [6]. Synteny of the two genome databases of R. chinensis “Old Blush” chromosome 3 and the estimated position of S-RNase. Dot plots of orthologous genes of two genome databases: Chr3 [6] and RC3 [5] are made using SynMap of CoGe [40]. The CDS of the two genome databases were used to generate the dot plots. (a) Whole chromosome scale, and (b) the focus of the region marked by a blue rectangle. The S (3D) S-RNase is located on the 5.5 Mbp position of Chr3, and there is a 500 kbp gap in the synteny at 41.4–41.9 Mbp of RC3. The S (0A) S-RNase is estimated to be located on this gap region. The 0A gene is supposed to be the allelic gene to 3D S-RNase in roses. The 0A gene of Old Blush, named as “Rc0A_S”, was identified in the chromosome 0 (i.e. a contig not assigned to one of the seven chromosomes) of the genome database of Hibrand-Saint Oyant et al. [5]. We identified five 0A S-RNase-like genes from the genome databases and pistil transcriptomes of R. multiflora and R. rugosa (Fig. 1c). These 0A S-RNase-like genes are strongly expressed in pistils (Fig. 1b) and exhibit high genetic divergence (62.9%; Table 1), like the 3D S-RNase gene. According to the qPCR analyses of different tissues and developmental stages (Supplementary information 2), the 0A and 3D S-RNase genes show pistil-specific expressions. The analyses of pistil transcriptomes of seven diploid R. multiflora plants showed that four plants (Rm08, Rm09, Rm28, Rm33) expressed one 3D gene and one 0A gene, while the other three plants (Rm27, Rm2, Rm1) expressed no 0A genes but two 3D genes (Table 1). This also suggests that the 0A and 3D genes are allelic. To test the hypothesis that the 0A S-RNase is allelic to the 3D S-RNase, we genetically mapped these genes using a diploid mapping population. The female parent of the diploid mapping population (FW), “The Fairy” (TF), has two 0A genes, one of which is identical to the Rc0A_S (hereafter called as S for simplicity) of Old Blush and the other called S (Supplementary information 4). The TF has an S / S genotype of the 0A S-RNase. In the male parent, RW, we identified one 3D S-RNase-like gene and named it S (Supplementary information 4); it has a sequence similar to the Rc3D_S of Old Blush (hereafter called as S for simplicity). The RW has an S / S genotype of the 3D S-RNase (S is an anonymous allele). The 0A and 3D S-RNase genes are mapped to homologous positions of chromosome 3 of the female (TF3) and male (RW3) genetic linkage maps, respectively (Fig. 2a). Estimation of genomic position of the S gene from map cM of the integrated map (IT3) indicates that the S gene is located on 41.5 Mbp of the chromosome 3 (RC3) in the genome database of Hibrand-Saint Oyant et al. [5] (Fig. 2b) and is located on 5.6 Mbp of the chromosome 3 (Chr3) of the other genome database of Raymond et al. [6] (Fig. 2c). In the latter genome database, the S is located at 5.5 Mbp of the Chr3 close to the estimated position (Fig. 2c).
Figure 2

Genetic mapping of candidate S-RNase genes identified from the genome database of R. chinensis “Old Blush”. (a) Genetic linkage maps of a diploid population FW [19]: The chromosome 3 of the female parent map TF3, the male parent map RW3, and their integrated map IT3. (b)-(c) Estimations of genomic positions of the S gene from the map cM of IT3. By using regression lines made by surrounding markers (green), the genomic positions of the S gene (red) were estimated in the genome database of (b) Hibrand-Saint Oyant et al. [5] and (c) Raymond et al. [6].

The two haploid genome databases of R. chinensis “Old Blush” are inversely oriented (Fig. 3a). The estimated position of the S gene in RC3 (41.5 Mbp) is in an orthologous position to the S position in Chr3 (5.5 Mbp) and is located in a gap of synteny between the two genome databases (Fig. 3b).
Figure 3

Synteny of the two genome databases of R. chinensis “Old Blush” chromosome 3 and the estimated position of S-RNase. Dot plots of orthologous genes of two genome databases: Chr3 [6] and RC3 [5] are made using SynMap of CoGe [40]. The CDS of the two genome databases were used to generate the dot plots. (a) Whole chromosome scale, and (b) the focus of the region marked by a blue rectangle. The S (3D) S-RNase is located on the 5.5 Mbp position of Chr3, and there is a 500 kbp gap in the synteny at 41.4–41.9 Mbp of RC3. The S (0A) S-RNase is estimated to be located on this gap region.

The identification of the S-locus F-box genes

There should be pollen-expressed F-box genes flanking the S-RNase. This prediction was confirmed by the genome database search and transcriptomes of the stamens of Old Blush (Table 2). From 5.3 to 5.8 Mbp region in Chr3, we identified 12 F-box genes and one S-RNase (S). Homologous F-box genes were identified in 41.3–41.9 Mbp region in RC3, and three other F-box genes were also identified in the contig RC0 containing the S (Table 2). All F-box genes were expressed in the stamen but not in the pistil. According to the nomenclature of Kubo et al. [13], these F-box genes were termed as SLF or FBX (Supplementary Table S7-2). We also confirmed that the similar number of SLFs were identified in 500–600 kbp regions flanking to the 3D S-RNase-like genes of R. multiflora (S, S) and R. rugosa (S, S) genome databases, and they were expressed in the stamens but not in the pistils (Table S7-1a,b).
Table 2

List of candidate S-locus F-box genes identified by in silico blast search on R.chinensis “Old Blush” genome databases

Genomic positionaFPKMb
TypecGene nameS-genotypeChromosomeStartEndLengthAnnotationdStamenPistil
FBX12SC1_FBX12_pseudo SC1 Chr35 391 4215 390 1021320NATruncated
SLF1SC1_SLF1 SC1 Chr35 434 3925 433 1451248Chr3g045586122 (1.6)1 (0.4)
SLF2SC1_SLF2 SC1 Chr35 475 9465 477 1541209Chr3g0455891m18 (2.9)1 (0.3)
S-RNase S C1 _SRNase S C1 Chr3 5 498 387 5 488 716 9672 Chr3g0455911m 0 (0) 100 (32)
SLF3SC1_SLF3 SC1 Chr35 559 4725 558 2191254Chr3g0455931m6 (1.9)1 (0.1)
SLF4SC1_SLF4 SC1 Chr35 618 7545 619 9921239Chr3g045599118 (2.8)1 (0.2)
FBX1SC1_FBX1 SC1 Chr35 722 0235 720 6951329Chr3g04560918 (0.8)0 (0.2)
SLF5SC1_SLF5 SC1 Chr35 735 4635 734 1831281Chr3g045613113 (2.5)1 (0.2)
SLF6SC1_SLF6 SC1 Chr35 751 3385 752 6061269Chr3g045617111 (0.5)1 (0.2)
SLF8SC1_SLF8 SC1 Chr35 803 9025 805 1581257Chr3g04562418 (2.4)1 (0.3)
SLF9SC1_SLF9 SC1 Chr35 809 2475 808 0211227Chr3g045625120 (2.6)1 (0.4)
SLF10SC1_SLF10 SC1 Chr35 857 6625 856 3911272Chr3g04562917 (0.6)0 (0.3)
SLF11SC1_SLF11 SC1 Chr35 880 7435 879 5231221Chr3g04563115 (2)1 (0.2)
FBX4SC1_FBX4_pseudo SC1 Chr35 890 7215 889 4441278NATruncated
FBX2SC2_FBX2 SC2 RC027 704 97027 703 7051266NA5 (0.6)1 (0.2)
S-RNase S C2 _SRNase S C2 RC0 27 779 508 27 755 726 23 783 NA 0 (0) 44 (44)
FBX3SC2_FBX3 SC2 RC027 852 91827 854 1771260RC0G020760017 (1)1 (0.4)
SLF7SC2_SLF7 SC2 RC027 963 60227 962 3731230RC0G0208100m4 (0.7)0 (0.1)
FBX4SC2_FBX4 SC2 RC341 290 22241 291 4991278RC3G034230016 (2.4)1 (0.5)
SLF11SC2_SLF11 SC2 RC341 310 05641 311 2761221RC3G03425005 (0.9)0 (0.1)
SLF10SC2_SLF10 SC2 RC341 344 31441 345 5821269RC3G03430005 (0.5)0 (0.2)
SLF9SC2_SLF9 SC2 RC341 348 09541 349 3211227RC3G034310018 (3.1)1 (0.5)
SLF8SC2_SLF8 SC2 RC341 352 23441 350 9781257RC3G03432009 (2.6)1 (0.2)
SLF6SC2_SLF6_pseudo SC2 RC341 369 68041 368 4101271RC3G0343500mTruncated
SLF5SC2_SLF5 SC2 RC341 380 17941 381 4171239RC3G0343800m16 (2.8)1 (0.3)
SLF4SC2_SLF4_1 SC2 RC341 646 96241 648 1941233RC3G0344900m4 (0.5)0 (0)
SC2_SLF4_2 SC2 RC341 695 11141 696 3431233RC3G03452004 (1.7)0 (0.1)
SLF3SC2_SLF3 SC2 RC341 876 74141 877 9791239RC3G034610010 (2.8)1 (0.2)
SLF2SC2_SLF2 SC2 RC341 896 10041 897 3201221RC3G0346200m13 (2.2)1 (0.2)
SLF1SC2_SLF1 SC2 RC341 907 65741 908 9041248RC3G034630015 (3.1)1 (0.2)
FBX5SC2_FBX5 SC2 RC341 932 56241 931 2431320RC3G03467004 (1.6)1 (0.2)
FBX12SC2_FBX12 SC2 RC341 965 10241 966 4371336RC3G0347000mTruncated

Genome data source is Raymond et al. [6] for S, Hibrand-Saint Oyant et al. [5] for S.

FPKM (Fragments per kilobase of exon per million reads mapped) was calculated from the RNA-seq data of of three individuals of R. chinensis “Old Blush” (OB15, OB20, OB75), and averages with standard errors in parenthesis are shown.

Type was defined as either SLF (S-locus linked F-box) or FBX, with numbers indicating groups with homologous protein sequences. We categorized the type as SLF when at least four S-haplotypes out of six (S) had the F-box genes, whereas we called them FBX only when less than three S- haplotypes had the F-box genes (Table S7-2).

NA = No Annotation in original database. The sufflex “m” indicates “modified” annotation of original one.

List of candidate S-locus F-box genes identified by in silico blast search on R.chinensis “Old Blush” genome databases Genome data source is Raymond et al. [6] for S, Hibrand-Saint Oyant et al. [5] for S. FPKM (Fragments per kilobase of exon per million reads mapped) was calculated from the RNA-seq data of of three individuals of R. chinensis “Old Blush” (OB15, OB20, OB75), and averages with standard errors in parenthesis are shown. Type was defined as either SLF (S-locus linked F-box) or FBX, with numbers indicating groups with homologous protein sequences. We categorized the type as SLF when at least four S-haplotypes out of six (S) had the F-box genes, whereas we called them FBX only when less than three S- haplotypes had the F-box genes (Table S7-2). NA = No Annotation in original database. The sufflex “m” indicates “modified” annotation of original one. Figure 4 shows the structures of the putative S-locus regions identified in the Rosa genome databases. In the S-locus of Old Blush chromosome RC3, there is no S-RNase but there is a poly-N region, and its adjacent 10kbp regions (Block-a, Block-b) show close sequence identities with the end regions of the contig RC0. Block-a is 11 220 bp with a 98.8% identity between the RC3 and RC0. Block-b is 11 382 bp with a 97.5% identity. The S in the contig RC0 appears to be integrated into the poly-N region of RC3. Furthermore, SLF7 is present in the contig RC0, and its orthologous genes were identified in the S-locus regions of R. rugosa and R. multiflora (See also Fig.S7-2 for the alignment). Further, the co-segregations of SLF5 in RC3 and S in RC0 were tested using the FW mapping population (Supplementary information 8) and found to be perfectly co-segregated (n = 97).
Figure 4

The structure of the Rosa S-locus. The putative S-locus genomic regions were extracted from genome databases, and the positions of the S-RNase and F-box (SLF, FBX) genes were plotted. Homologous F-box genes are connected by dotted lines. Data are available in Table 1 for R. chinensis “Old Blush” and Supplementary Table S7-1a,b for R. rugosa and R. multiflora.

Validation of the S-RNase based S-genotyping by pollination experiments

In order to validate these S-RNase genes as the pistil-determinant of SI, pollination experiments on the pairs of diploid roses that share the same S-alleles were performed (Table 3). For example, plants with the S-genotype S / S (where S is an anonymous genotype that includes several different S-RNase genotypes) were selected and pollinated by pollen collected from the plants with S / S (where S is another anonymous genotype). After seed maturation, the S-genotypes of the seeds were determined, and the genotypic frequency of pollen fertilizing the ovule were calculated. A total 111 seeds were analyzed, and results show that no S pollen fertilized the ovules (i.e. all seeds are derived from pollen with S genotypes). These pollination tests with half-compatible pairs were performed for R. multiflora plants sharing S-like genes, i.e. Rm3D genes, S, S, S, or S (Fig. 1c), providing the same results with S by analyzing total 174 seeds (Table 3). Furthermore, the pollination experiment of TF (S / S) with the pollen of Old Blush (S / S) supported the prediction that the S pollen of Old Blush was rejected by the pistil of TF by analyzing 60 seeds (Table 3). The cross pollinations with R. multiflora plants sharing S-like genes, i.e. Rm0A genes, S, S, S (Fig. 1c), providing the consistently same results for total 124 seeds (Table 3). These data strongly support the hypothesis that the 3D and 0A S-RNase genes are the pistil-determinant of SI in roses.
Table 3

Validation of the S-RNase-based genotyping by pollination experiments with half-compatible pairs. SeeSupplementary information 5 for full details of the pollination experiments

Recipient plant (♀)Pollen donor (♂)Number of seeds (n) by S-genotypeNumber of Pollen Genotypes fertilizing the ovule
Plant IDa S-genotypebPlant IDa S-genotypeb S-genotype n S-genotype n S-genotype n S-genotype n
M Sx S C1 M S C1 Sy Sx / SC10 S C1 / S C1 0 Sx / Sy 67 S C1  / Sy44 S C1 : Sy0: 111
Rm09 S8 S 7 Rm08 S 7 S6 S8 / S70 S 7 / S 7 0 S8 / S6 10 S 7  / S616 S 7 : S6 0: 26
M Sx S 9 M S 9 Sy Sx / S90 S 9 / S 9 0 Sx / Sy 26 S 9  / Sy24 S 9 : Sy 0: 50
M Sx S 11 M S 11 Sy Sx / S110 S 11 / S 11 0 Sx / Sy 27 S 11  / Sy21 S 11 : Sy 0: 48
M Sx S 13 M S 13 Sy Sx / S130 S 13 / S 13 0 Sx / Sy 23 S 13  / Sy27 S 13 : Sy 0: 50
TF S21 S C2 OB S C2 SC1 S21 / SC20 S C2 / S C2 0 S21 / SC1 35 S C2  / SC125 S C2 : SC1 0: 60
M Sx S 6 M S 6 Sy Sx / S60 S 6 / S 6 0 Sx / Sy 26 S 6  / Sy23 S 6 : Sy 0: 49
M Sx S 10 M S 10 Sy Sx / S100 S 10 / S 10 0 Sx / Sy 30 S 10  / Sy17 S 10 : Sy 0: 47
Rm32 S13 S 12 Rm33 S 12 S9 S13 / S120 S 12 / S 12 0 S13 / S9 13 S 12  / S915 S 12 : S9 0: 28

M = Multiple plants with different S-alleles of S and S were used for experiments. TF = The Fairy, OB = Rosa chinensis “Old Blush”. Rm = R. multiflora.

The common S-alleles shared by parents are underlined. S is S-RNase in Chr3 of the Old Blush genome database [6], and S is the new S-RNase identified in RC0 of the Old Blush genome database [5]. Other S-alleles are S-RNase genes identified in R. multiflora; S and S are orthologous to S, and S, S, and S are orthologous to S (Fig. 1c). S and S indicate anonymous S-alleles (S ≠ S in the same row).

The structure of the Rosa S-locus. The putative S-locus genomic regions were extracted from genome databases, and the positions of the S-RNase and F-box (SLF, FBX) genes were plotted. Homologous F-box genes are connected by dotted lines. Data are available in Table 1 for R. chinensis “Old Blush” and Supplementary Table S7-1a,b for R. rugosa and R. multiflora. Validation of the S-RNase-based genotyping by pollination experiments with half-compatible pairs. SeeSupplementary information 5 for full details of the pollination experiments M = Multiple plants with different S-alleles of S and S were used for experiments. TF = The Fairy, OB = Rosa chinensis “Old Blush”. Rm = R. multiflora. The common S-alleles shared by parents are underlined. S is S-RNase in Chr3 of the Old Blush genome database [6], and S is the new S-RNase identified in RC0 of the Old Blush genome database [5]. Other S-alleles are S-RNase genes identified in R. multiflora; S and S are orthologous to S, and S, S, and S are orthologous to S (Fig. 1c). S and S indicate anonymous S-alleles (S ≠ S in the same row).

Linkage between the S-locus and important ornamental traits

To consider the effect of SI on the breeding of roses, the degree of genetic linkage between the S-locus and the genes underlying important ornamental characteristics (CF and DF) was tested. The KSN gene (controlling CF) has a 13.5Mbp distance from the S-locus, and the AP2-like gene (controlling DF) has a 9.0 Mbp distance from the S-locus (Table 4). By using two diploid F hybrid populations [19, 41], the recombinant frequencies between the S-locus and these genes were estimated. The recombinant frequency between S and KSN is 20% in RW, and those between S and AP2-like is 13% in RW and 40% in TF in the FW population. The recombinant frequency between S and AP2-like is 22% in 93 / 1–119 in the 94 / 1 population. Except for the high recombination between the S and AP2-like loci in TF, there are significant genetic linkages between the S and the KSN loci and the S and the AP2-like loci. This may act as an internal constraint on rose breeding, which is discussed later.
Table 4

Linkage between the S-locus and important ornamental traits in the rose

S-locus AP2-like (Double-flower) KSN (Continuous-flowering)
Genomic position (bp)a5 488 716 - 5 498 38714 492 546 - 14 506 76518 979 892 - 18 989 895
Gene annotationChr3g0455911Chr3g0468481 - Chr3g0468491Chr3g0473011 Chr3g0473021
Physical distance (Mbp)09.013.5
Recombination frequency (%)bRW13% (13 / 97)20% (19 × 97)
TF40% (39 / 97)NA
93 / 1–11922% (11 / 50)NA

Chr3 in the homozygous genome of R. chinensis “Old Blush” [6]

Recombination frequencies in S − AP2 and S − KSN were estimated in two diploid F-hybrid populations: the FW population consists of 97 individuals derived from the pollen donor RW and the seed parent TF [19]. 94 / 1 population consists of 50 individuals derived from the pollen donor 93 / 1–117 and the seed parent 93 / 1–119 [41]. For the two populations, genotypes of S, AP2, and KSN were determined and used to calculate the recombination frequencies. SeeSupplementary information 9 for detailed methods for genotyping.

Linkage between the S-locus and important ornamental traits in the rose Chr3 in the homozygous genome of R. chinensis “Old Blush” [6] Recombination frequencies in S − AP2 and S − KSN were estimated in two diploid F-hybrid populations: the FW population consists of 97 individuals derived from the pollen donor RW and the seed parent TF [19]. 94 / 1 population consists of 50 individuals derived from the pollen donor 93 / 1–117 and the seed parent 93 / 1–119 [41]. For the two populations, genotypes of S, AP2, and KSN were determined and used to calculate the recombination frequencies. SeeSupplementary information 9 for detailed methods for genotyping.

Sc-alleles of old Chinese CF cultivars

To identify the Chinese S-alleles (S) introduced into Europe in the 18th century and examine their fate during the past 200 years’ history of rose breeding, we first analysed the S-genotypes of old Chinese CF cultivars (Table 5). The S-genotyping showed that the old Chinese cultivars frequently shared S and S alleles with Old Blush (Table 5). Hume’s Blush Tea-scented China has the same S-genotype (S / S) as Old Blush. The other CF cultivars, such as Slater’s Crimson China, R. chinensis “Mutabilis”, and R. chinensis, have either S or S, indicating that they have unidentified S-alleles. These unidentified S-alleles were amplified with RT-PCR using mRNA prepared from their pistils with degenerate primers for S-RNase in roses and sequenced (Fig. S10-1). Partial (S, S) and full (S) S-RNase cDNA sequences were obtained and analysed via BLASTX searches against a local protein database of the Rosa S-RNase and S-RNase-like proteins, shown in Figure 1c. The BLAST search confirmed that the S sequences are closest to the 0A S-RNase proteins (SeeFig. S10–2 for the alignment). As a consequence, S-genotypes were estimated as follows: Mutabilis = S / S, R. chinensis = S / S, Slater’s Crimson China = S / S or S / S / S (triploid type), and Sanguinea = S / S (Table 5).
Table 5

Genotypes of genes controlling important ornamental characteristics in old Chinese rose cultivars. The presence (●) or absence (×) of each gene was determined by PCR. Roses are classified into five types based on the KSN genotypes: Type 1 = ksn / ksn; Type 2 = ksn / ksn; Type 3 = ksn / ksn; Type4 = ksn / KSN; Type 5 = KSN / KSN. Roses in Types 1–3 are continuous-flowering (CF), and those in Types 4–5 are once-flowering (OF).

S-allele KSN AP2-like Phenotype
TypeName SC1 SC2 SC3 SC4 SC5 ksncopia ksnnull KSNw ap2 BloomFlower
1 Rosa chinensis “Old Blush”××××CFD
Hume’s Blush Tea-scented China××××CFD
Rosa chinensis “Mutabilis”×××××CFS
2 Rosa chinensis ×××××CFD
Slater’s Crimson China (×2)×××××CFD
Slater’s Crimson China (×3)××××CFD
3 Rosa chinensis “Sanguinea”××××××CFS
4 Rosa chinensis “Single white-eye”××××××OFS
Rosa chinensis “Narrow-leaflet”×××××OFD
Rosa chinensis “Major”××××OFD
5Fortune’s Double Yellow×××××OFD
Rosa odorata var. erubescens×××××OFD
Parks’ Yellow Tea-scented China×××××××OFD
Rosa multiflora “Carnea”××××××OFD
Rosa odorata “Double Light Yellow”××××××OFD

†, CF = Continuous-flowering, OF = Once-flowering, D = double-flower, S = single-flower.

Genotypes of genes controlling important ornamental characteristics in old Chinese rose cultivars. The presence (●) or absence (×) of each gene was determined by PCR. Roses are classified into five types based on the KSN genotypes: Type 1 = ksn / ksn; Type 2 = ksn / ksn; Type 3 = ksn / ksn; Type4 = ksn / KSN; Type 5 = KSN / KSN. Roses in Types 1–3 are continuous-flowering (CF), and those in Types 4–5 are once-flowering (OF). †, CF = Continuous-flowering, OF = Once-flowering, D = double-flower, S = single-flower. The genotyping of the KSN gene responsible for the CF characteristic identified five KSN genotypes (Table 5). Three KSN alleles, i.e. KSN (wild allele), ksn (the copia-retrotransposon inserted, non-functional allele) [10], and ksn (the deletion, non-functional allele) [5] were distinguished by PCR. New primers were designed to amplify the ksn allele (Fig.S10-3). Old Blush, Hume’s Blush Tea-scented China, and Mutabilis are heterozygous for the two non-functional alleles (ksn / ksn), whereas Slater’s Crimson China and R. chinensis are homozygous for one non-functional allele (ksn / ksn), and Sanguinea is homozygous for the other non-functional allele (ksn / ksn). Other old Chinese cultivars with once-flowering (OF) behaviour all have the wild KSN allele (KSN). The ap2 allele (a transposon-inserted allele) of AP2-like [5, 11] exists in all old Chinese cultivars with a double flower phenotype (Table 5).

Introgression of Chinese S-alleles into European roses

153 rose cultivars with a variety of breeding histories were genotyped to test for the presence of ksn and ap2 and their associated S (SeeSupplementary data Table D6 for the list of cultivars) The frequencies of rose cultivars with these genes were calculated, along with their breeding periods (Fig. 5). Only 25% of European roses bred before 1850 had either ksn or ksn. In particular, roses in the Gallica, Damask, Centifolia, and Alba groups have neither ksn nor ksn and show no signs of introgression from Chinese roses (Table S10-1). In contrast, some old roses in the Moss group, such as Mousseline (bred in 1855), had ksn alleles, indicating the onset of introgression from Chinese CF roses in this period. These roses also had S or S, demonstrating the parallel introgression of S into European roses.
Figure 5

Frequency of rose cultivars with genes that originated from old Chinese cultivars. (a) Frequency of roses carrying CF genes (ksn), associated S genes (S), and double-flower gene (ap2). ksn indicates the frequency with which roses have ksn and/or ksn; S indicates the frequency with which roses have S, S, S, S, and/or S. The number of rose cultivars studied are shown above breeding periods. (b) Individual frequency of roses that have ksn or ksn, and those roses that have S, S, S, S, or S. Breeding periods: I = before 1850, II = 1850–1900, III = 1900–1940, IV = 1940–1980, V = 1980–2020. The data used to calculate frequencies are available in Supplementary data Table D6.

Frequency of rose cultivars with genes that originated from old Chinese cultivars. (a) Frequency of roses carrying CF genes (ksn), associated S genes (S), and double-flower gene (ap2). ksn indicates the frequency with which roses have ksn and/or ksn; S indicates the frequency with which roses have S, S, S, S, and/or S. The number of rose cultivars studied are shown above breeding periods. (b) Individual frequency of roses that have ksn or ksn, and those roses that have S, S, S, S, or S. Breeding periods: I = before 1850, II = 1850–1900, III = 1900–1940, IV = 1940–1980, V = 1980–2020. The data used to calculate frequencies are available in Supplementary data Table D6. The frequency of roses carrying ksn alleles dramatically increased to 86% of roses bred from 1850–1900, gradually increasing to 100% in roses bred from 1980–2020 (Fig. 5). On the other hand, the frequency of roses with S reached a peak (95%) in roses bred from 1850–1900, slightly decreasing to 83% in roses bred from 1900–1940, but not changing much until recently (Fig. 5). Due to the small number of samples, these changes in the frequency of rose cultivars after the period II are not statistically significant. The frequency of rose cultivars with ksn in the period II (86%) is not significantly different from that in the period V (100%) (Fisher’s exact test, p = 0.0507). The frequency of rose cultivars with Sc in the period II (95%) is not significantly different from that in the period V (82%) (Fisher’s exact test, p = 0.2316). The S and S are major S alleles, and the frequency of roses with the S increased during the 19th century (Fig. 5c). In contrast, roses with the S and S were found with low frequency (<10%) in modern cultivars. The frequency of roses with ap2 showed the same trend of frequency changes with breeding periods as ksn (Fig. 5), suggesting that the ap2 allele also originated from Chinese roses. Old European roses with the DF phenotype, such as Rosa gallica officinalis, Quatre Saisons, and Chapeau de Napoleon, had no ap2 allele (Table S10-1), indicating that there is another genetic factor for DF phenotype.

The wild origin of SC-alleles

By screening a total of 95 plants from 25 wild Rosa species (Table S11-1) with S-specific PCRs, putative wild ancestors of S were identified (Table 6). Positive PCR amplifications with Sspecific primers were observed for two species, R. multiflora and Rosa brunonii (Table S11-2). The sequencing of the PCR products shows that R. multiflora sequences are 100% identical to the S–sequence of Old Blush, while R. brunonii has a 98.4% identity with S (8 SNPs per 500 bp). In the RNA-seq analysis of pistil-expressed genes, it was already found that three R. multiflora plants (Rm1, Rm2, Rm3) have S (= Rm3D_S in Fig. 1c), with 100% identical amino acid sequences to Old Blush S (= Rc3D_S). The cDNA sequence of Rm3D_S is also 100% identical to Rc3D_S(Supplementary data Table D4).
Table 6

A survey of Chinese wild Rosa species with S-specific PCRs to identify candidate ancestral species of old continuous-flowering roses

SectionSpeciesNo. of individuals studiedaPCR tests for the presence/absence of specific S-RNase allelesb
SC1 SC2 SC3 SC4 SC5
Chinenses Rosa chinensis var. spontanea34★5
Rosa gigantea c 8★1△2★1
Rosa lucidissima 2
Synstylae Rosa multiflora 10★1
Rosa multiflora var. cathayensis9△2
Rosa brunonii 4△2
Rosa helenae 2△2
Rosa soulieana 2△1
Rosa rubus 4△1
Rosa luciae 1
Rosa moschata 2
Rosa anemoneflora 1

Information of sample source is available in Supplementary data Table D6.

★ = no. of individuals with 100% identical sequences to CF cultivars, △ = no. of individuals with positive PCR amplifications, while the sequences were not identical to those of CF cultivars. Empty cells = no PCR amplifications for all individuals studied.

Results of one individual is based on a genome data (SRR6175515).

A survey of Chinese wild Rosa species with S-specific PCRs to identify candidate ancestral species of old continuous-flowering roses Information of sample source is available in Supplementary data Table D6. ★ = no. of individuals with 100% identical sequences to CF cultivars, △ = no. of individuals with positive PCR amplifications, while the sequences were not identical to those of CF cultivars. Empty cells = no PCR amplifications for all individuals studied. Results of one individual is based on a genome data (SRR6175515). No wild roses were found with positive PCR amplifications from S-specific primers (Supplementary data Table D6). However, it was found that the genome re-sequencing individual of R. gigantea (SRR6175515) has 100% identical sequences to S of Old Blush (Fig. S11-2). For the S isolated from Slater’s Crimson China, PCR amplifications with S-specific primers were found for three species, R. chinensis var. spontanea, R. gigantea, and R. multiflora var. cathayensis (Table 6). Sequencing of the PCR products showed that only R. chinensis var. spontanea has a 100% identical sequence with S of Slater’s Crimson China (Table S11-3). For the S isolated from R. chinensis, PCR amplifications with S-specific primers were found for three species, R. gigantea, Rosa soulieana, and Rosa helenae (Table 6). Sequencing of the PCR products showed that only one plant (pink flower type) of R. gigantea has a 100% identical sequence to the S (Table S11-4). For theS isolated from Mutabilis, a positive PCR amplification with S-specific primers was found only in a wild individual of R. rubus (Table 6), while the sequencing showed that the PCR product from Rosa rubus is not identical to S of Mutabilis (4 SNPs per 221 bp; Fig. S11-3). The putative genetic connections inferred from the shares of S-alleles are summarized in Figure 6.
Figure 6

Hypothetical wild ancestral species of old Chinese cultivars inferred from the shares of S-alleles. Five S-alleles (S,S, S, S, and S) were identified from old Chinese CF cultivars and can be traced to four wild ancestors. The uppermost roses are CF cultivars, and the others are once-flowering cultivars and species. The shares of S-alleles are shown by dotted line connections. Cultivars are classified into five types based on their KSN genotypes (Table 5), indicated by different colored boxes.

Hypothetical wild ancestral species of old Chinese cultivars inferred from the shares of S-alleles. Five S-alleles (S,S, S, S, and S) were identified from old Chinese CF cultivars and can be traced to four wild ancestors. The uppermost roses are CF cultivars, and the others are once-flowering cultivars and species. The shares of S-alleles are shown by dotted line connections. Cultivars are classified into five types based on their KSN genotypes (Table 5), indicated by different colored boxes.

Discussion

Identification and validation of the Rosa S-locus

The Rosa S-locus was identified and confirmed by clarifying some unresolved issues in previous studies. The genome sequencing of R. chinensis “Old Blush” proposed a candidate region of the S-locus [5]. This region corresponds to the region of 3C gene (Fig. 1a). We concluded that the 3C gene is not the true S-RNase because of (i) the low or undetectable expressions of 3C gene in the pistil (Fig. 1b) and (ii) the low level of sequence divergence between alleles and among individuals (Fig. 1c). Vieira et al. [15] analyzed another genome of Old Blush [6] and firstly reported that the 3D gene is the Rosa S-RNase. The S-RNase “Rchinensis1_3-Rchinensis2_27” in Vieira et al. [15] corresponds to the 3D gene (= S) in this study. Vieira et al. [15] validated the S-RNase based on the results of no fruit set of a few individuals pollinated by other individuals with the same S-genotypes. This cannot exclude the possibility that inbreeding depression results in no fruit set. We provide strong evidence that the 3D S-RNase is the pistil determinant of the SI in roses by genotyping more than 400 seeds produced by a number of half-compatible pairs of roses (Table 3). This study also identified the 0A gene (= S) in the contig RC0 of the Old Blush genome database (Fig. 1c) and suggested via the mapping approach (Fig. 2, 3) that it is an unidentified allele of the S of Old Blush. The sequence analysis of the Rosa S-locus suggested that the contig RC0 can be integrated into a poly-N region of the S-locus of RC3 (Fig. 4). The perfect co-segregation of SLF5 in RC3 and of S in RC0 in a mapping population (n = 97; SeeSupplementary information 8) supports this hypothesis. Furthermore, we identified S-like genes from the pistil transcriptomes of wild individuals of R. multiflora and R. rugosa (Rm0A and Rg0A genes; Table 1). The pollination experiments using half-compatible pairs of roses that share S or S-like genes indicated that these genes are the pistil-determinants of SI in the rose (Table 3). To determine the precise location of S in the S-locus of Old Blush, a BAC library will need to be constructed and sequenced, as shown by Liang et al. [42] in their analyses of the S-locus structure of Citrus. Based on the identification of SLFs flanking to the S-RNase, we estimated that the S-locus of Old Blush spans approximately 500 kbp (Fig. 4). Chen et al. [16] reported that the S-locus of R. rugosa spanned 667kbp, including one S-RNase and 19 F-box genes (Table S7-1b). In order to confirm the physical size of the Rosa S-locus, further co-segregation analyses of SLFs and S-RNase are required. The evolutionary divergence analysis of SLFs in comparison with S-RNase (Fig.S7-3) suggest that the SI of Rosa is controlled by the non-self recognition system as previously reported by Vieira et al. [15]. In the non-self recognition system, polyploidization will break down the SI [12, 13]. In support to the prediction, we found that the colchicine-induced chromosome doubling of a diploid rose resulted in self-compatible tetraploid (Table S6-1).

Insights into the breeding of CF roses

The estimations of the degree of genetic linkage between the S-locus and the ksn controlling CF and the ap2 controlling DF indicate weak but significant genetic linkages between them. Since the CF is a recessive characteristic, the CF rose (ksn-homozygote) cannot be created if ksn remains to link with a same S-allele. The 20% of recombination frequency (Table 4) suggests that when we cross roses carrying the ksn linked with a same S-allele, only 20% seedlings are expected to be CF. Therefore, information of S-alleles linked with ksn is helpful to make the CF rose breeding effective. The SI constraint on breeding would be lower in DF than in CF. Since DF is a dominant characteristic, one ap2-allele is enough to make DF roses [5, 11]. Furthermore, we suggest that there had been already other genes or alleles for DF in European roses before the introgression of ap2 from China (Table S10-1). Further studies are necessary to confirm and assess the degrees of genetic linkages between the S-locus and the loci controlling important ornamental traits, such as CF, DF, thornlessness, and resistance against black spot disease. Our estimation of the degree of genetic linkage depends on a few hybrid populations (Table 4), which cannot represent diverse rose cultivars used for breeding. In addition, the low recombination rates associated with regions adjacent to breakpoints in inversion heterozygotes [43] might result in a stronger genetic linkage of the S-locus with ksn than with ksn. As we expected, S alleles were introduced into European roses in the 18th century with the CF allele ksn. However, the frequency of rose cultivars having S remains high (> 80%) in modern cultivars, although it tended to decrease from 95% of cultivars from1850–1900 (Fig. 5a). Due to the fact that many rose cultivars possess the same Sc alleles, S-genotyping in advance of breeding is helpful for making the diploid rose breeding effective. Furthermore, due to the fact that the two Sc alleles S and S are not common (< 10%) in modern cultivars (Fig. 5c), original Chinese CF cultivars of S (R. chinensis) and S (Mutabilis) are still useful breeding materials to introduce these rare S-alleles into CF cultivars. As the SI can break down with polyploidization (Table S6-1), breeding between polyploid cultivars may not necessarily consider SI constraints. However, substantial portion (22–35%) of modern cultivars are estimated to be diploid [44, 45], and most wild species and Chinese old cultivars are diploid. Therefore, diploid rose breeding is still an important part of rose breeding.

Insights into the wild origin of CF roses

We identified the putative ancestors of the S-alleles of old Chinese cultivars by screening wild roses with the S-specific primers (Table 6). The results confirmed the hybrid origin of the old Chinese cultivars with R. chinensis var. spontanea and R. gigantea in the section Chinenses (Indicae) and the introgression from R. multiflora in the section Synstylae (Fig. 6). The genome sequencing of Old Blush reported a sign of introgression from the section Synstylae [6], but which species of the section is involved in the formation of the Old Blush genome is not clarified. Yang et al. [30] discussed that R. multiflora is a candidate species contributing to the introgression, but there are many other candidates in the section. We investigated nine candidate species in the section Synstylae in southwestern China (Table 6) and demonstrated that only R. multiflora has an identical S-allele with Old Blush, providing a new evidence for the genetic link between R. multiflora and old Chinese cultivars.

Materials and methods

By using S-RNase of other Rosaceae crops, including Malus domestica (AAA79841.1), Prunus dulcis (AAL35960.2), Prunus avium (BAA36389.1, CAC27788.1), and Prunus persica (BAF42768.1) as queries, the Old Blush genome databases [5, 6] were searched using TBLASTN to identify candidate S-RNase. All genomic regions with significant hits (E-value <10–10), including the regions without any annotations, were listed and manually annotated to infer the coding regions. The other genome databases of Rosa multiflora [7] and R. rugosa [16, 39] were then searched using the candidate S-RNase genes of Old Blush as queries to identify their orthologous genes. For the phylogenetic reconstruction of S-RNase-like genes in roses, deduced amino acid sequences were aligned using MUSCLE and then converted back to DNA sequences. The maximum-likelihood phylogenetic tree was constructed from the nucleotide protein-coding sequence alignment by FastTree [37, 38] using the Jukes-Cantor model of nucleotide evolution.

RNA-seq analysis

To confirm the expression of candidate S-RNase in the pistil and to isolate new S-RNase alleles, RNA sequencing (RNA-seq) was used. Flower buds one or a few days before anthesis were collected from three individuals of Old Blush, eight individuals of R. multiflora, and one individual of R. rugosa. Pistils and stamens were collected from the buds and immediately frozen with liquid nitrogen. Total RNAs were extracted using a commercial kit according to the protocol described in Dubois et al. [46], and RNA-seq data was obtained through poly-A purification and the 150 bp paired-end method. The RNA-seq reads were mapped to the CDSs of specific genes and whole genome data to calculate FPKM (Fragments per kilobase of exon per million reads mapped) values. To analyze the data of wild R. multiflora and R. rugosa plants without any specific reference genome databases, the RNA-seq reads were assembled first, followed by the construction of a local database of mRNA sequences, and the database was blasted using the Old Blush genes as queries to identify orthologous genes. The assembly of RNA-seq data was conducted by using the velvet and tadpole algorithm, with default parameters in Geneious Prime 2020.

Genetic mapping of the new S-RNase in chromosome zero

Genetic mapping of a newly-identified S-RNase in the contig that is not assigned to seven chromosomes was performed by using an F diploid mapping population (FW) [19, 47, 48]. Three new genetic markers linked to the candidate S-RNase were added to the previous map to estimate the genomic position of the S-gene (Supplementary information 4).

SLF identification

ORFs longer than 1kbp were extracted from the 1Mbp genomic regions surrounding the S-RNase in the Old Blush, R. multiflora, and R. rugosa genomes, and the F-box genes were identified by a Blast search of the ORFs. A total of 20 pairs of diploid roses that share one S-allele (i.e. half-compatible) were used for pollination experiments to validate the candidate S-RNase gene. Old Blush (S / S), The Fairy (S / S), R. chinensis “Single white-eye” (S / S), and 11 wild individuals of R. multiflora were selected based on their S-genotypes. Before anthesis, flower buds were bagged to prevent open pollination. Petals and anthers were removed at the balloon stage, and outcross pollen grains were put on the exposed stigma. Pollinations were carried out from April to May in 2018 and 2020, and matured fruits were collected from September to October of the same years. The fruits were opened in the laboratory, the achenes (seeds) were collected, and the S-genotypes of the seeds were determined by PCR (Supplementary information 5). Two F-diploid mapping populations (FW [19] and 94 / 1 [41]) were used to estimate the recombination frequencies between the S-locus and the genes controlling CF (KSN) and DF (AP2-like). The genotyping of the S-locus, KSN, and AP2-like was performed by PCR (Supplementary information 9), and recombination frequencies were calculated.

Introgression of the Chinese S-alleles into modern roses

A total of 153 rose cultivars were selected from a wide range of breeding ages and classifications, and young leaves were collected from 20 rose gardens and nurseries in Japan from 2014–2020 (Supplementary data Table D6). DNA was extracted from the young leaves by using the Nucleospin Plant II kit (Macherey-Nagel) according to the manufacturer’s protocol. A PCR was performed to test whether the roses have specific S-alleles, mutated-alleles ksn, wild-allele KSN, or the mutated-allele ap2 by using EmeraldAmp PCR Master Mix (TaKaRa) with thermal cycling: (1) 2 min of 95°C; (2) 30 s of 95°C; (3) 30 s of Tm°C; (4) 20–50 s of 72°C; and (5) go back to step (2) 29 times. Information on primer sequence and annealing temperature (Tm) for PCR is available from Supplementary data Table D1. To identify other Chinese S-alleles originally linked with ksn, we used three old Chinese CF cultivars, Slater’s Crimson China, Mutabilis, and Rosa chinensis, extracted RNAs from their pistils, and performed RT-PCR with primer sets designed on conserved sites of S-RNase. Partial or full sequences of three new S-alleles were determined and named as S, S, and S and specific primers were designed for each (Supplementary information 10).

Wild roses carrying the same S-alleles as old cultivars in China

By screening wild rose species carrying the same S-alleles as the old Chinese CF cultivars, the wild ancestral species were inferred. A total of 25 Rosa species were surveyed (Table S11-1), with a focus on 13 species in southwestern China (Sichuan and Yunnan Provinces; Fig. S11-1), where a wild type of R. chinensis, named R. chinensis var. spontanea, is naturally distributed. Field collections of wild plants in southwestern China were performed during the flowering seasons of 2018 and 2019. Forty wild plants of seven species were sampled, and S-genotyping was performed by PCR. The nucleotide sequences of the PCR products were determined to confirm the results.

Acknowledgements

This study was supported by the JSPS KAKENHI No. 24688004 and 17H04616. We would like to express our appreciation for Cheng Tingting and Jin Tang in Sichuan University, and Ryosuke Sakamoto in Osaka Institute of Technology, for their helpful support during the fieldwork in China. We are also appreciative of the support from The Rose Culture Institute, Japan, and Drs. Masaki Ochiai and Kunio Yamada at Gifu University, for providing rose materials.

Contributions

K.K. designed the project and wrote the manuscript. K.K., Y.U., W.L., and G.W. conducted fieldwork in China. K.K., S.M., M.L., and T.D. performed pollination experiments. K.K., T.H., and S.O. analyzed candidate genes, L.H.O., F.F., M.L., and T.D. provided mapping populations. All authors are involved in final manuscript editing.

Data availability

Sequence data are available in Supplementary data Tables. NGS read data of RNA-seq are available from DDBJ, BioProject PRJDB12320, with accession numbers from DRR321244 to DRR321267.

Conflict of interests

The authors declare no conflicts of interest associated with this manuscript.

Supplementary data

Supplementary data is available at Horticulture Research online. Click here for additional data file.
  36 in total

1.  Collaborative non-self recognition system in S-RNase-based self-incompatibility.

Authors:  Ken-ichi Kubo; Tetsuyuki Entani; Akie Takara; Ning Wang; Allison M Fields; Zhihua Hua; Mamiko Toyoda; Shin-ichi Kawashima; Toshio Ando; Akira Isogai; Teh-hui Kao; Seiji Takayama
Journal:  Science       Date:  2010-11-05       Impact factor: 47.728

2.  Comparative analysis of the within-population genetic structure in wild cherry (Prunus avium L.) at the self-incompatibility locus and nuclear microsatellites.

Authors:  Silvio Schueler; Alexandra Tusch; Florian Scholz
Journal:  Mol Ecol       Date:  2006-10       Impact factor: 6.185

Review 3.  How to usefully compare homologous plant genes and chromosomes as DNA sequences.

Authors:  Eric Lyons; Michael Freeling
Journal:  Plant J       Date:  2008-02       Impact factor: 6.417

4.  Tinkering with the C-function: a molecular frame for the selection of double flowers in cultivated roses.

Authors:  Annick Dubois; Olivier Raymond; Marion Maene; Sylvie Baudino; Nicolas B Langlade; Véronique Boltz; Philippe Vergne; Mohammed Bendahmane
Journal:  PLoS One       Date:  2010-02-18       Impact factor: 3.240

5.  Insights into the Prunus-Specific S-RNase-Based Self-Incompatibility System from a Genome-Wide Analysis of the Evolutionary Radiation of S Locus-Related F-box Genes.

Authors:  Takashi Akagi; Isabelle M Henry; Takuya Morimoto; Ryutaro Tao
Journal:  Plant Cell Physiol       Date:  2016-04-14       Impact factor: 4.927

6.  The TFL1 homologue KSN is a regulator of continuous flowering in rose and strawberry.

Authors:  Hikaru Iwata; Amèlia Gaston; Arnaud Remay; Tatiana Thouroude; Julien Jeauffre; Koji Kawamura; Laurence Hibrand-Saint Oyant; Takashi Araki; Béatrice Denoyes; Fabrice Foucher
Journal:  Plant J       Date:  2011-10-25       Impact factor: 6.417

7.  Characterization of black spot resistance in diploid roses with QTL detection, meta-analysis and candidate-gene identification.

Authors:  D C Lopez Arias; A Chastellier; T Thouroude; J Bradeen; L Van Eck; Yannick De Oliveira; S Paillard; F Foucher; L Hibrand-Saint Oyant; V Soufflet-Freslon
Journal:  Theor Appl Genet       Date:  2020-08-25       Impact factor: 5.699

8.  Genetic relationships and evolution of old Chinese garden roses based on SSRs and chromosome diversity.

Authors:  Jiongrui Tan; Jing Wang; Le Luo; Chao Yu; Tingliang Xu; Yuying Wu; Tangren Cheng; Jia Wang; Huitang Pan; Qixiang Zhang
Journal:  Sci Rep       Date:  2017-11-13       Impact factor: 4.379

9.  A miR172 target-deficient AP2-like gene correlates with the double flower phenotype in roses.

Authors:  Léa François; Marion Verdenaud; Xiaopeng Fu; Darcy Ruleman; Annick Dubois; Michiel Vandenbussche; Abdelhafid Bendahmane; Olivier Raymond; Jérémy Just; Mohammed Bendahmane
Journal:  Sci Rep       Date:  2018-08-27       Impact factor: 4.379

10.  A high-quality genome sequence of Rosa chinensis to elucidate ornamental traits.

Authors:  L Hibrand Saint-Oyant; T Ruttink; L Hamama; I Kirov; D Lakhwani; N N Zhou; P M Bourke; N Daccord; L Leus; D Schulz; H Van de Geest; T Hesselink; K Van Laere; K Debray; S Balzergue; T Thouroude; A Chastellier; J Jeauffre; L Voisine; S Gaillard; T J A Borm; P Arens; R E Voorrips; C Maliepaard; E Neu; M Linde; M C Le Paslier; A Bérard; R Bounon; J Clotault; N Choisne; H Quesneville; K Kawamura; S Aubourg; S Sakr; M J M Smulders; E Schijlen; E Bucher; T Debener; J De Riek; F Foucher
Journal:  Nat Plants       Date:  2018-06-11       Impact factor: 15.793

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.