Literature DB >> 22745530

Evolution of the RALF Gene Family in Plants: Gene Duplication and Selection Patterns.

Jun Cao1, Feng Shi.   

Abstract

Rapid alkalinization factors (RALFs) are plant small peptides that could induce a rapid pH increase in the medium of plant cell suspension culture and play a critical role in plant development. The evolutionary process of the RALF gene family remains unclear. To obtain details of the phylogeny of these genes, this study characterized RALF genes in Arabidopsis, rice, poplar and maize. Phylogenetic trees, evolutionary patterns and molecular evolutionary rates were used to elucidate the evolutionary process of this gene family. In addition, the different signatures of selection, expression patterns, and subcellular localization of RALFs were also analyzed. We found that the RALF gene family had a rapid birth process after the separation of the eudicot and monocot species about 145 million years ago, that tandem duplication played a dominant role in the expansion of Arabidopsis and rice RALF gene family, and that RALFs were under purifying selection according to estimations of the substitution rates of these genes. We also identified a diverse expression pattern of RALF genes and predominant extracellular localization feature of RALF proteins. Our findings shed light on several key differences in RALF gene family evolution among the plant species, which may provide a scaffold for future functional analysis of this family.

Entities:  

Keywords:  RALF; evolution; selection; tandem duplication

Year:  2012        PMID: 22745530      PMCID: PMC3382376          DOI: 10.4137/EBO.S9652

Source DB:  PubMed          Journal:  Evol Bioinform Online        ISSN: 1176-9343            Impact factor:   1.625


Introduction

Peptide signaling is important for cell-to-cell communication and participates in a variety of developmental processes and environmental responses. A number of genes encoding small-secreted peptides have been identified in plants and a certain proportion of them are hormones.1 These peptides play critical role in all aspects of the plant life cycle and have diverse functions. Such as, CLV3 (CLAVATA3) peptide regulates meristem size.2 Peptide systemin induces the systemic defense response.3 ENOD40 encodes two small peptides, both of which can affect the normal nodule development.4 Defensins are involved in the innate immune system of plants.5 PSK (phytosulfokine) peptide has been demonstrated to promote cellular proliferation and transdifferentiation.6,7 SCR peptide is the pollen self-incompatibility recognition factor in the Brassicaceae species.8,9 PLS (POLARIS) peptide is involved in vascularization, longitudinal cell expansion and increased radial expansion.10 ROT4/DVL1 (ROTUNDIFOLIA4/DEVIL1) peptide regulates polar cell proliferation on the longitudinal axis of organs.11,12 IDA (INFLORESCENCE DEFICIENT IN ABSCISSION) is a family of secreted peptides identified to be involved in petal abscission.13 LURE peptides produced by synergid cells attract pollen tubes to the embryo sac.14 RGF (root meristem growth factor) is a 13 amino acid secreted peptide involved in the maintenance of root stem cell niche.15 Dodeca-CLE peptides suppress the plant stem cell differentiation.16 In addition, RALF is a recently discovered family of plant peptide that plays a role in plant cell growth as will be described below.17 RALF is a small peptide and first discovered in tobacco leaf extracts due to its ability to cause a rapid alkalinization in the medium of tobacco cell suspension cultures.18 Subsequently, this gene is also identified in a wide variety of plant species, including gymnosperms, monocots and dicots.19–27 The ubiquity of RALF suggests its importance in plant growth and development. Like other plant polypeptide hormones, such as phytosulfokine6 and systemin,28 most RALF genes encode pro-peptides that need proteolytic processing. These RALF precursors have a conserved dibasic site upstream of the active peptide that is required for pro-peptide processing and activity.29,30 These results are consistent with localization of the Nicotiana benthamiana RALF-GFP fusion protein, which localizes first to the endoplasmic reticulum (ER) and later to the cell wall.31 Another characteristic of the peptide is the four conserved cysteines in the active peptide region that are likely to be involved in disulfide bridges and are required for activity.18 RALFs are a new type of plant peptide hormones that participate in diverse biological processes. Such as, activation of protein kinases, inhibition of root growth and development,18,21,27 regulation of fruit maturation,20 nodule formation,22 tissue expansion10 and pollen development,24,26 and so on. Interestingly, the number of RALF genes varies greatly from species to species. For instance, over 30 RALF genes have been identified in Arabidopsis,32 while only two RALF genes have been confirmed in Selaginella moellendorffii (see below). The critical role of RALFs and the diversity of RALF gene number from species to species prompt us to investigate how RALF genes have evolved in plant kingdom and how and why different species have acquired such different numbers of RALF gene. Here, we presented evidence that the evolution of the plant RALF gene family had a rapid birth process, and that tandem duplication rather than segmental duplication played a dominant role in the expansion of the RALF gene family. Our study also revealed a diverse expression pattern of RALF genes and the predominant extracellular localization feature of RALF proteins in Arabidopsis.

Materials and Methods

Sequence identification and conserved motif analysis of RALF genes

To identify potential members of the RALF gene family in Arabidopsis, poplar, rice and maize, we performed multiple database searches. Published Arabidopsis RALF gene sequences32 were retrieved and used as queries in BLAST searches against NCBI (http://www.ncbi.nlm.nih.gov) and Phytozome (http://www.phytozome.net). The program MEME33 (http://meme.sdsc.edu) was used to identify motifs in the candidate RALF protein sequences. MEME was run locally with the following parameters: number of repetitions = any, maximum number of motifs = 6, and with optimum motif widths constrained to between 6 and 200 residues.

Alignment and phylogenetic analysis

To generate the alignment of the 91 RALF proteins from the Arabidopsis, rice, poplar and maize, COBALT34 program was used. Phylogenetic analyses of the RALF proteins based on amino acid sequences were carried out using Neighbor-Joining (NJ) methods in MEGA 5.35 NJ analyses were done using p-distance methods, pairwise deletion of gaps, and the default assumptions that the substitution patterns among lineages and substitution rates among sites were homogeneous. Support for each node was tested with 1,000 bootstrap replicates.

Estimation of the maximum number of gained and lost RALFs

To determine the degrees of gene family expansion in the analyzed plant lineages, we divided the phylogeny into ancestral clades (those containing at least one representative of monocots and eudicots), recent clades (monocot specific or eudicot specific) and species- specific clades. Nodes basal to the split among lineages denoted the most recent common ancestor (MRCA) and were labeled as N0 to N4. Gene duplication and loss events were inferred by reconciling the gene tree for each cluster/subcluster with the species tree using Notung v2.5.36

Divergence levels analysis

To analyze positive or negative selection of the RALF sequences, substitution rate ratios of nonsynonymous (K) versus synonymous (K) mutations were calculated. We first identified the closest orthologs for each gene in the genome of the close relative A. lyrata (Fig. S6) and included only those A. thaliana genes that had a single ortholog in A. lyrata. Moreover, gene pairs were considered orthologs when they clearly formed a single subclade. Pairwise alignment of nucleotide sequences of the RALF orthologs was performed using MEGA 5.35 Alignments were performed using Clustal W (Codons). K and K values of the orthologous genes were estimated using K-Estimator 6.0.37 To calculate the K/K ratios in different Groups, the Selecton server38,39 was also used. It implements several evolutionary models that describe, in probabilistic terms, how characters evolve. The models are expressive enough to describe the biological reality. In this study, five models (M8, M8a, M7, M5 and MEC) were used. Each of the models uses different biological assumptions so that different hypotheses can be tested.

Inference of duplication time

Pairwise alignment of nucleotide sequences of the RALF paralogs was performed using MEGA 5.35 Alignments were performed using Clustal W (Codons). The K and K values of the paralogous genes were estimated using K-Estimator 6.0.37 To better explain the patterns of macroevolution, estimates of the evolutionary rates were considered extremely useful. Assuming a molecular clock, the synonymous substitution rates (K) of the paralogous genes will be expected to be similar over time. Thus, K can be used as the proxy for time to estimate the dates of the segmental duplication events. The K value was calculated for each of the gene pairs and then used to calculate the approximate date of the duplication event (T = K/2λ), assuming clock-like rates (λ) of synonymous substitution of 1.5 × 10−8 substitutions/ synonymous site/year for Arabidopsis,40 6.5 × 10−9 for rice and maize41 and 9.1 × 10−9 for poplar42.

Codon bias analysis

Codon bias can reflect the degree of selective constraint in a gene. To measure the extent of codon bias, effective number of codons (ENC) and codon bias index (CBI) were estimated using DnaSP v.5.10.01.43 The ENC values range from 20 to 61, meaning from the maximum codon bias (only one codon is used for each amino acid) to no codon bias (all synonymous codons for each amino acid are equally used).44 The CBI values range from 0 to 1, meaning from uniform use of synonymous codons to maximum codon bias.45 We also estimated some parameters related to codon bias, such as GC1,2 (the GC content at the first and second codon positions), GC3 (the GC content at the third codon positions) using DnaSP v.5.10.01.43

Correlation analysis of expression data and protein subcellular localization

Expression profiling can provide useful clues to gene function. To examine the expression patterns of the RALF genes, a comprehensive expression analysis was performed using the publicly available microarray data from Genevestigator.46,47 For genes with more than one set of probes, the median of expression values was used. Finally, the expression data were gene-wise normalized and hierarchically clustered based on Pearson coefficients with average linkage in the Genesis (version 1.7.6) program.48 Protein subcellular localization was predicted using WoFL PSORT software (http://wolfpsort.org).49

Results and Discussion

Identification, motif organization and phylogenetic analyses of the RALF genes

We identified 33, 23, 16, and 19 putative RALF genes from Arabidopsis, poplar, rice and maize, respectively. Arabidopsis has about doubled the collection of RALF genes than rice, whereas poplar and maize have fewer (30.3% and 42.4%, respectively) genes than Arabidopsis. By searching the PlantGDB (http://www.plantgdb.org),50 we found that the predicted genomes of poplar, rice and maize contain 45,778, 30,192 and 32,540 genes, respectively, which are 67.2%, 10.3% and 18.8% larger than that of Arabidopsis (27,379), respectively. This suggested that the numbers of the RALF genes are not proportional to the sizes of the predicted genomes. All the RALFs in the four species possess only one RALF domain through the CDD51,52 and Pfam (http://pfam.sanger.ac.uk) analysis. While these tools are suitable for defining the presence or absence of recognizable domains, they are unable to recognize smaller individual motifs and more divergent patterns. Thus, we further used the MEME program33 to study the diversification of RALF genes in Arabidopsis, poplar, rice and maize. Six distinct motifs were identified in these genes (Table 1 and Fig. S1). Details of the six motifs were presented in Fig. S2.
Table 1

Number and motif structure of RALF proteins from Arabidopsis (At), poplar (Pt), rice (Os) and maize (Zm).

GroupAtPtOsZmStructure
I15100Motif 4-1/Motif 5-1
II7000Motif 5-6-1
III0032Motif 5-1
IV0401Motif 4-1/Motif 5-1
V1500Motif 5-3-1/Motif 3-1/ Motif 5-3-2-1
VI2000Motif 5-3-1
VII001013Motif 5-2-1/Motif 5-3-2-1/ Motif 5-6-2-1/Motif 3-2
VIII1300Motif 5-3-2-1/Motif 5-2-1/ Motif 2-1
IX1233Motif 5-2-1/Motif 5-3-2-1/ Motif 2-1
X6800Motif 5-3-2-1/Motif 3-2-1/ Motif 2-1/Motif 3

Note: Detailed illustration of the six motif structures are shown in Figure S2.

Phylogenetic analyses can allow us to identify evolutionarily conservative and divergent of gene family. To achieve this goal, phylogenetic analyses of the 91 RALF members were performed. Based on phylogenetic relationships, we divided the RALF members into 10 groups (Fig. S1). Most of these genes encode proteins with the same or similar motif organization, while others are scattered in the families formed by proteins with other motifs, suggesting their complex evolutionary history. For convenience, we categorized the ortholog clades into 3 classes: (i) superstable: clades with orthologs containing at least one representative of monocots and eudicots, (ii) stable: clades including orthologs with monocot specific or eudicot specific, and (iii) unstable: lineage-specific clades. From Figure S1, it was clear that the superstable clade (Group IX) contained similar numbers of genes from each species, suggesting that major expansion/ contraction in gene number had not occurred since the divergence between eudicots (Arabidopsis and poplar) and monocots (rice and maize). This result was also consistent with the number of RALF genes in Selaginella moellendorffii, in which only two RALF genes were found (Table S1). Figure S1 also showed that some genes formed lineage-specific clusters. The largest of such cluster had seven Arabidopsis genes. Moreover, of 16 RALF genes in Group I, 15 genes came from Arabidopsis. All of these suggested that many subsets of the RALF gene family had experienced extensive gene duplications.

Contrasting changes in the numbers of RALF genes

To better understand how RALF genes have evolved in these species, we estimated the number of RALF genes in the MRCA of eudicots and monocots. Reconciliation of the gene trees with the species phylogeny suggested that there were about two ancestral RALF genes in the MRCA of eudicots and monocots (N1). Furthermore, we identified 5 orthologous genes in the eudicots MRCA (N2) and 11 in the MRCA of monocots (N3) (Fig. 1). We also found that the number of RALFs remained relatively stable through evolutionary history from the land plants (N0, Physcomitrella patens) to the vascular plants (N0, Selaginella moellendorffii) and the angiosperms (N1). Only after the separation of the eudicot and monocot species about 145 million years ago53 did RALFs once more expand significantly. When compared the number of ancestral genes with those in the extant species, it appeared that the RALF family had expanded in all the analyzed species. For example, the number of RALFs increased approximately 6.6-fold since the divergence of the various eudicot species from their respective MRCA in Arabidopsis. However, the expansion was uneven between these plant species. For example, there are 33, 23, 16 and 19 genes in Arabidopsis, poplar, rice and maize, respectively, while the estimated number of genes in the MACA of eudicots and monocots are two. Therefore, Arabidopsis, poplar, rice and maize have gained 31, 22, 14 and 17 genes, respectively, since their splits. Only one lost gene is found in poplar. Clearly, the numbers of genes gained in the Arabidopsis lineage are much greater than that in other three lineages.
Figure 1

Evolutionary change in the number of RALF genes in Arabidopsis, poplar, rice and maize.

Notes: The numbers in squares and ellipses represent the maximum numbers of genes in ancestral and extant species (At, Arabidopsis; Pt, poplar; Os, rice; Zm, maize), respectively. The numbers with plus and minus indicate the gene gains and losses, respectively, for each branch. N0, lower land plant ancestor; N1, angiosperm ancestor; N2, eudicot ancestor; N3, monocot ancestor.

Evolutionary patterns of RALF gene family

It has been suggested that the Arabidopsis genome experienced three duplication events within the past 250 million years,54 while the rice genome is believed to have experienced a genome-wide duplication approximately 70 million years ago.55,56 To investigate the relationship between the RALF genes and potential genomic duplications within the genome, the location of the genes in previously identified Arabidopsis and rice chromosomal duplications57,58 was noted. The distributions of the RALF genes relative to the corresponding duplicated genomic blocks were also illustrated in Arabidopsis (Fig. S3) and rice (Fig. S4). This result suggested that the generation of 17 (50.0% of 34) Arabidopsis and 7 (43.7% of 16) rice RALF genes could be due to tandem duplication. In Arabidopsis, the largest RALF gene cluster was located on chromosome 2 and contained four tandemly arrayed members: ie, At2g19020, At2g19030, At2g19040 and At2g19045 (Fig. 2). Phylogenetically, these four genes formed a single sub-clade in Group II, suggesting that they may result from recent tandem duplications. Because Group II also contains genes from other locations (At3g25165 and At3g25170 are located on chromosome 3, whereas At4g13075 is on chromosome 4), these genes may be the result of more ancient duplication events.
Figure 2

Evolution of the one subgroup of Arabidopsis RALF genes. (A) Phylogenetic relationships. (B) Hypothetical origins of seven Arabidopsis RALF genes by tandem duplication, segmental duplication and retroposition.

Notes: The letters S, R and T on the nodes of the phylogenetic tree indicate the positions where segmental duplication, retroposition and tandem duplication have occurred, respectively.

While segmental duplications were not the major factors that led to the expansion of the RALF gene family, it might be that dynamic changes occurred following segmental duplication, leading to loss of many of the genes. In contrast to Arabidopsis and rice, where 50.0% and 43.7%, respectively, of the RALFs were arranged in tandem repeats as described above, considerably fewer RALFs were arranged in tandem repeats in poplar (22.7%) and maize (10.5%), indicating that, in these species, RALFs mainly emerged by mechanisms other than tandem duplication. Next, we also investigated the distributions of the unstable and stable RALFs in Arabidopsis. This result indicated that unstable genes are strongly clustered (about 66.7%), while stable and superstable genes are evenly scattered (or only 38.1% genes clustered) over the chromosomes (Fig. 3). It is clear that the majority of unstable genes in Arabidopsis emerged after the most recent whole genome duplication event.57,59 We also found that, with the exception of Arabidopsis, three other species did not contain unstable genes, indicating divergent expansion of the RALF genes in different higher plants. In summary, our results suggested that after stable evolution of the RALF gene family in Angiosperms that followed the divergence from Tracheophyta (such as, only two RALF genes are identified in Selaginella moellendorffii), dramatic expansion had been largely occurred.
Figure 3

Chromosomal locations of Arabidopsis RALF genes.

Notes: Approximate positions of RALFs are displayed on the respective chromosome. Letters denote evolutionary classification. s, stable; ss, superstable; u, unstable.

In addition, when distantly related species compared, the newly added genes tended to form species-specific clusters or sub-clusters in the eudicots. For example, seven Arabidopsis RALF genes formed the most basal cluster within Group II. In Group IV, four poplar RALF genes also clustered. This suggested that, as F-box genes,53 the RALF genes in different species might have been derived from a series of gene duplication events that occurred after the split of the different lineages. A similar situation was found in the well supported clade of the monocot genes, in which most of the maize and rice genes also formed species-specific clades (such as Groups III and VII, see Fig. S1). The phylogenetic tree topology revealed several pairs of RALF members with a high degree of homology in the terminal nodes of each group, suggesting that they were putative paralogous pairs (Fig. S1). Totally, 13, 7, 6 and 3 pairs of putative paralogous RALF proteins were identified, accounting for more than 78.8%, 60.9%, 63.2% and 37.5% of the entire family in Arabidopsis, poplar, maize and rice, respectively, with sequence identities ranging from 30% to 100% (Table S2). These pairs of RALF members are evolutionarily very closely related, and each pair of genes has very similar structure (Fig. S1), indicating that they originated from duplications. About 38.4% of the paralogous RALF pairs in Arabidopsis have very consistent K values (from 0.66038 to 0.74663), suggesting that the duplication events in this species occurred within the last 22.01 to 24.89 million years. This period was consistent with the time when a recent large-scale genome duplication event was thought to have occurred in Arabidopsis.42,60 We also found that duplication of three of six RALF pairs originated from the recent large-scale duplication events (about 15.4 million years ago) in maize.40 This suggested that, as plant Sm and OPT genes,61,62 the recent genome wide duplication events contributed partially to expansion of the RALFs. In addition, in evolutionary terms, some of these RALF gene duplications appeared to have occurred relatively recently, such as Poptrdraft673738-Poptrdraft672089 (about 1.53 million years ago) and Poptrdraft578381-Poptrdraft578382 (about 0.8 million years ago). It might be associated with novel functional divergence and adaptation. Since codon bias can provides some examples of weak selection at the molecular level. Moreover, several researches have verified that selection on synonymous sites is correlated with stability of mRNA secondary structure, translation efficiency and accuracy, ribosome traffic and protein folding.63–65 We also verified the codon usage bias of RALF genes. Some information is list in Table S3. In which, CBI and ENC were calculated to measure the degree of codon bias. We can see that CBI showed a marked negative correlation with CBI, so, in this study, ENC was used to measure the degree of codon bias. To determine the relative effects of mutation pressure versus natural selection on codon composition, the relationship between GC3 content and GC1,2 content was examined. The result showed a tendency of positive correlation between GC3 and GC1,2, suggesting that the GC content is most likely the result of mutation pressure since natural selection acts differently on different codon position. In addition, we also confirmed that K was positively correlated with K (R2 = 0.655, P < 0.001), and very weakly negatively correlated with ENC and GC3 (but this was not significant) (Fig. S5), implying that codon bias might be a factor in K variation among RALF genes and might be under natural selection.

Different signatures of selection in RALFs

To examine whether RALFs confer adaptational properties, we determined K/K ratios for superstable, stable and unstable genes of A. thaliana with A. lyrata (Fig. S6). K/K ratios of 0.0269 for superstable RALFs (Fig. 4) strongly indicated purifying selective pressures. In contrast to that, unstable and stable genes seemed to be closer to neutral selection, as inferred by significantly higher K/K ratios (0.5257 and 0.4237) for stable genes and unstable genes, respectively (Fig. 4). We also analyzed the selection properties of the RALFs in different Groups. The results showed that the K/K ratios of the sequences from the different Groups were significantly different (Table S4). However, despite the differences in K/K values, all the estimated K/K values were substantially lower than 1, suggesting that the RALF sequences within each of the Groups were under strong purifying selection pressure and that positive selection might have acted on only a few sites during the evolutionary process.
Figure 4

Divergence levels of RALFs (A. thaliana versus A. lyrata).

Notes: Mean K/K ratios of stable (n = 15), unstable (n = 4) and superstable (n = 2) are shown. A. thaliana genes with a single A. lyrata ortholog are included in Figure S6.

Different expression profiles of the RALFs in Arabidopsis

We also examined the expression patterns of the Arabidopsis RALF genes. The results indicated that the divergent expression profiles were present in stable and unstable RALFs across the eight tissues/ developmental stages assessed. Furthermore, the stable genes in different evolutionary branches also displayed different expression patterns (Fig. 5). Whether do duplicated genes have similar expression patterns? To answer this question, we investigated their expression profiles and found that none of the pairs of genes shared similar expression patterns (Fig. 5), indicating that substantial neofunctionalization might have occurred during subsequent evolution of the RALF duplicated genes. It seems that the expression patterns of the paralogs have diverged during long-term evolution, suggesting functional diversification of the duplicated genes.66 Such a process ensures the duplicated genes to increase adaptability to environmental changes, thus conferring a possible evolutionary advantage.67 We also found that over 82% of the assessed genes were likely to be localized in the extracellular space. At2g32885, At2g19030, At2g19040, At1g61563, At1g61566 and At2g19045 have 100% probability of being localized to the extracellular space. For all the other RALFs, although the extracellular space was predicted as the most likely location, it was also possible that they were localized to the membranes of organelles such as the cytosol, vacuolar membrane or chloroplast. Taken together, while the selected RALFs showed similar subcellular localizations, they differed considerably in their expression profiles, indicating that possible functional diversification may be achieved by selection.
Figure 5

Expression profiles of the Arabidopsis RALF genes.

Notes: The dynamic expression profiles were extracted from Genevestigator.46,47 Green, yellow and red evolutionary branches denote stable, unstable and superstable RALFs in Arabidopsis, respectively.

Conclusion

This study explored the evolutionary process of RALF genes by phylogenetic trees, evolutionary patterns, molecular evolutionary rates, different signatures of selection and the expression patterns of RALFs. Tandem duplication rather than segmental duplication played a dominant role in the expansion of the RALF gene family. RALFs were under purifying selection. As well as on the diverse expression patterns of RALF genes and predominant extracellular space localization features of RALF proteins shed light on several key differences in RALF gene family evolution among the four plant species and highlighted the molecular evolution of the RALF gene family. All of these may provide a scaffold for future functional analysis of this family. Phylogenetic relationships and motif composition of RALF genes in Arabidopsis, poplar, rice and maize. Notes: The molecular phylogeny (left panel) was constructed using full length RALF protein sequences from the four species. Numbers associated with branches show bootstrap support values for Neighbor-Joining. The 10 major groups designated from I to X are marked with different colored backgrounds. A schematic representation of conserved motifs (obtained using MEME) in RALF proteins is displayed in the panel on the right. Different motifs are represented by different colored boxes. Details of the individual motifs are in Figure S2. Sequence logo and regular expression of the different motifs identified in the RALF gene family. Chromosomal locations of the Arabidopsis RALF genes. Notes: Letters denote evolutionary classification of RALFs. s, stable; ss, superstable; u, unstable. The lines join the segmental duplicated homologous blocks. Chromosomal locations of the rice RALF genes. Notes: Letters denote evolutionary classification of RALFs. s, stable; ss, superstable; u, unstable. The lines join the segmental duplicated homologous blocks that are indicated using the same colors. The relationships between ENC and CBI, ENC and Ks, ENC and K, K and K, GC1,2 and GC3, K and GC3. NJ tree generated using RALF protein sequences of A. thaliana and A. lyrata. Note: Numbers at branches indicate bootstrap values (1000 replicates). Boxed sequences designate proteins used for K/K ratios: green = stable, yellow = unstable and red = superstable. Number of RALF genes in lower plants. Note: GenBank ID. Pairwise identities and inference of duplication time in paralogous pairs of RALF genes from Arabidopsis, poplar, maize and rice. Codons information for the paralogous pairs of RALF genes list in Table S2. Likelihood values and parameter estimates for the RALF genes.
Table S1

Number of RALF genes in lower plants.

SpeciesNumber of RALFsGene ID*
Selaginella moellendorffii29636436, 9661207
Physcomitrella patens35920213, 5945963, 5942388

Note:

GenBank ID.

Table S2

Pairwise identities and inference of duplication time in paralogous pairs of RALF genes from Arabidopsis, poplar, maize and rice.

Paralogous pairsScoreExpectIdentitiesPositivesGapsKaKsDate (million years ago)
At1g35467-At4g1165387.82.00E-2348/90 (53%)59/90 (66%)4/90 (4%)0.355520.6603822.01
At2g34825-At2g328851225.00E-3458/71 (82%)64/71 (90%)0/71 (0%)0.115080.298879.96
At1g61563-At1g615661288.00E-3663/73 (86%)67/73 (92%)1/73 (1%)0.092390.188126.27
At4g11510-At3g0473577.44.00E-2048/93 (52%)57/93 (61%)14/93 (15%)0.305880.7078223.59
At1g23147-At1g2314568.62.00E-1740/82 (49%)51/82 (62%)16/82 (20%)0.398710.6839722.79
At1g60625-At1g608151204.00E-3355/80 (69%)66/80 (83%)0/80 (0%)0.173950.12314.1
At2g32835-At4g1402026.96.00E-0514/46 (30%)21/46 (46%)0/46 (0%)1.627311.7267657.56
At2g19030-At2g190451156.00E-3270/72 (97%)71/72 (99%)0/72 (0%)0.011420.099643.32
At2g33130-At2g206601082.00E-2962/104 (60%)70/104 (67%)8/104 (8%)0.241310.4662915.54
At2g33775-At1g282701796.00E-5185/107 (79%)94/107 (88%)0/107 (0%)0.150151.0912236.37
At4g13950-At3g238051461.00E-4079/118 (67%)90/118 (76%)5/118 (4%)0.202680.7466324.89
At3g05490-At1g029001291.00E-3569/120 (58%)83/120 (69%)3/120 (3%)0.35851.0779835.93
At4g15800-At3g165701539.00E-4388/137 (64%)98/137 (72%)21/137 (15%)0.1490.7176223.92
Poptrdraft578381-Poptrdraft5783821555.00E-4473/75 (97%)73/75 (97%)0/75 (0%)0.018650.014650.8
Poptrdraft578383-Poptrdraft9396641511.00E-4272/72 (100%)72/72 (100%)0/72 (0%)000
Poptrdraft752809-Poptrdraft5970781469.00E-4182/124 (66%)94/124 (76%)4/124 (3%)0.162830.2775815.25
Poptrdraft174729-Poptrdraft2775821703.00E-4884/84 (100%)84/84 (100%)0/84 (0%)000
Poptrdraft655136-Poptrdraft7162372122.00E-60104/135 (77%)111/135 (82%)5/135 (4%)0.123510.4503424.74
Poptrdraft1069360-Poptrdraft2979531132.00E-3152/59 (88%)57/59 (99%)0/59 (0%)0.056880.197710.86
Poptrdraft673738-Poptrdraft6720891625.00E-4678/83 (94%)79/83 (95%)0/83 (0%)0.027250.027911.53
Grmzm2g383303-Grmzm2 g0883711156.00E-3255/72 (76%)62/72 (86%)0/72 (0%)0.164840.2695220.73
Grmzm2g171394-Grmzm2g02945589.78.00E-2444/49 (90%)46/49 (94%)2/49 (4%)0.364960.4734436.42
Grmzm2g153206-Grmzm2g3016631344.00E-3780/102 (78%)81/102 (79%)11/102 (11%)0.055010.2285717.58
Grmzm2g095039-Grmzm2g0772591644.00E-4694/114 (82%)96/114 (84%)7/114 (6%)0.031130.1998215.37
Grmzm2g095164-Grmzm2g3322591752.00E-49102/121 (84%)104/121 (86%)5/121 (4%)0.05070.2026215.58
Grmzm2g357124-Grmzm2g0562211645.00E-4694/120 (78%)103/120 (86%)3/120 (3%)0.097890.101017.77
Os07g13310.1-Os07g13380.11181.00E-3255/76 (72%)65/76 (86%)0/76 (0%)0.151370.1910714.69
Os01g25540.1-Os02g44940.11122.00E-3077/90 (86%)80/90 (89%)2/90 (2%)0.078530.107958.3
Os01g010147.1-Os04g54090.187.84.00E-2361/113 (54%)63/113 (56%)18/113 (16%)0.215390.4680436
Table S3

Codons information for the paralogous pairs of RALF genes list in Table S2.

Sequence_nameENCCBISChi2GC1,2GC3 sGC3GC
At1g3546737.3360.6890.8880.4230.2120.2440.363
At4g1165333.5670.6360.9790.4390.1820.1980.359
At2g3482543.3840.6870.8280.4990.1520.2190.406
At2g3288561.0000.4800.6820.5140.2500.3010.443
At1g6156359.3830.4800.5510.5060.3880.3860.466
At1g6156651.2700.4970.7160.4670.4110.4340.456
At4g1151048.5050.5430.6600.4990.3290.3720.457
At3g0473548.3360.6410.7750.4390.2320.2740.384
At1g2314744.7720.6220.7960.4790.2050.1980.385
At1g2314542.4690.5470.6870.4850.2930.3470.439
At1g6062542.5060.4520.6320.4030.2690.2930.366
At1g6081536.6930.5490.8350.4030.2780.2930.366
At2g3283555.4380.5300.6050.4480.2420.2710.389
At4g1402053.6260.4290.4940.4550.4810.5090.473
At2g1903056.9050.4530.5670.5350.4860.4790.516
At2g1904549.9110.5220.6470.5410.4860.4520.511
Grmzm2g38330332.6890.7281.1000.5830.9210.9110.692
Grmzm2g08837140.4040.6801.0020.5540.8470.8400.649
Os07g1331049.5070.4680.5260.5160.6170.6360.556
Os07g1338052.3500.5750.6440.5120.7140.7130.579
Poptrdraft57838152.8780.4580.5890.5000.3330.3680.456
Poptrdraft57838251.8170.4460.6430.4870.3480.3950.456
Poptrdraft57838361.0000.4050.4800.4810.3700.3850.449
Poptrdraft93966450.6850.4660.5990.5000.3530.3890.463
Poptrdraft75280951.8490.4590.5230.4320.3780.4000.421
Poptrdraft59707860.0790.4070.4430.4840.3300.3550.441
Poptrdraft17472940.4820.5610.7950.4890.3130.3210.433
Poptrdraft27758240.4820.5610.7950.4890.3130.3210.433
Grmzm2g17139433.4240.7541.1840.5780.9230.9270.694
Grmzm2g02945541.0200.4930.6290.6550.7160.7480.686
Grmzm2g15320629.8120.7471.2700.6390.9320.9240.734
Grmzm2g30166326.8090.8371.5450.6640.9840.9760.768
Grmzm2g09516430.1700.8411.4940.6750.9850.9780.776
Grmzm2g33225929.4690.8491.5590.6690.9780.9710.770
Grmzm2g09503931.3420.7731.3230.6580.9550.9470.754
Grmzm2g07725928.6430.8311.4550.6560.9530.9450.752
Os01g2554030.5910.8101.4380.6660.9570.9570.763
Os02g4494029.8030.8011.3320.6930.9660.9580.781
Poptrdraft65513651.1920.4710.5200.4460.4000.4240.439
Poptrdraft71623745.4750.5370.6170.4630.3750.4070.444
Grmzm2g35712428.6040.8101.3400.6410.9720.9660.749
Grmzm2g05622129.0620.8421.3850.6290.9930.9860.748
Os01g1047033.0920.7731.0980.6190.9640.9590.732
Os04g5409039.1850.6250.8830.6540.8060.8040.704
At2g3377547.1770.5770.6220.4740.6260.6310.526
At1g2827045.1160.5130.6290.4460.4620.4770.456
At4g1395053.5930.4520.5000.4480.5190.5350.477
At3g2380549.8050.4870.5700.4500.4270.4620.454
Poptrdraft29795341.1930.4690.7930.4490.4000.4260.441
Poptrdraft106936044.0570.5270.8360.3960.5160.5520.448
At3g0549042.5560.4960.5750.4960.5690.5920.528
At1g0290056.6270.3730.3780.4760.4530.4710.474
At4g1580058.5320.4070.4090.4880.5610.5730.516
At3g1657055.1770.4540.4460.5580.5670.5680.561
Poptrdraft67373847.5740.5400.6890.3990.4340.4760.425
Poptrdraft67208945.3790.4770.5890.3990.4420.4760.425
At2g3313055.4180.4650.5340.3840.3960.5190.429
At2g2066046.5710.5100.5860.3970.4180.4510.415
Mean ± SE44.410 ± 9.8690.578 ± 0.1390.805 ± 0.3290.514 ± 0.0880.541 ± 0.2670.558 ± 0.2530.529 ± 0.137
Table S4

Likelihood values and parameter estimates for the RALF genes.

Gene branchesModelKa/KsLog-likelihoodPositive selection sites
Group IM80.5206−4162.59Not found
M8a0.4827−4163.25Not found
M70.5138−4162.07Not found
M50.5723−4166.4535,66,69,77,79,90,106
MEC0.6459−4096.2639,43,66,69,77,79,
Group IIM80.6145−1122.757,13,14,19,20,23,24,25,38,49,50,52,53,57,70,73,76
M8a0.4435−1123.65Not found
M70.4398−1123.63Not found
M50.4846−1124.257,52,57,76
MEC0.7122−1115.324,7,13,14,20,23,24,25,30,38,49,52,53,57,61,73,76
Group IIIM80.6031−1137.144,27,28,32,34,51,56,64,72,74,78
M8a0.4139−1136.04Not found
M70.4483−1136.32Not found
M50.4737−113864,78
MEC0.6697−1131.684,10,25,27,28,32,34,36,51,54,55,56,58,64,72,74,77,78
Group IVM80.4432−802.603Not found
M8a0.4913−803.102Not found
M70.4271−802.584Not found
M50.4917−803.795Not found
MEC0.5384−797.17948,58,60,68,72
Group VM80.372−1587.53Not found
M8a0.4232−1588.03Not found
M70.3795−1587.36Not found
M50.4154−1587.738,
MEC0.6176−1580.613,5,6,8,14,17,19,21,22,24,35,38,43,45,54,60,67,72,76,78,86,88,93,101,111,121
Group VIIM80.4258−7178.9Not found
M8a0.3071−7200.06Not found
M7
M50.3663−7211.88Not found
MEC0.4086−7047.7268,71,75,77,
Group VIIIM80.4547−1471.733,14,15,17,18,30,31,36,48,52,55,61,109,112,123
M8a0.2244−1466.05Not found
M70.2281−1466.04Not found
M50.2781−1467.72Not found
MEC0.4315−1472.7816,17,36,52,62,112
Group IXM80.3449−2778.46Not found
M8a0.342−2776.4Not found
M70.3533−2778.6Not found
M50.3557−2779.98Not found
MEC0.4434−2733.0125,31,37,38,39,40,43,44,81,84
Group XM80.3125−3496.94Not found
M8a0.2881−3500.02Not found
M70.3155−3500.78Not found
M50.3206−3504.9Not found
MEC0.3524−3441.277,9,10,23,25
  66 in total

1.  Ancestral genome duplication in rice.

Authors:  Romain Guyot; Beat Keller
Journal:  Genome       Date:  2004-06       Impact factor: 2.166

2.  Characterization of five RALF-like genes from Solanum chacoense provides support for a developmental role in plants.

Authors:  Hugo Germain; Eric Chevalier; Sébastien Caron; Daniel P Matton
Journal:  Planta       Date:  2004-08-04       Impact factor: 4.116

3.  Selecton: a server for detecting evolutionary forces at a single amino-acid site.

Authors:  Adi Doron-Faigenboim; Adi Stern; Itay Mayrose; Eran Bacharach; Tal Pupko
Journal:  Bioinformatics       Date:  2005-01-12       Impact factor: 6.937

4.  Evolution of F-box genes in plants: different modes of sequence divergence and their relationships with functional diversification.

Authors:  Guixia Xu; Hong Ma; Masatoshi Nei; Hongzhi Kong
Journal:  Proc Natl Acad Sci U S A       Date:  2009-01-06       Impact factor: 11.205

5.  Signaling of cell fate decisions by CLAVATA3 in Arabidopsis shoot meristems.

Authors:  J C Fletcher; U Brand; M P Running; R Simon; E M Meyerowitz
Journal:  Science       Date:  1999-03-19       Impact factor: 47.728

6.  Molecular evolution and selection patterns of plant F-box proteins with C-terminal kelch repeats.

Authors:  Nadine Schumann; Aura Navarro-Quezada; Kristian Ullrich; Carsten Kuhl; Marcel Quint
Journal:  Plant Physiol       Date:  2010-11-30       Impact factor: 8.340

7.  Regulation and processing of a plant peptide hormone, AtRALF23, in Arabidopsis.

Authors:  Renu Srivastava; Jian-Xiang Liu; Hongqing Guo; Yanhai Yin; Stephen H Howell
Journal:  Plant J       Date:  2009-05-18       Impact factor: 6.417

Review 8.  Post-translational modifications in secreted peptide hormones in plants.

Authors:  Yoshikatsu Matsubayashi
Journal:  Plant Cell Physiol       Date:  2010-11-11       Impact factor: 4.927

9.  CDD: specific functional annotation with the Conserved Domain Database.

Authors:  Aron Marchler-Bauer; John B Anderson; Farideh Chitsaz; Myra K Derbyshire; Carol DeWeese-Scott; Jessica H Fong; Lewis Y Geer; Renata C Geer; Noreen R Gonzales; Marc Gwadz; Siqian He; David I Hurwitz; John D Jackson; Zhaoxi Ke; Christopher J Lanczycki; Cynthia A Liebert; Chunlei Liu; Fu Lu; Shennan Lu; Gabriele H Marchler; Mikhail Mullokandov; James S Song; Asba Tasneem; Narmada Thanki; Roxanne A Yamashita; Dachuan Zhang; Naigong Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2008-11-04       Impact factor: 16.971

10.  Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach.

Authors:  Adi Stern; Adi Doron-Faigenboim; Elana Erez; Eric Martz; Eran Bacharach; Tal Pupko
Journal:  Nucleic Acids Res       Date:  2007-06-22       Impact factor: 16.971

View more
  27 in total

1.  Genome-wide identification and comparative expression analysis of LEA genes in watermelon and melon genomes.

Authors:  Yasemin Celik Altunoglu; Mehmet Cengiz Baloglu; Pinar Baloglu; Esra Nurten Yer; Sibel Kara
Journal:  Physiol Mol Biol Plants       Date:  2017-01-06

Review 2.  Twenty Years of Progress in Physiological and Biochemical Investigation of RALF Peptides.

Authors:  Matthew R Blackburn; Miyoshi Haruta; Daniel S Moura
Journal:  Plant Physiol       Date:  2020-02-18       Impact factor: 8.340

3.  Perception of Damaged Self in Plants.

Authors:  Qi Li; Chenggang Wang; Zhonglin Mou
Journal:  Plant Physiol       Date:  2020-01-06       Impact factor: 8.340

4.  Arabidopsis thaliana rapid alkalinization factor 1-mediated root growth inhibition is dependent on calmodulin-like protein 38.

Authors:  Wellington F Campos; Keini Dressano; Paulo H O Ceciliato; Juan Carlos Guerrero-Abad; Aparecida Leonir Silva; Celso S Fiori; Amanda Morato do Canto; Tábata Bergonci; Lucas A N Claus; Marcio C Silva-Filho; Daniel S Moura
Journal:  J Biol Chem       Date:  2017-12-27       Impact factor: 5.157

5.  RALF1 peptide triggers biphasic root growth inhibition upstream of auxin biosynthesis.

Authors:  Lanxin Li; Huihuang Chen; Saqer S Alotaibi; Aleš Pěnčík; Maciek Adamowski; Ondřej Novák; Jiří Friml
Journal:  Proc Natl Acad Sci U S A       Date:  2022-07-25       Impact factor: 12.779

6.  Identification and phylogenetic analysis of late embryogenesis abundant proteins family in tomato (Solanum lycopersicum).

Authors:  Jun Cao; Xiang Li
Journal:  Planta       Date:  2014-12-10       Impact factor: 4.116

7.  RNA-seq of Arabidopsis pollen uncovers novel transcription and alternative splicing.

Authors:  Ann E Loraine; Sheila McCormick; April Estrada; Ketan Patel; Peng Qin
Journal:  Plant Physiol       Date:  2013-04-16       Impact factor: 8.340

8.  Lipid droplet-associated proteins (LDAPs) are involved in the compartmentalization of lipophilic compounds in plant cells.

Authors:  Satinder K Gidda; Samantha Watt; Jillian Collins-Silva; Aruna Kilaru; Vincent Arondel; Olga Yurchenko; Patrick J Horn; Christopher N James; David Shintani; John B Ohlrogge; Kent D Chapman; Robert T Mullen; John M Dyer
Journal:  Plant Signal Behav       Date:  2013-12-04

Review 9.  The Plant Peptidome: An Expanding Repertoire of Structural Features and Biological Functions.

Authors:  Patrizia Tavormina; Barbara De Coninck; Natalia Nikonorova; Ive De Smet; Bruno P A Cammue
Journal:  Plant Cell       Date:  2015-08-14       Impact factor: 11.277

10.  The pectin lyases in Arabidopsis thaliana: evolution, selection and expression profiles.

Authors:  Jun Cao
Journal:  PLoS One       Date:  2012-10-09       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.