Literature DB >> 20423940

Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.).

Haoli Ma1, Jie Zhao.   

Abstract

Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20423940      PMCID: PMC2882264          DOI: 10.1093/jxb/erq104

Source DB:  PubMed          Journal:  J Exp Bot        ISSN: 0022-0957            Impact factor:   6.992


Introduction

Proteoglycans or glycoproteins are basal components of the cell wall implicated in various processes of plant growth and development throughout the plant kingdom. A large number of these proteins are rich in proline (Pro) or hydroxyproline (Hyp) and are named Pro-rich/Hyp-rich glycoproteins (P/HRGPs). The P/HRGPs share common features that consist of regions containing Hyp residues, which are called ‘glycomodules’ and in which most Hyp residues are usually glycosylated by a large branched arabinogalactan (AG) polysaccharide or by small non-branched arabinooligosaccharides (arabinosides). According to the ‘Hyp contiguity hypothesis’, contiguous and non-contiguous clustered Hyp residues are the sites attached by arabinoside and AG polysaccharide, respectively. Evidence for the ‘Hyp contiguity hypothesis’ was obtained using synthetic genes. In synthetic HRGPs composed entirely of (Ser-Hyp-Ser-Hyp) or (Ala-Hyp-Ala-Hyp) repeats, 100% of the Hyp residues have AG polysaccharides (Shpak , 2001), but Hyp that occurs singly is rarely glycosylated by AG polysaccharide. Short arabinosides are added to Hyp residues in (Ser/Thr)-Hyp-Hyp-Hyp extensin-like motifs and large arabinogalactan chains are added to the non-contiguous Hyp residues in Ser-Hyp-Thr-Hyp-Thr AGP-like motifs (Goodrum ). The P/HRGP superfamily is classified into three subfamilies distinguished by characteristic repetitive structural motifs yielding different degrees of O-glycosylation: the minimally glycosylated Pro-rich proteins (PRPs), the moderately glycosylated extensins, and the highly glycosylated arabinogalactan proteins (AGPs) (Schultz ). Using monoclonal antibodies (JIM4, JIM13, LM2, etc.) and a synthetic chemical reagent called β-glucosyl Yariv reagent (β-GlcY) that specifically binds to AGPs and blocks their normal functions, many studies have indicated that AGPs are implicated in various aspects of plant growth and development, including cell proliferation (Serpe and Nothnagel, 1994; Langan and Nothnagel, 1997; Yang ), cell expansion (Willats and Knox, 1996), programmed cell death (Gao and Showalter, 1999), pollen tube growth (Wu ; Qin ), xylem differentiation (Gao and Showalter, 2000; Motose a, b; Zhang ), somatic embryogenesis (van Hengel ), zygotic division, and embryo development (Hu ; Qin and Zhao, 2006, 2007). Reverse genetics approaches such as knockout mutants and RNA interference (RNAi) represent powerful means to investigate functions of individual AGPs. Molecular functions of AGPs have mainly been studied in Arabidopsis thaliana. AtAGP6 and AtAGP11 are two homologous Arabidopsis genes encoding classical AGPs specifically expressed in stamens, pollen grains, and pollen tubes. Loss of function of AtAGP6 and AtAGP11 leads to reduced fertility, indicating that AtAGP6 and AtAGP11 play an important role in pollen tube growth and stamen function (Levitin ). There are three Lys-rich AGPs in Arabidopsis (Sun ). The mutant of AtAGP17 with one insertion in its promoter, named rat1 (resistance to Agrobacterium), is defective in binding of Agrobacterium to its roots (Nam ; Gaspar ). In AtAGP18 RNAi plants, functional megaspores fail to enlarge and divide, resulting in the abortion of ovules and reduction of seeds, which proves that AtAGP18 is essential for female gametogenesis (Acosta-Garcia and Vielle-Calzada, 2004). In the atagp19 mutant, cell division and expansion are defective, indicating that AtAGP19 has functions in various aspects of plant growth and development (Yang ). The mutant of AtFLA4, also called sos5 (salt overly sensitive5), results in abnormal cell expansion, thinner cell walls, and increased sensitivity to salt (Shi ). The double mutant of two xylogen homologues AtXYP1 and AtXYP2 shows defective vascular development. Moreover, xylogen was recognized as ‘chimeric’ AGP, as indicated by its large molecular weight range that exceeds its predicted peptide mass, and its reactivity to β-GlcY reagent and JIM13 antibody (Motose ). AtAGP30, a non-classical AGP containing six cysteines in the C-terminus, is required for root regeneration and seed germination (van Hengel and Roberts, 2003). RNAi plants of PpAGP1 in Physcomitrella patens resulted in cell length reduction (Lee ). CsAGP1 in cucumber (Cucumis sativus) is responsive to gibberellin (GA), and overexpression of CsAGP1 results in a taller stature and earlier flowering than the wild-type plants (Park ). LeAGP1 from tomato (Solanum lycopersicum, formerly Lycopersicon esculentum) is up-regulated by cytokinin, and its overexpressing transgenic lines have cytokinin-overexpressing phenotypes (Sun ). By using the RNAi technique and attAGP-targeted virus-induced gene silencing, it was proved that the expression of tomato attAGP is induced by infection with Cuscuta reflexa, which promotes the parasite's adherence (Albert ). Based on the structure of AGP protein backbones, they have also been divided into several subclasses: classical AGPs, Lys-rich AGPs, AG peptides, fasciclin-like AGPs (FLAs), non-classical AGPs, and ‘chimeric’ AGPs (Gaspar ; Schultz ). Previously, 14 classical AGPs, three Lys-rich AGPs, 13 AG peptides, 21 FLAs, two non-classical AGPs, and several ‘chimeric’ AGPs had been identified in Arabidopsis (Schultz , 2004; Borner ; van Hengel and Roberts, 2003; Liu and Mehdy, 2007). Recently, 22 early nodulin-like proteins (ENODLs) have been identified in Arabidopsis; 18 of them are regarded as early nodulin-like (eNod-like) AGPs (Mashiguchi ). Most AGPs were identified in Arabidopsis based on biased amino acid composition analysis and a hidden Markov model built from the alignment of 88 fasciclin domains (Schultz ; Johnson ). Completion of the rice genome (International Rice Genome Sequencing Project, 2004) makes it possible to identify AGP-encoding genes from rice, while most genes are annotated as hypothetical. Although extensive studies had revealed the roles of some AGPs and the underlying mechanisms in Arabidopsis, no AGP-encoding gene was functionally analysed in rice. Therefore, there is an urgent need for a thorough bioinformatic analysis and characterization of AGPs in rice genomes. Previously, 33 and 21 FLAs had been identified in wheat (Triticum aestivum) and rice, respectively (Faik ). In this study, the predicted AGP-encoding genes were searched in the rice genome and a phylogenetic analysis was conducted. Furthermore, publicly available resources such as microarrays were evaluated to help in the selection of specific experiments for target genes. The expression patterns of selected target genes in rice organs and tissues at different developmental stages, and the regulation of their expression under abiotic stresses and phytohormone treatments were also examined. The studies indicate important physiological functions of AGPs and can be a base and guide for research into the AGP gene family in rice.

Materials and methods

Plant materials and treatment methods

The rice (Oryza sativa L. japonica cv. Nipponbare) plants were grown in a greenhouse at Wuhan University. Rice seeds were soaked in sterile water for 2 d, and then transferred to containers with sponges as supporting materials in sterile water; this is the first day of the seedling age. Plant materials for expression pattern analysis were: (i) 14-day-old root (YR, young root) and leaf (YL, young leaf); (ii) panicle <3 cm (P1), 5 cm (P2), 10 cm (P3), 15 cm (P4), 20 cm (P5), and 28 cm (P6); (iii) anther (An), and stigma and ovary (SO) from a 28 cm panicle; (iv) ovaries from 1 DAP (day after pollination) (O1) and 3 DAP (O2) seeds; and (v) embryos from 5 DAP (E1), 10 DAP (E2), and 30 DAP (E3) seeds. The temperature for plant growth was 30/25 °C under a photoperiod of 16 h light and 8 h dark. For hormone treatments, 14-day-old seedlings were transferred into containers and treated with deionized water, 25 μM abscisic acid (ABA), and 10 μM GA for 3–24 h. For stress treatments, the 14-day-old seedlings were carefully transferred onto filter papers in a 28 °C illumination incubator as drought stress, placed in 400 mM NaCl solution in a 28 °C illumination incubator as salt stress, and kept at 4 °C as low temperature stress, for 3–24 h, as described previously (Yuan ). All materials were taken and quickly frozen in liquid nitrogen, and stored at –80 °C until RNA extraction.

Finding AGPs based on biased amino acid composition, length, and conversed domain

A Perl script (amino acid bias) was downloaded to calculate the PAST (Pro, Ala, Ser, Thr) percentage for all of the proteins in rice (http://www.agwine.adelaide.edu.au/plant/cgb/pcf/bio/). All of the rice proteins (.pep files) were downloaded from RGAP (Rice Genome Annotation Project) annotation release 5 (http://rice.plantbiology.msu.edu/data_download.shtml) and RAP-DB (Rice Annotation Project Database) annotation release 2 (http://rapdb.dna.affrc.go.jp/download/index.html). The Perl program is available as free software for UNIX, Windows, and Mac operating systems (http://www.activeperl.com/). For Windows XP, ActivePerl-5.6.1.638 was used. The protein bias program generated two different lists of proteins. The output files of the Perl script are text files that include ‘long’ proteins >75 amino acid residues in length and above a certain PAST threshold (50% or 55%), and ‘short’ proteins between 55 and 75 amino acid residues in length and below a certain PAST percentage (35% or 40%). Additional output files that contained the first 50 amino acid residues were generated for both the ‘long’ and the ‘short’ proteins. To be AG modified, the protein has to enter the endoplasmic reticulum in eukaryotes. Typically, the existence of a signal peptide is necessary. Therefore, the additional output files were subjected to SignalP 3.0 (http://www.cbs.dtu.dk/services/SignalP/) to check for the presence of N-terminal signal sequences. To check for the presence of a C-terminal GPI (glycosylphosphatidylinositol) additional signal, both the ‘long’ and ‘short’ files were subjected to the big-PI Plant Predictor website: GPI Modification Site Prediction in Plants (http://mendel.imp.ac.at/gpi/plant_server.html) (Eisenhaber ). The search criteria for rice AGPs were as follows. (i) The proteins were classified as AGPs if they contained predominantly Ala-Pro, Ser-Pro, or Thr-Pro throughout the protein with no more than 11 amino acid residues between consecutive Pro residues, but did not contain repeats associated with extensins or PRPs (e.g. Ser-Pro4 or Pro-Pro-Xaa-Yaa-Lys) (Schultz ). (ii) The exceptions were the Lys-rich AGPs that were a subclass of classical AGPs. These AGPs have a Lys-rich domain of ∼16 amino acid residues that separate AGP glycomodules. (iii) AGPs were defined as non-classical AGPs if they contained an AGP-like region and other atypical regions except N- and C-terminal signals. (iv) The proteins were classified as AG peptides if their encoded protein backbone was between 55 and 75 residues in length and had XP (X=Ala, Ser, Thr) motifs. (v) The proteins were classified as nsLTP-like AGPs and eNod-like AGPs if they contained a conserved domain such as a non-specific lipid transfer (nsLTP)-like domain and eNod-like domain which was detected by ‘Search Conserved Domains on a protein’ at NCBI (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). (vi) The proteins were classified as extensins if they contain Ser-Pro3 and/or Ser-Pro4 repeats which were mostly separated by Tyr, Lys, and Val residues. (vii) The proteins were classified as PRPs if they were rich in proline and possessed none of the characteristics mentioned above. (viii) Several prediction algorithms (http://www.ebi.ac.uk/interpro/, IPR000782, http://pfam.sanger.ac.uk/, PF02469, and http://smart.embl-heidelberg.de/, SM00554) were used for finding FLAs.

Chromosomal localization and gene duplications

The chromosomal distributions of AGP-encoding genes were determined by searching the physical positions of their corresponding locus numbers in the TIGR database. Physical maps of each japonica rice chromosome were downloaded from the IRGSP (International Rice Genome Sequencing Project) (http://rgp.dna.affrc.go.jp/IRGSP/download). BACs (bacterial artificial chromosomes) or PACs (bacteriophage P1 artificial chromosomes) containing AGP-encoding genes were searched. All the sequenced contigs of japonica cv Nipponbare were physically constructed as pseudomolecules by IRGSP (http://rgp.dna.affrc.go.jp/IRGSP), representing 12 rice chromosomes, and are available in GenBank (accession nos AP008207–AP008218). Each of the AGP-encoding genes was positioned on these rice chromosome pseudomolecules by BLASTN search. Genes separated by ≤5 genes were considered to be tandem duplicates. Genes belonging to segmental duplicates were detected by searching the ‘Segmental genome duplication of rice’ in the TIGR database (http://rice.plantbiology.msu.edu/segmental_dup/index.shtml).

Alignment and phylogenetic analysis

Multiple sequence alignments of amino acid sequences were generated using ClustalX (1.83) and were manually corrected. The obtained sequence alignments were used as input to construct phylogenetic trees with the Neighbor–Joining algorithm within MEGA 4.0 (Tamura ). Bootstrapping was performed 10 000 times to obtain support values for each branch. Branches corresponding to partitions reproduced in <50% bootstrap replicates were collapsed.

Digital expression analysis: EST expression profile, microarrays, and MPSS tags

EST (expressed sequence tag) expression profiles were obtained from UniGene at NCBI websites. The total numbers of ESTs were: callus, 164 803; flower, 136 501; leaf, 171 750; panicle, 138 119; root, 68 198; seed, 32 358; stem, 126 877; and vegetative meristem, 4594 (Supplementary Table S3 available at JXB online). Genes were defined as specifically expressed in one tissue if the ESTs of any tissue contributed more than half of the EST frequency. The results of rice microarrays are available in the Gene Expression Omnibus database at the NCBI website (http://www.ncbi.nlm.nih.gov/geo/) and the Rice Functional Genomic Express Database (http://signal.salk.edu/cgi-bin/RiceGE). For temporal and spatial expression analysis (GSE6893), different stages of panicle and seed development were categorized according to panicle length and days after pollination, respectively, based on landmark developmental events as follows. (i) Panicle: up to 0.5 mm, shoot apical meristem (SAM); 0–3 cm, floral transition and floral organ development (P1); 3–5 cm and 5–10 cm, meiotic stage (P2 and P3); 10–15 cm, young microspore stage (P4); 15–22 cm, vacuolated pollen stage (P5); 22–30 cm, mature pollen stage (P6). (ii) Seeds: 0–2 DAP, early globular embryo (S1); 3–4 DAP, middle and late globular embryo (S2); 5–10 DAP, embryo morphogenesis (S3); 11–20 DAP, embryo maturation (S4); 21–29 DAP, dormancy and desiccation tolerance (S5) (Itoh , Jain ). For abiotic stress analysis (GSE6901), rice seedlings were transferred to a beaker containing 200 mM NaCl solution for salt stress, dried between folds of tissue paper at 28±1 °C in a culture room for drought stress, and kept at 4±1 °C for cold stress, for 3 h (Jain ). For hormone response (ABA and GA; GSE661), the callus which had been cultured for 30 d was transferred to a medium containing either 50 mM ABA or 50 mM GA. The absolute signal values of Arabidopsis AGP-encoding genes were downloaded using ‘Bulk Gene Download’ at Nottingham Arabidopsis Stock Centre's microarray database, and the results of developmental stages (GSE5629–5633) and stress treatments (GSE5620–5621 and 5623–5624) were used to analyse the expression of AGP-encoding genes in Arabidopsis (http://affymetrix.arabidopsis.info/narrays/help/psp-wbubn.html). In order to make these absolute signal values suitable for cluster display, the absolute values were divided by the average of all absolute values, and the logarithmic values of the ratios in previous step were used as input for cluster display (Supplementary Table S5 at JXB online). Hierarchical cluster displays were generated from the logarithmic values of all AGP-encoding genes, using Cluster and Treeview (Eisen ). MPSS (massively parallel signature sequencing) tags of rice AGPs were obtained from the MPSS project (http://mpss.udel.edu/rice/) mapped to TIGR gene models (Nobuta ). The signature was considered to be significant if it uniquely identifies an individual gene and shows a perfect match (100% identity over the tag length). The normalized abundance (tpm, tags per million) of these signatures for a given gene in a given library represents a quantitative estimate of expression of that gene. MPSS expression data for 17-base and 20-base signatures representing 12 different organs and tissues or treatments of rice were used for the analysis. The description of these organs and tissues is: NCA, 35 d callus; NGD, 10 d germinating seedlings grown in the dark; NGS, 3 d germinating seed; NIP, 90 d immature panicle; NME, 60 d meristem tissue; NOS, ovary and mature stigma; NPO, mature pollen; NST, 60 d stem; NYL, 14 d young leaves; and NYR, 14 d young roots (Supplementary Table S4 at JXB online).

Real-time PCR analysis

To confirm the expression of representative AGP-encoding genes in rice organs and tissues at different developmental stages, and stress and hormone reactions from microarray data, real-time PCR analysis was performed using gene-specific primers (Supplementary Table S1 at JXB online). At least two independent biological replicates and three technical replicates of each biological replicate were made for real-time PCR analysis. The first-strand cDNA was synthesized from 1 μg of total RNA using reverse transcriptase (ReverTra Ace, TOYOBO). Real-time PCR was carried out by SYBR-green fluorescence using a Rotor-Gene 6000 real-time PCR machine (Corbett Research). The real-time quantitative PCR was performed on equal amounts of cDNA which were prepared from the various materials. The expression of each gene in different RNA samples was normalized to the expression of an internal gene, UBQ5 (Jain ). The relative expression levels were analysed as described previously (Yuan ).

Results

Identification of 69 AGP-encoding genes in the rice genome

The AGP gene family is generally classified into classical AGPs, Lys-rich AGPs, AG peptides, FLAs, non-classical AGPs, and ‘chimeric’ AGPs (Gaspar ; Schultz ). In order to identify classical AGPs, AG peptides, and ‘chimeric’ AGPs, the method derived from Arabidopsis was mainly adopted by calculating the proportion of PAST (Schultz ). First, text files that contained all protein sequences were downloaded from RGAP and RAP-DB, and an amino acid bias Perl script (http://agwine.adelaide.edu.au/plant/cgb/pcf/bio/) was used to calculate the PAST percentage of proteins. All the proteins >55% PAST were selected for further analysis, and 77 and 103 PAST-rich proteins were identified in RGAP and RAP-DB, respectively (Table 1 and Fig. 1). AGP protein backbones are expected to have an N-terminal secretion signal for targeting to the endoplasmic reticulum where glycosylation occurs, and the addition of a GPI anchor may also take place. Therefore, all PAST-rich proteins were scanned for the presence of N-terminal secretion signals using SignalP 3.0. Only 44 and 36 PAST-rich sequences were predicted to be secreted in RGAP and RAP-DB, respectively (Table 1). Not all known AGPs were identified using 55% PAST for Arabidopsis (Schultz ), so, as previously done for Arabidopsis, the threshold was reduced to 50% PAST, and 246 and 271 proteins were identified, of which 128 and 99 were predicted to be secreted (Table 1 and Fig. 1). The procedure used to identify AG peptides was similar to the one used by Schultz ; proteins between 50 and 75 amino acid residues in length were searched with a 35% PAST. A total of 127 and 42 proteins that are predicted to be secreted were found in RGAP and RAP-DB, respectively (Table 1 and Fig. 1).
Table 1.

AGPs predicted from the rice genome based on the biased amino acid composition and length

DatabaseaPASTbTotalSignal peptidecAGPsAG peptidesExtensinsPRPsOthersd
RGAP>55%77441134917
>50%24612819471286
>40% (50–75)16176ND102ND64
>35% (50–75)346127ND112ND114
RAP-DB>55%10336943911
>50%2719917571357
>40% (50–75)8527ND92ND16
>35% (50–75)15342ND102ND30

RGAP, Rice Genome Annotation Project; RAP-DB, Rice Annotation Project Database.

PAST, the proportion of Pro, Ala, Ser, and Thr.

N-terminal signal peptides were determined by SignalP (Nielsen ).

Others, proteins rich in Ala, Ser, and Thr. ND, not determined.

Fig. 1.

The workflow and parameters of AGP identification and data mining. 1, RGAP, Rice Genome Annotation Project; 2, RAP-DB, Rice Annotation Project Database; 3, PF02469 at http://pfam.sanger.ac.uk/; 4, SM00554 at http://smart.embl-heidelberg.de/; 5, IPR000782 at http://www.ebi.ac.uk/interpro/; 6, run a Perl script that divides proteins into several parts according to their amino acid composition; 7, aa, the length of candidate proteins in amino acids; 8, signal peptide prediction, http://www.cbs.dtu.dk/services/SignalP/; 9, the criteria to separate different kinds of HRGPs are detailed in Materials and methods; 10, GPI anchor signal prediction, http://mendel.imp.ac.at/gpi/plant_server.html; 11, EST expression profiles from UniGene at http://www.ncbi.nlm.nih.gov/unigene/; 12, MPSS tags, http://mpss.udel.edu/rice/; 13, the absolute signal values were downloaded at http://signal.salk.edu/cgi-bin/RiceGE; 14, GSE6893, expression at various developmental stages; 15, GSE661, expression under ABA and GA treatments; 16, GSE6901, expression under abiotic stresses treatments.

AGPs predicted from the rice genome based on the biased amino acid composition and length RGAP, Rice Genome Annotation Project; RAP-DB, Rice Annotation Project Database. PAST, the proportion of Pro, Ala, Ser, and Thr. N-terminal signal peptides were determined by SignalP (Nielsen ). Others, proteins rich in Ala, Ser, and Thr. ND, not determined. The workflow and parameters of AGP identification and data mining. 1, RGAP, Rice Genome Annotation Project; 2, RAP-DB, Rice Annotation Project Database; 3, PF02469 at http://pfam.sanger.ac.uk/; 4, SM00554 at http://smart.embl-heidelberg.de/; 5, IPR000782 at http://www.ebi.ac.uk/interpro/; 6, run a Perl script that divides proteins into several parts according to their amino acid composition; 7, aa, the length of candidate proteins in amino acids; 8, signal peptide prediction, http://www.cbs.dtu.dk/services/SignalP/; 9, the criteria to separate different kinds of HRGPs are detailed in Materials and methods; 10, GPI anchor signal prediction, http://mendel.imp.ac.at/gpi/plant_server.html; 11, EST expression profiles from UniGene at http://www.ncbi.nlm.nih.gov/unigene/; 12, MPSS tags, http://mpss.udel.edu/rice/; 13, the absolute signal values were downloaded at http://signal.salk.edu/cgi-bin/RiceGE; 14, GSE6893, expression at various developmental stages; 15, GSE661, expression under ABA and GA treatments; 16, GSE6901, expression under abiotic stresses treatments. All PAST-rich proteins used for final analysis were divided into AGPs, AG peptides, extensins, PRPs, and others according to the features of their backbones. Based on the criteria listed in the Materials and methods, there are 19 AGPs (classical AGPs, Lys-rich AGPs, non-classical AGPs, and ‘chimeric’ AGPs) and 11 AG peptides identified in TIGR, and 17 AGPs and 10 AG peptides in RAP-DB. The nsLTP-like AGPs and eNod-like AGPs were regarded as ‘chimeric’ AGPs before, but organized as two subfamilies of AGPs in the present study. After removing the redundancy, 13 classical AGPs, 15 AG peptides, three non-classical AGPs, three eNod-like AGPs, and eight nsLTP-like AGPs were identified in rice (Table 2). Whereas the length of OsAGP5 is 298 amino acids, much longer than other classical AGPs, OsAGP10 has two Ser-rich domains that block AGP glycomodules. Usually, the length of AG peptides is between 55 and 75 residues. One exception is AGP26, which contains 77 amino acids. OsAGP28 is an AG peptide that is not detected in RGAP and RAP-DB, but was identified by Mashiguchi .
Table 2.

AGP-encoding genes identified in the rice genome

GeneaSubfamilybRGAP locuscRAP-DB locusdChromosome locationeSize (aa)fGPIgEvidenceh
OsAGP1CNAOs08g0482300Chr8:23845140, 23845908146YA B C D E
OsAGP2CLOC_Os01g71170Os01g0937900Chr1:41178390, 41179324158YB D E
OsAGP3CNAOs03g0188500Chr3: 4633965, 4634593124YA B C D E
OsAGP4CLOC_Os04g37770NAChr4:22266907, 22267580131YC D
OsAGP5CNAOs04g0545250Chr4:27141445, 27142670298YA B C
OsAGP6CLOC_Os05g37110Os05g0443200Chr5:21629021, 21629428135YC
OsAGP7CLOC_Os06g17450Os06g0284900Chr6:10125749, 10126413128YB C E
OsAGP8CLOC_Os07g28660Os07g0469800Chr7:16774201, 16774828179YC
OsAGP9CNAOs10g0377150Chr10:12281284, 12281703139YNA
OsAGP10CLOC_Os08g38250Os08g0490400Chr8:24242332, 24242931199YC E
OsAGP11CLOC_Os04g41480Os04g0492200Chr4:24404213, 24404920235YC E
OsAGP12KCLOC_Os01g02010Os01g0110200Chr1:541987, 543510240YA B C E
OsAGP13KCLOC_Os07g40130Os07g0590800Chr7:24077193, 24079058230YA B C D E
OsAGP14PepLOC_Os01g37950Os01g0559900Chr1:21260852, 2126134770NA B C E
OsAGP15PepLOC_Os01g40950Os01g0592500Chr1:23166562, 2316728172NA B C D E
OsAGP16PepOs01g46850Os01g0657000Chr1:26721684, 2672387569YA B C E
OsAGP17PepOs01g55220Os01g0756900Chr1:31765586, 3176579870YA B C E
OsAGP18PepLOC_Os01g57030Os01g0778300Chr1:32943973, 3294450868YB C D
OsAGP19PepLOC_Os01g57040Os01g0778400Chr1:32945415, 3294589668YC
OsAGP20PepLOC_Os02g16500Os02g0264800Chr2:9417103, 941863173NA B C E
OsAGP21PepLOC_Os02g48710Os02g0718600Chr2:29807343, 2980788669YA B C D E
OsAGP22PepNAOs02g0718100Chr2:29799080, 2979967660YA B C D E
OsAGP23PepLOC_Os05g12580Os05g0217000Chr5:7219499, 721983767YA B C E
OsAGP24PepLOC_Os06g21400Os06g0318800Chr6:12358337, 1235902165YA B C D E
OsAGP25PepLOC_Os06g21410Os06g0318900Chr6:12365595, 1236716174NC D E
OsAGP26PepLOC_Os06g30920Os06g0505700Chr6:17983456, 1798384977YA B C E
OsAGP27PepLOC_Os07g38630Os07g0573900Chr7:23201833, 2320231964YB C D E
OsAGP28iPepNANAChr5:26814680, 2681509463YA
OsAGP29NCLOC_Os01g42210Os01g0607100Chr1:23918314, 23920085221YC E
OsAGP30NCLOC_Os02g10240Os02g0196100Chr2:5364379, 5366226229NA B C E
OsAGP31NCLOC_Os03g63540Os03g0852400Chr3:35888283, 35889672219NA B C D E
OsELA1eNodLOC_Os01g54430Os01g0748150Chr1:31316996, 31318114237YA B D E
OsELA2eNodLOC_Os02g06670Os02g0162200Chr2:3353562, 3354820261YC E
OsELA3eNodLOC_Os06g46740Os06g0681200Chr6:28387298, 28388576279YA B C E
OsLLA1nsLTP ILOC_Os03g26820Os03g0385400Chr3:15306262, 15308462178YA B C D E
OsLLA2nsLTP ILOC_Os03g46180Os03g0664600Chr3:26112985, 26114326228YC D E
OsLLA3nsLTP IILOC_Os04g52250Os04g0612300Chr4:30866056, 30866925202NB C
OsLLA4nsLTP IILOC_Os06g46780Os06g0681500Chr6:28405288, 28406220268NC
OsLLA5nsLTP IILOC_Os06g46870Os06g0682750Chr6:28447781, 28448967270NC E
OsLLA6nsLTP ILOC_Os07g07790Os07g0174400Chr7:3909569, 3910639188YC D E
OsLLA7nsLTP ILOC_Os07g43290Os07g0625800Chr7:25919683, 25921877177YA B C D E
OsLLA8nsLTP IILOC_Os10g11370Os10g0191100Chr10:6319745, 6322898357NC E
OsFLA1Fas IIILOC_Os04g48490Os04g0574200Chr4:28725108, 28726790431YA B C E
OsFLA2Fas IIILOC_Os03g03600Os03g0128000Chr3:1578312, 1579906401YA B C D E
OsFLA3Fas IIILOC_Os08g23180Os08g0321000Chr8:13982326, 13985506415YA B C E
OsFLA4Fas ILOC_Os08g38270Os08g0490600Chr8:24249970, 24251221271YA B C E
OsFLA5Fas ILOC_Os08g39270Os08g0502400Chr8:24815594, 24817738274YA B C E
OsFLA6Fas ILOC_Os05g48900Os05g0563600Chr5:27989048, 27990300272YA B C E
OsFLA7Fas ILOC_Os01g47780Os01g0668100Chr1:27339855, 27340906247YA B C D E
OsFLA8Fas ILOC_Os01g06580Os01g0159200Chr1:3096346, 3097527269YA B C D E
OsFLA9Fas ILOC_Os05g07060Os05g0163300Chr5:3715739, 3716868275YA B C D E
OsFLA10Fas ILOC_Os09g30010Os09g0476800Chr9:18256643, 18257801273YA B C E
OsFLA11Fas IIILOC_Os09g07350Os09g0248100Chr9:3653972, 3657804401YA B C E
OsFLA12Fas IIILOC_Os01g62380Os01g0841100Chr1:36098359, 36103250427YA B C D E
OsFLA13Fas ILOC_Os04g39600Os04g0472200Chr4:23414397, 23415562263YA B C E
OsFLA14Fas ILOC_Os04g39590Os04g0472100Chr4:23412950, 23414118277YA B C E
OsFLA15Fas ILOC_Os02g20560Os02g0308800Chr2:12127908, 12129258268YA B C E
OsFLA16Fas IVLOC_Os07g06680Os07g0160600Chr7:3257238, 3261152479NA B C D E
OsFLA17Fas IVLOC_Os03g57490Os03g0788700Chr3:32771257, 32775727479NA B C E
OsFLA18Fas ILOC_Os05g48890Os05g0563550Chr5:27984690, 27985698251YA B C E
OsFLA19Fas ILOC_Os02g20540Os02g0308400Chr2:12113830, 12114971267YA B C E
OsFLA20Fas IILOC_Os02g26320Os02g0461500Chr2:15453153, 15454310304YA B C E
OsFLA21Fas IILOC_Os02g49420Os02g0726000Chr2:30192724, 30193979266YA B C E
OsFLA22Fas IILOC_Os02g26290Os02g0461000Chr2:15430699, 15431764213NA B C E
OsFLA23Fas ILOC_Os06g17460Os06g0285100Chr6:10130034, 10130805211YE
OsFLA24Fas IVLOC_Os03g57460Os03g0788600Chr3:32749877, 32753506474NA B C E
OsFLA25Fas IILOC_Os06g44660Os06g0656800Chr6:26956539, 26957614289YA B C E
OsFLA26Fas IIILOC_Os05g38500Os05g0459700Chr5:22524615, 22526317472NA B C E
OsFLA27Fas ILOC_Os09g30486Os09g0482780Chr9:18610276, 18612149267NA B C E

Systematic designation given to rice AGPs. FLAs that have previously been identified by bioinformatic analysis are indicated in bold (Faik ); β-GlcY-reactive AGPs are indicated in bold and underlined (Mashiguchi ).

C, classical AGP; KC, Lys-rich classical AGP; Pep, AG peptide; NC, non-classical AGP; eNod, early nodulin-like AGP; nsLTP, non-specific lipid transfer protein-like AGP; Fas, fasciclin-like AGP.

Locus identity no. assigned by RGAP and RAP-DB, which could be converted by ID converter (http://rapdb.dna.affrc.go.jp/tools/converter/).

Chromosomal localization of rice AGP-encoding genes.

Length of the open reading frame in amino acids.

GPI anchor signal predicted by big-PI (http://mendel.imp.ac.at/gpi/plant_server.html); Y, yes, N, no.

Evidence for gene expression from (A) full-length cDNA, (B) ESTs, (C) MPSS tags, (D) serial analysis of gene expression (SAGE), and (E) microarray data.

OsAGP28 does not exist in RGAP and RAP-DB.

NA, not available.

AGP-encoding genes identified in the rice genome Systematic designation given to rice AGPs. FLAs that have previously been identified by bioinformatic analysis are indicated in bold (Faik ); β-GlcY-reactive AGPs are indicated in bold and underlined (Mashiguchi ). C, classical AGP; KC, Lys-rich classical AGP; Pep, AG peptide; NC, non-classical AGP; eNod, early nodulin-like AGP; nsLTP, non-specific lipid transfer protein-like AGP; Fas, fasciclin-like AGP. Locus identity no. assigned by RGAP and RAP-DB, which could be converted by ID converter (http://rapdb.dna.affrc.go.jp/tools/converter/). Chromosomal localization of rice AGP-encoding genes. Length of the open reading frame in amino acids. GPI anchor signal predicted by big-PI (http://mendel.imp.ac.at/gpi/plant_server.html); Y, yes, N, no. Evidence for gene expression from (A) full-length cDNA, (B) ESTs, (C) MPSS tags, (D) serial analysis of gene expression (SAGE), and (E) microarray data. OsAGP28 does not exist in RGAP and RAP-DB. NA, not available. It is difficult to find FLAs using the amino acid bias (Perl script) because of their low proportion of PAST (from 31.74 to 45.35) (Supplementary Table S2 at JXB online). In previous research, 21 rice FLA genes were identified together with 33 wheat FLA genes (Faik ). All rice fasciclin-like proteins from Pfam (PF02469, 34 proteins from japonica), Smart (SM00554, 58 proteins from indica and japonica), and Interpro (IPR000782, 89 proteins from indica and japonica) were downloaded as a source, the proteins from indica and the redundant proteins were deleted, then they were scanned for the presence of N-terminal secretion signals using SignalP 3.0. After excluding proteins that have no signal peptides, the remaining fasciclin-like proteins were scanned for the presence of the putative AG polysaccharide additional domain, where Pro/Hyp residues are organized in non-contiguous clusters. To predict the occurrence of such glycomodules in these putative rice FLAs, Pro/Hyp-containing sequence motifs in each of them were manually counted. Finally, 27 OsFLAs genes were identified by integrating research results and annotation advances. At least 77.78% of rice FLAs are predicted to be GPI anchored to the plasma membrane (Table 2).

Protein structure and phylogenetic analysis

According to the structural characteristics of rice AGP backbones, they can be classified into seven subfamilies, including classical AGPs, Lys-rich AGPs, AG peptides, non-classical AGPs, eNod-like AGPs, nsLTP-like AGPs, and FLAs. The protein backbones of OsAGPs have been organized into domains leading to their classification as either ‘classical’ or ‘non-classical’ (Showalter, 1993, 2001; Gaspar ). Classical AGPs consist of a core protein of highly varying length and domain complexity, and often contain a GPI lipid anchor (Fig. 2A). Non-classical AGPs are defined as having regions that are atypical of an AGP-like region, for example regions rich in Asn or Cys residues in addition to regions containing Pro/Hyp (Du ). Three non-classical AGPs were found in rice; they have an atypical region in addition to the AGP-like region. OsAGP29 contains a GPI lipid anchor signal, but OsAGP30 and OsAGP31 do not. Some OsAGPs, designated as lysine-rich AGPs, contain a short lysine-rich domain between the Pro-rich domain and the hydrophobic C-terminus (Figs 2B, 3B). Several rice AGP backbones are only 50–75 residues long and are termed AG peptides (Fig. 2C). There are many kinds of ‘chimeric’ AGPs, such as OsELAs, OsLLAs, and OsFLAs. An early nodulin (eNod)-like domain related to copper binding is present between the signal peptide and the AGP-like region in ELAs (Figs 2D, 3C). Similarly, an nsLTP-like domain exists between the signal peptide and the AGP-like region in LLAs (or xylogen) (Figs 2E, 3D). The eight OsLLAs were divided into two groups, the group I OsLLAs have an nsLTP-like domain between the signal peptide and the AGP-like region; and the AGP-like regions of group II OsLLAs are separated into two parts by nsLTP-like domains, and no GPI anchor signals are present (Fig. 2E). Moreover, 27 rice FLAs were divided into four groups due to the numbers of AGP-like regions and fasciclin domains (Fig. 2F). There are two AGP-like regions and one fasciclin domain in group I OsFLAs, one AGP-like region and one fasciclin domain in group II OsFLAs, two AGP-like regions and two fasciclin domains in group III OsFLAs, and one AGP-like region and two fasciclin domains in group IV OsFLAs.
Fig. 2.

Schematic representations of the different subclasses of rice AGP protein backbones. Not drawn to scale

Fig. 3.

Multiple sequence alignments of the SDGT region (A), Lys-rich region (B), eNod-like domain (C), and nsLTP-like domain (D).

Schematic representations of the different subclasses of rice AGP protein backbones. Not drawn to scale Multiple sequence alignments of the SDGT region (A), Lys-rich region (B), eNod-like domain (C), and nsLTP-like domain (D). To investigate the phylogenetic relationship of AGPs, six unrooted trees of classical AGPs, Lys-rich AGPs, AG peptides, ELAs, LLAs, and FLAs were constructed from alignments of their full-length amino acid sequences from rice, Arabidopsis, and other plants (Fig. 4). Classical AGPs expand in an overall species-specific manner; 11 proteins in the first clade are from Arabidopsis, 10 proteins in the second clade are from rice, and the proteins in the third clade are three from Arabidopsis and one from rice (Fig. 4A). The phylogenetic relationship of two family members of Lys-rich AGPs (OsAGP12 and OsAGP13) is close to AtAGP19 (Fig. 4B). There is one clade of AG peptides termed SDGT region-containing AG peptides because they all have four conversed residues (Ser, Asp, Gly, and Thr) in their protein backbones (Figs 3A, 4C). Although OsFLA21 was phylogeneticly close to the members of group III FLAs, it was recognized as a member of group II FLAs, because there is one fasciclin-like domain and two AGP-like regions in its protein backbone. Group II of LLAs is absent in Arabidopsis, so they may specifically exist in monocots (Fig. 4E). However, the family members in clades of AG peptides, group I of OsLLAs, OsELAs, and OsFLAs are present in both Arabidopsis and rice, indicating that they may play similar roles in the two plants (Fig. 4C–F).
Fig. 4.

Phylogenetic relationship of AGPs between rice and other species. (A) Classical AGPs, (B) Lys-rich AGPs, (C) AG peptides, (D) eNod-like AGPs, (E) nsLTP-like AGPs, and (F) fasciclin-like AGPs. The phylogenetic tree is based on the multiple sequence alignments of AGPs and was generated using MEGA 4.0 by the Neighbor–Joining method. Bootstrap support from 10 000 reiterations is indicated above the branches.

Phylogenetic relationship of AGPs between rice and other species. (A) Classical AGPs, (B) Lys-rich AGPs, (C) AG peptides, (D) eNod-like AGPs, (E) nsLTP-like AGPs, and (F) fasciclin-like AGPs. The phylogenetic tree is based on the multiple sequence alignments of AGPs and was generated using MEGA 4.0 by the Neighbor–Joining method. Bootstrap support from 10 000 reiterations is indicated above the branches.

Chromosomal localization and gene duplication

The approximate positions of AGP-encoding genes were marked on the physical map of the 12 rice chromosomes. According to the genomic localization of AGP-encoding genes, the 69 AGP-encoding genes of rice distribute in 10 of the 12 chromosomes: 13 genes on chromosome 1, 10 genes on chromosome 2, nine genes on chromosome 6, seven genes each on chromosomes 3 and 4, six genes each on chromosomes 5 and 8, five genes on chromosome 7, three genes on chromosome 9, and two genes on chromosome 10 (Fig. 5).
Fig. 5.

Genomic localization of AGP-encoding genes on rice chromosomes. AGP-encoding genes classified into different subfamilies are shown in different colours. White ovals on the chromosomes indicate the position of centromeres. Chromosome numbers are indicated at the top of each chromosome. The AGP-encoding genes presented on duplicated chromosomal segments between two chromosomes are connected by black lines, and tandem duplicated genes on the same chromosome are connected by a vertical bar.

Genomic localization of AGP-encoding genes on rice chromosomes. AGP-encoding genes classified into different subfamilies are shown in different colours. White ovals on the chromosomes indicate the position of centromeres. Chromosome numbers are indicated at the top of each chromosome. The AGP-encoding genes presented on duplicated chromosomal segments between two chromosomes are connected by black lines, and tandem duplicated genes on the same chromosome are connected by a vertical bar. Nine pairs of rice AGP-encoding genes are arranged in tandem repeats (Fig. 5), representing localized gene duplications, which are separated by a maximum of five intervening genes. The tandem duplicated genes belong to three subfamilies, including five pairs of OsFLAs, three pairs of AG peptides, and one pair of OsLLAs. The duplication of the AGP-encoding genes is also associated with chromosomal segment duplications. Six pairs of OsFLAs and one pair of OsLLAs are located on the duplicated segmental regions of rice chromosomes mapped by TIGR when the maximal length between collinear gene pairs is 500 kb. All of these duplicated genes exhibit high sequence similarity in both the AGP-like regions and the additional domains. The gene duplication analysis shows that 22 of 27 OsFLA genes expand through gene duplication (Fig. 5). Interestingly, OsFLA6 and OsFLA18, and OsFLA17 and OsFLA24 are gene pairs of tandem duplication, and OsFLA7 and OsFLA18, and OsFLA16 and OsFLA24 are gene pairs of chromosomal segmental duplication, revealing that these gene clusters expand through both tandem and segmental duplications. In addition, one pair of tandem duplicated genes (OsLLA4 and OsLLA5) and one pair of chromosomal duplicated genes (OsLLA1 and OsLLA7) exist in the nsLTP-like AGPs. It is noteworthy that tandem duplication plays important roles in the expansion of the AG peptide subfamily; three pairs of tandem duplicated genes are found in AG peptides (OsAGP18 and OsAGP19, OsAGP21 and OsAGP22, and OsAGP25 and OsAGP26) (Fig. 6).
Fig. 6.

Expression profiles of AGP-encoding genes in various rice organs and tissues at different development stages. The microarray data sets (GSE6893) of gene expression at various developmental stages were used for cluster display. A heat map representing hierarchical clustering of average log signal values of all the AGP-encoding genes in various organs and tissues at different developmental stages (indicated at the top of each lane). The genes which share similar expression patterns are divided into 12 clusters: (A) all examined tissues; (B) all examined tissues except ML and YL; (C) S1 and S2; (D) YR and P5; (E) P3; (F) low expression in all examined tissues; (G) P6; (H) P5; (I) YR, P5, and P6; (J) P6, S4, and S5; (K) YR, P6, and S1; (L) YL, ML, P6, and S1–S6. The representative AGP-encoding genes differentially expressed during various stages of development for which real-time PCR analysis was performed are indicated by a black triangle on the left. The colour scale (representing average log signal values) is shown at the bottom.

Expression profiles of AGP-encoding genes in various rice organs and tissues at different development stages. The microarray data sets (GSE6893) of gene expression at various developmental stages were used for cluster display. A heat map representing hierarchical clustering of average log signal values of all the AGP-encoding genes in various organs and tissues at different developmental stages (indicated at the top of each lane). The genes which share similar expression patterns are divided into 12 clusters: (A) all examined tissues; (B) all examined tissues except ML and YL; (C) S1 and S2; (D) YR and P5; (E) P3; (F) low expression in all examined tissues; (G) P6; (H) P5; (I) YR, P5, and P6; (J) P6, S4, and S5; (K) YR, P6, and S1; (L) YL, ML, P6, and S1–S6. The representative AGP-encoding genes differentially expressed during various stages of development for which real-time PCR analysis was performed are indicated by a black triangle on the left. The colour scale (representing average log signal values) is shown at the bottom.

Expression analysis of rice AGP-encoding genes in various organs and tissues at different developmental stages

The gene expression pattern provides important clues for investigating gene function. The expression of rice AGP-encoding genes was analysed using three publicly available resources: EST expression profiles, microarrays, and MPSS tags. EST expression profiles of AGP-encoding genes were obtained by searching the RAP-DB locus across the UniGene database at NCBI. It was found that 52 of the 69 AGP-encoding genes have at least one corresponding full-length cDNA and/or EST, 47 AGP-encoding genes have both full-length cDNA and EST evidence, whereas five genes have only EST evidence (Table 2). According to EST abundance data of rice organs and tissues, 48 of 52 AGP-encoding genes can be transcriptionally analysed using EST expression profiles. A large number of AGP-encoding genes show high expression abundance in stem, vegetative meristem, panicle, and seed. Some of them show tissue-specific expression patterns: OsFLA15 and OsFLA19 in stem; OsAGP13, OsAGP17, OsAGP20, and OsFLA2 in vegetative meristem; OsAGP7 and OsFLA22 in panicle; and OsAGP1 in seed (Supplementary Table S3 at JXB online). Microarrays provide a high-throughput means to analyse the expression of genes of interest at the transcriptional level. Sixty of 69 AGP-encoding genes have at least one probe on Affymetrix rice whole-genome arrays (GPL2025). The results of microarray analysis revealed that most of the AGP-encoding genes were expressed in at least one of the reproductive and vegetative developmental stages (SAM, P1–P6, S1–S5, young root and leaf, and mature leaf) (Fig. 6). Eleven genes are abundantly expressed in all examined tissues (Fig. 6A). Ten genes are abundantly expressed in root, SAM, panicles, and seed (Fig. 6B). OsAGP30 is abundantly expressed in early stages of seed development (Fig. 6C). Five genes are abundantly expressed in roots and panicles at the stage of vacuolated pollen (P5) (Fig. 6D); OsFLA17 was specifically expressed in early stages of panicle development (Fig. 6E). The expression levels of 10 genes were relatively low in all examined tissues (Fig. 6F). Interestingly, five genes (OsAGP16, OsAGP27, OsFLA14, OsFLA22, and OsFLA25) and one gene (OsFLA15) were specifically expressed in panicles at the stage of vacuolated pollen (P5) and at the stage of mature pollen (P6), respectively, indicating that these genes may be correlated to pollen development (Fig. 6G, H). Ten genes were abundantly expressed in roots, late stages of panicle development, and early stages of seed development (Fig. 6I). OsAGP7 was dominantly expressed in panicles at the stage of mature pollen (P6) and seeds at stages of embryo maturation (S4) and desiccation (S5) (Fig. 6J), OsAGP25 in roots, panicles at the stage of mature pollen (P6), and seeds at the stage of early globular embryo (S1) (Fig. 6K), and OsELA3 and OsAGP15 in leaves, panicles at the stage of mature pollen (P6), and seeds (Fig. 6L). Moreover, differential expression analysis was performed to identify AGP-encoding genes which have more abundant expression in the processes of seedling, panicle, and seed development by comparing the expression between vegetative and reproductive tissues (Table 3). A gene is considered as differentially expressed if its expression level at a certain stage is significantly lower or higher (<0.5 or >2) than in the compared tissues (Table 3). A large number (30) of rice AGP-encoding genes are differentially expressed in reproductive tissues compared with vegetative tissues, indicating that these genes may play important roles in panicle and seed development (Table 3). Six rice AGP-encoding genes are differentially expressed in vegetative tissues, and OsLLA2 and OsLLA6 were dominantly expressed in root (Fig. 6D and Table 3).
Table 3.

Differential expression analysis of rice AGP-encoding genes in vegetative and reproductive tissues

GeneProbeaMOVbMORcMOR/MOVdDEVeDERf
OsAGP1Os-21950-1-S1_at1264.821 933.717.34Y
OsAGP10OsAffx-6065-1-S1_x_at252.9311.51.23
OsAGP11OsAffx-26415-1-S1_at58.3165.42.84Y
OsAGP12Os-56224-1-S1-at3766.06611.01.76
OsAGP13Os-5095-1-S1-at14 091.47888.90.56
OsAGP14Os-6768-1-S1_at16 850.321 308.11.26
OsAGP15Os-57295-1-S1_at1961.82595.21.32
OsAGP16Os-30521-1-S1_at53.9763.914.17Y
OsAGP17Os-5831-1-S1_at8865.819 596.62.21Y
OsAGP2OsAffx-11813-1-S1_at1180.72311.91.96
OsAGP20Os-12412-1-S1_at4099.25674.41.38
OsAGP21Os-5592-1-S1_at10 774.216 628.31.54
OsAGP22Os-5823-1-S1_at7667.52714.30.35Y
OsAGP23Os-11520-1-S1_at6995.23350.90.48Y
OsAGP24Os-5384-1-S1_at26 966.348 847.01.81
OsAGP25OsAffx-27752-1-S1_s_at4398.87348.51.67
OsAGP26Os-50508-1-S1_at63.294.21.49
OsAGP27OsAffx-5568-1-S1_at52.11817.634.89Y
OsAGP29OsAffx-23631-1-S1_at86.2219.32.54Y
OsAGP30Os-55753-1-S1_at44.0876.719.93Y
OsAGP3Os-55382-1-S1_at5047.514 647.82.90Y
OsAGP31Os-5927-1-S1_at2157.14118.31.91
OsAGP7OsAffx-15474-1-S1_s_at96.64359.545.13Y
OsELA1Os-34474-2-S1_at44.5140.43.16Y
OsELA2OsAffx-24136-1-S1_at60.598.21.62
OsELA3Os-55380-1-S1_at1639.55631.93.44Y
OsFLA1OsAffx-14326-1-S1_s_at878.512 342.814.05Y
OsFLA10Os-56233-1-S1_s_at477.31726.23.62Y
OsFLA11Os-49544-1-S1_at4495.44285.70.95
OsFLA12Os-42475-1-S1_at597.82387.93.99Y
OsFLA13Os-5963-1-S1_at353.681.40.23Y
OsFLA14Os-53497-1-S1_at62.3731.311.74Y
OsFLA15Os-17961-1-S1_a_at66.310 701.1161.40Y
OsFLA16Os-17949-1-S1_a_at2138.44617.12.16Y
OsFLA17Os-23180-1-S1_at36.2850.923.51Y
OsFLA18OsAffx-15160–1-S1_at4545.8582.50.13Y
OsFLA19Os-14430-1-S1_at77.2728.69.44Y
OsFLA2Os-5826-2-S1_a_at3439.25314.11.55
OsFLA20Os-55769-1-S1_at47.8100.42.10Y
OsFLA21Os-53492-1-S1_at243.21278.55.26Y
OsFLA22Os-6347-1-S1_at33.91932.257.00Y
OsFLA23OsAffx-15475–1-S1_at50.576.01.50
OsFLA24Os-22627-1-S1_at3010.45033.81.67
OsFLA25Os-20322-1-S1_at53.84595.485.42Y
OsFLA26OsAffx-27225-1-S1_at224.5474.72.11Y
OsFLA27Os-52641-1-S1_at6064.74457.60.74
OsFLA3Os-5500-1-S1_s_at13 961.017 519.01.25
OsFLA4Os-55554-1-S1_at1780.76680.53.75Y
OsFLA5Os-54871-1-S1_at1362.33671.72.70Y
OsFLA6Os-53704-1-S1_x_at19 535.720 590.01.05
OsFLA7Os-5908-1-S1_at11 698.08593.80.73
OsFLA8Os-1437-1-S1_at6607.417 431.62.64Y
OsFLA9Os-54555-1-S1_at897.715 115.816.84Y
OsLLA1Os-9710-1-S1_at4940.34113.90.83
OsLLA2OsAffx-13377-1-S1_at1117.9539.30.48Y
OsLLA5OsAffx-15958-1-S1_at95.7300.03.13Y
OsLLA6OsAffx-28335-1-S1_at5696.2233.30.04Y
OsLLA7Os-9342-1-S1_at3120.11746.40.56
OsLLA8OsAffx-18220-1-S1_x_at87.0212.52.44Y

The probe ID is used on microarray plate GPL2025. The experiment GSE6893 was used for differential expression analysis.

MOV, the maximum absolute values of vegetative tissues.

MOR, the maximum absolute values of reproductive tissues.

The values of MOR divided by MOV. Values considered significant (i.e. <0.5 or >2) are in bold.

DEV, differential expression in vegetative tissues.

DER, differential expression in reproductive tissues.

Differential expression analysis of rice AGP-encoding genes in vegetative and reproductive tissues The probe ID is used on microarray plate GPL2025. The experiment GSE6893 was used for differential expression analysis. MOV, the maximum absolute values of vegetative tissues. MOR, the maximum absolute values of reproductive tissues. The values of MOR divided by MOV. Values considered significant (i.e. <0.5 or >2) are in bold. DEV, differential expression in vegetative tissues. DER, differential expression in reproductive tissues. MPSS generates thousands of molecules per reaction and provides a sensitive quantitative measure of gene expression for nearly all genes in the genome (Brenner ). It was found that MPSS signatures are available for 58 rice AGP-encoding genes in at least one of the libraries (Supplementary Table S4 at JXB online). This further strengthens the idea that most rice AGP-encoding genes are expressed. Differences are displayed by number of tags (tpm, transcripts per million), being low in <50, moderate in between 50 tpm and 500 tpm, and strong in >500 tpm. A large percentage of AGP-encoding genes (37) are expressed at a high level, while 14 and seven AGP-encoding genes exhibit moderate and low expression, respectively (Supplementary Table S4). Further, both 17-base and 20-base signature data sets are used to compare the differential expression of AGP-encoding genes in different MPSS libraries. Thirteen AGP-encoding genes were identified with tissue-specific expression patterns, suggesting that these genes may play important roles in pollen, stigmas and ovaries, immature panicles, and germinating seeds. Moreover, the highest number of genes (eight) show preponderant expression in mature pollen (Supplementary Table S4). In order to validate the results of digital expression analysis, real-time PCR analysis was performed for some representative genes. The results of real-time PCR showed that the expression patterns of these genes were in general agreement with the data of microarrays and MPSS tags. For example, OsAGP1, OsFLA7, OsAGP30, and OsAGP27 are expressed during the processes of panicle and ovary development, and also predominantly in stigmas and ovaries as compared with anther (Fig. 7A, C, F, L). OsAGP13 is mainly expressed in panicles at the stage of floral transition (P1) (Fig. 7B), but was not detected in either anther or stigmas and ovaries. OsFLA24, OsFLA4, OsFLA18, OsFLA17, OsFLA11, OsAGP31, and OsFLA2 were mainly expressed in panicles at the stages of megaspore and microspore development (P2–P5), and OsFLA24, OsFLA4, OsFLA18, and OsFLA11 were also predominantly expressed in stigmas and ovaries (Fig. 7D, E, G, H, O, P, R). OsFLA14, OsAGP16, OsFLA22, and OsFLA25 were specifically expressed in panicles at the stages of flower maturation (P5 and P6), and predominantly in anther (Fig. 7I, J, K, M). The expression levels of OsFLA8 and OsFLA27 were highest in panicles at the stage of the young microspore (P4) and were mainly expressed in anther (Fig. 7N, Q), while OsAGP7 was specifically expressed in P5 and anther, and OsELA3 in 30 DAP embryos (Fig. 7T).
Fig. 7.

Real-time PCR analysis for confirmation of the differential expression of representative AGP-encoding genes in various organs and tissues at different development stages. The small pictures inserted in the figures represent their relative expression levels in anther (An) and stigma and ovary (SO) from 28 cm panicles.

Real-time PCR analysis for confirmation of the differential expression of representative AGP-encoding genes in various organs and tissues at different development stages. The small pictures inserted in the figures represent their relative expression levels in anther (An) and stigma and ovary (SO) from 28 cm panicles.

Expression analysis of rice AGP-encoding genes under abiotic stress, ABA, and GA treatments

To investigate the abiotic stress response of rice AGP-encoding genes, the results of microarray (GSE6901) from 7-day-old seedlings subjected to drought, salt, and cold stresses were analysed. The data revealed that a total of 15 genes were significantly down- or up-regulated (<0.5 or >2) as compared with the control in at least one of the stress conditions examined (Fig. 8). The transcript levels of two genes (OsAGP3 and OsAGP24) were up-regulated by all three stresses (Fig. 8A), three genes (OsELA3, OsAGP1, and OsAGP25) were up-regulated significantly by drought and salt stresses (Fig. 8B), one gene (OsAGP15) was up-regulated by salt stress (Fig. 8C), two genes (OsFLA1 and OsFLA4) were down-regulated by cold stress (Fig. 8D), and seven genes (OsAGP23, OsAGP2, OsFLA27, OsAGP20, OsFLA5, OsFLA19, and OsFLA18) were down-regulated by drought and salt stresses (Fig. 8E). By using the real-time PCR technique, the expression levels of four representative AGP-encoding genes were investigated under various stress conditions. The expression of OsAGP15, OsELA3, and OsAGP1 was induced by drought and salt stresses, and showed a gradual increased expression pattern after the stresses were applied (Fig. 8). On the other hand, the expression level of OsAGP20 was up-regulated by cold at 6, 12, and 24 h, and down-regulated by drought and salt stresses (Fig. 8).
Fig. 8.

Expression profiles of rice AGP-encoding genes differentially expressed under abiotic stresses. The microarray data sets (GSE6901) of gene expression under various abiotic stresses were used for cluster display. The average log signal values of AGP-encoding genes under control and various stress conditions (indicated at the top of each lane) are presented by a heat map. Only those genes that exhibited ≥2-fold differential expression, under any of the given abiotic stress conditions, are shown. (A) Up-regulated by drought, salt, and cold stresses; (B) up-regulated by drought and salt stresses; (C) up-regulated by salt stress; (D) down-regulated by cold stress; (E) down-regulated by drought and salt stresses. The representative AGP-encoding genes differentially expressed under different abiotic stress conditions for which real-time PCR analysis was performed are indicated as a black triangle to the left. The results of real-time PCR analysis for confirmation of the differential expression of rice AGP-encoding genes under abiotic stresses are shown on the right. Two asterisks (**, P <0.01) and one asterisk (*, 0.01

Expression profiles of rice AGP-encoding genes differentially expressed under abiotic stresses. The microarray data sets (GSE6901) of gene expression under various abiotic stresses were used for cluster display. The average log signal values of AGP-encoding genes under control and various stress conditions (indicated at the top of each lane) are presented by a heat map. Only those genes that exhibited ≥2-fold differential expression, under any of the given abiotic stress conditions, are shown. (A) Up-regulated by drought, salt, and cold stresses; (B) up-regulated by drought and salt stresses; (C) up-regulated by salt stress; (D) down-regulated by cold stress; (E) down-regulated by drought and salt stresses. The representative AGP-encoding genes differentially expressed under different abiotic stress conditions for which real-time PCR analysis was performed are indicated as a black triangle to the left. The results of real-time PCR analysis for confirmation of the differential expression of rice AGP-encoding genes under abiotic stresses are shown on the right. Two asterisks (**, P <0.01) and one asterisk (*, 0.01

salt stress; CS, cold stress. The results of ABA- and GA-treated callus were used to analyse the regulation of AGP-encoding genes (GSE661). It was found that many AGP-encoding genes were regulated by ABA and GA (Supplementary Table S5 at JXB online). To verify the results of microarray under ABA and GA treatments, the transcriptional level of six representative AGP-encoding genes in young seedlings treated with ABA and GA were investigated by using real-time PCR (Fig. 9). The expression levels of OsAGP1, OsAGP15, and OsELA3 were markedly increased in seedlings under ABA treatment (Fig. 9A, B, D). The expression level of OsAGP20 was down-regulated by ABA, but up-regulated by GA (Fig. 9C). The expression of OsFLA24 was almost totally inhibited by ABA, but induced by GA (Fig. 9F). However, the expression of OsFLA11 was more complex, being up-regulated by GA at 3–12 h and transiently up-regulated by ABA at 12 h (Fig. 9E).

Fig. 9.

Real-time PCR analysis for confirmation of the differential expression of rice AGP-encoding genes under ABA and GA treatments. Two asterisks (**, P <0.01) and one asterisk (*, 0.01

Real-time PCR analysis for confirmation of the differential expression of rice AGP-encoding genes under ABA and GA treatments. Two asterisks (**, P <0.01) and one asterisk (*, 0.01

Expression analysis of AGP-encoding genes in rice and Arabidopsis

Gene expression patterns can provide important clues for the study of gene function. Therefore, a comparative analysis of the expression patterns of the rice and Arabidopsis AGP gene family has been performed. Using microarray and MPSS data, the expression of rice and Arabidopsis AGP-encoding genes was examined in inflorescence, leaf, root, and seed/silique, and under ABA, GA, and abiotic stress treatments (Fig. 10).
Fig. 10.

The expression patterns in different tissues for Arabidopsis and rice AGP-encoding genes. AGP-encoding genes are presented in the same order as in the corresponding phylogenetic trees. The expression data of AGP-encoding genes in different tissues were combined from microarrays and MPSS tags. The expression data in pollen of rice AGP-encoding genes were extracted from MPSS tags. Letters on the top indicate different tissues and databases. M1 and M2 represent microarrays and MPSS tags, respectively. The ratios of the absolute values divided by the average of all microarray values were used for analysis (Supplementary Table S5 at JXB online). Light grey, dark grey, and black boxes indicate low (between 0.5 and 1 or the signature numbers between 0 tpm and 50 tpm), moderate (between 1 and 2 or between 50 tpm and 500 tpm), and high (>2 or >500 tpm) expression levels, respectively. The white boxes indicate that no expression could be detected (<0.5). ‘×’ represents no probe or signature on microarray and MPSS. SF, subfamily; R, root; L, leaf; I, inflorescence; Po, pollen; S, silique or seed.

The expression patterns in different tissues for Arabidopsis and rice AGP-encoding genes. AGP-encoding genes are presented in the same order as in the corresponding phylogenetic trees. The expression data of AGP-encoding genes in different tissues were combined from microarrays and MPSS tags. The expression data in pollen of rice AGP-encoding genes were extracted from MPSS tags. Letters on the top indicate different tissues and databases. M1 and M2 represent microarrays and MPSS tags, respectively. The ratios of the absolute values divided by the average of all microarray values were used for analysis (Supplementary Table S5 at JXB online). Light grey, dark grey, and black boxes indicate low (between 0.5 and 1 or the signature numbers between 0 tpm and 50 tpm), moderate (between 1 and 2 or between 50 tpm and 500 tpm), and high (>2 or >500 tpm) expression levels, respectively. The white boxes indicate that no expression could be detected (<0.5). ‘×’ represents no probe or signature on microarray and MPSS. SF, subfamily; R, root; L, leaf; I, inflorescence; Po, pollen; S, silique or seed. OsAGP7 and OsAGP10 have similar expression patterns in flower, seed, and pollen to AtAGP6 and AtAGP11 which are essential for pollen function (Levitin ) (Fig. 10A). The two rice Lys-rich AGPs (OsAGP12 and OsAGP13) have similar expression patterns in root, flower, and seed and a close phylogenetic relationship with AtAGP19 which functions in various aspects of plant growth, including cell division and expansion (Yang ) (Fig. 10B). There are 15 and 13 AG peptide-encoding genes in rice and Arabidopsis, respectively. The expression results of OsAGP28 and AtAGP41 are not available in both the microarray and MPSS, AtAGP20 and OsAGP26 are expressed at an extremely low level in all examined tissues, and most of the remaining AG peptide-encoding genes (24) have high expression levels in root, flower, pollen, and seed (Fig. 10C). It is interesting that 15 of 24 AG peptide-encoding genes are expressed in pollen, OsAGP16 and OsAGP18 are specifically expressed in pollen, and AtAGP23 and AtAGP40 are highly expressed in flower, pollen, and seed (Fig. 10C). Although the amino acid sequence similarity of non-classical AGPs is relatively low, OsAGP31, AtAGP30, and AtAGP31 are highly expressed in roots (Fig. 10D), and AtAGP30 plays a role in root regeneration (Hengel and Roberts, 2003). There are two xylogen genes in Arabidopsis, AtXYP1 and AtXYP2 (Motose ), and four xylogen-like genes (nsLTP-AGPs) in rice, OsLLA1, OsLLA2, OsLLA6, and OsLLA7. OsLLA1 and OsLLA7 have similar expression patterns in root, flower, and seed to AtXYP1, and OsLLA2 and OsLLA6 have similar expression patterns in root to AtXYP2 (Fig. 10E). OsFLA12 and OsFLA26 are the rice homologues of AtFLA4 which is also characterized as SOS5 (Shi ), OsFLA12 is similarly expressed to AtFLA4, and OsFLA26 is specifically expressed in flower (Fig. 10F). Interestingly, all of the group II FLAs, except AtFLA5, were expressed in the reproductive processes; OsFLA20, OsFLA22, OsFLA25, AtFLA3, and AtFLA14 were predominantly expressed in flower and pollen (Fig. 10G). Four Arabidopsis classical AGPs (AtAGP1, AtAGP2, AtAGP5, and AtAGP10) were up-regulated by ABA and various abiotic stresses, and two rice classical AGPs (OsAGP1 and OsAGP3) were up-regulated by drought stress (Fig. 11A). AtELA3 and AtELA6 were up-regulated by ABA and salt stress, and OsELA3 was up-regulated by drought and salt stresses (Fig. 11B). These results indicated that classical AGPs and eNod-like AGPs may play important roles in ABA signalling and abiotic stresses response.
Fig. 11.

The expression patterns under ABA, GA, and abiotic stress treatments for Arabidopsis and rice AGP-encoding genes. AGP-encoding genes are presented in the same order as in the corresponding phylogenetic trees. The expression data of AGP-encoding genes under ABA, GA, drought stress, salt stress, and cold stress treatments were from microarrays. Upright triangles and inverted triangle represent values that are significantly lower (<0.5) and higher (>2) than the control, and ‘×’ represents no probe on microarray. SF, subfamily; DS, drought stress; SS, salt stress; CS, cold stress.

The expression patterns under ABA, GA, and abiotic stress treatments for Arabidopsis and rice AGP-encoding genes. AGP-encoding genes are presented in the same order as in the corresponding phylogenetic trees. The expression data of AGP-encoding genes under ABA, GA, drought stress, salt stress, and cold stress treatments were from microarrays. Upright triangles and inverted triangle represent values that are significantly lower (<0.5) and higher (>2) than the control, and ‘×’ represents no probe on microarray. SF, subfamily; DS, drought stress; SS, salt stress; CS, cold stress. To sum up, the gene expression patterns provide a solid foundation for future functional studies of AGPs in both rice and Arabidopsis.

Discussion

AGP gene family in rice

It is widely accepted that AGPs are a type of P/HRGP with an N-terminal signal sequence and a C-terminal GPI anchor additional signal, can react with β-GlcY, and are recognized by a series of monoclonal antibodies (Yariv ; Eisenhaber ; Schultz , 2004). A similar number of AGP-encoding genes exist in rice (69 genes) and Arabidopsis (>61 genes, including six putative Arabidopsis eNod-like AGPs), although the rice genome size is ∼3.7 times, and the gene number is ∼1.5 times, more than those of Arabidopsis. Most AGPs identified in the rice genome are classical AGPs >50% PAST, AG peptides >35% PAST, and FLAs that contain one or two fasciclin-like domains. However, only three non-classical AGPs, three ELAs, and eight LLAs are identified together with classical AGPs; other proteins with a lower PAST proportion cannot be identified using the amino acid biased method. Therefore, efforts to identify AGPs with a <50% PAST proportion will be the focus of AGP identification. Moreover, OsELA1 has two putative AG glycomodules (TPPP and APPP) and one probable AG polysaccharide attachment site (APEPA), but not typical AG glycomodules such as XP (X=A, S, T) repeats. The idea that (A/T) PPP could work as AG glycomodules was also validated in other AGPs of Arabidopsis (Schultz ). Proteins with putative AG glycomodules could be recognized as AGPs, which broadens the definition of AGPs. The eNod-like and nsLTP-like AGPs were first grouped as subfamilies of the AGP gene family. It had also been reported that there are at least 248 GPI-anchored proteins in Arabidopsis, 40% of them having putative AG glycomodules, including 10 eNod-like proteins and 11 nsLTP-like proteins (Borner ). There is sufficient evidence to show that the eNod-like and nsLTP-like AGPs really existed in both Arabidopsis and rice. In Arabidopsis, a hybrid protein termed a xylogen is homologous to nsLTPs and contains AG glycomodules and a GPI anchor additional signal. Xylogen is considered as an AGP because its molecular weight ranges from 50 kDa to 100 kDa which exceeds its predicted molecular mass of 16 kDa, and it can react with β-Yariv reagent and JIM13 antibody (Motose ). Six AGPs, namely a classical AGP (OsAGP1), an nsLTP-like AGP (OsLTPL1, designated as OsLLA1), an eNod-like AGP (OsENODL1, designated as OsELA1), and three AG peptides (OsAGPEP1, OsAGPEP2, and OsAGPEP3) were identified as β-Yariv reagent-reactive glycopeptides from rice aleurone cells (Mashiguchi ). One of them, OsAGPEP3 (designated as OsAGP28), was not identified in the present whole-genome analysis, because it does not exist in either RGAP or RAP-DB, but five of them were obtained in the present study (designated OsAGP1, OsLLA1, OsELA1, OsAGP14, and OsAGP19). The results of EST abundance, microarray, and MPSS signature analysis revealed that most AGP-encoding genes from the present prediction are expressed, except OsAGP9 which is suggested to be a pseudogene (Table 2). The expression levels of 10 AGP-encoding genes were relatively low in all organs and tissues at various developmental stages examined by microarray (Fig. 6F). The reasons for this are complicated; some of them might be pseudogenes, some might be expressed at relatively lower levels than other family members, and some might be expressed in other organs and tissues or developmental stages that were not selected for microarray analysis, such as stem, etc. The present expression analysis provides evidences that rice AGP-encoding genes with different expression patterns may play different functional roles in distinct developmental stages.

Putative functions of rice AGPs in various developmental processes

The expression of the majority of the AGP-encoding genes was spatially regulated, and many rice AGP-encoding genes were mainly expressed during sexual reproduction, especially in the processes of pollen development. This observation is consistent with the role of AGP-encoding genes in controlling pollen function (Coimbra , 2009; Levitin ; Anand and Tyagi, 2009). The two classical AGP-encoding genes, OsAGP7 and OsAGP10, are highly expressed in pollen, similar to AtAGP6 and AtAGP11 (Fig. 10A). The phylogenetically closest rice gene is OsAGP6, but it has a different expression pattern from that of AtAGP6 and AtAGP11. In this case, it is possible that the genes sharing the same function are not those with higher identity. Therefore, OsAGP7 and OsAGP10 may play a conserved role in pollen development, like AtAGP6 and AtAGP11 which redundantly control pollen development and fertility (Table 4) (Levitin ; Coimbra , 2010). The group III FLA genes are mainly expressed in the sexual reproductive processes. In anther, 15 of 24 AG peptide-encoding genes are expressed (Fig. 10C, G), implying their important functions in pollen development. Six AG peptide-encoding genes (AtAGP16, AtAGP21, AtAGP22, AtAGP23, AtAGP24, and AtAGP40) are also found to be expressed in mature pollen by transcriptomic analysis of GPI-anchored protein genes (Lalanne ). To sum up, these results suggest that classical AGPs, AG peptides, and group III FLAs may play an important role in rice anther and pollen development.
Table 4.

Comparison between rice AGPs and their Arabidopsis homologues

SubfamilyGroupsRiceArabidopsisKnown function
CFig. 9AOsAGP7 OsAGP10AtAGP6 AtAGP11Pollen development
KCFig. 9BAtAGP17Agrobacterium infection
AtAGP18Macrosporogenesis
OsAGP12 OsAGP13AtAGP19Cell division
NCFig. 9DAtAGP30Root regeneration
AtAGP31Defence response
nsLTP IFig. 9EOsLLA1 OsLLA7AtXYP1Vascular differentiation
OsLLA2 OsLLA6AtXYP2Vascular differentiation
Fas IIIFig. 9FOsFLA12 OsFLA26AtFLA4Cell expansion
Comparison between rice AGPs and their Arabidopsis homologues However, the physiological and biochemical processes and molecular mechanism of how AGPs regulated anther and pollen development are still unclear. The complex structure of the sugar chains of AGPs makes them a potential source of small signalling molecules, such as biologically active oligosaccharides. A novel tetrasaccharide that accumulated specifically in rice anthers during microsporogenesis, with a structure similar to a tetrasaccharide unit found in the glycan chain of AGP, played an important role in the development of rice anther (Kawaguchi ). It is notable that AGPs might be partially deglycosylated by glycosidases after being glycosylated in endoplasmic reticulum, and the released oligosaccharides might execute the function of AGPs, such as the tetrasaccharide mentioned above. Furthermore, the expression and activity of AGP-specific glycosylhydrolases might therefore provide mechanisms to control the biological activity of AGPs. The Lys-rich AGP-encoding genes are essential for Arabidopsis: AtAGP17, AtAGP18, and AtAGP19 participate in Agrobacterium infection, macrosporogenesis, and cell division, respectively (Acosta-Garcia and Vielle-Calzada, 2004; Gaspar ; Yang ). There are two Lys-rich AGP-encoding genes in rice, OsAGP12 and OsAGP13, which are similarly expressed and phylogenetically close to AtAGP19 (Figs 4C, 10B), indicating that OsAGP12 and OsAGP13 may function like AtAGP19 which controls cell division (Table 4). Moreover, the EST expression profile of OsAGP13 shows a specific expression in the SAM, indicating that it may control the cell division of the SAM. The situation for group I nsLTP-like AGPs is more complicated. Double knockouts of two xylogen-encoding genes (AtXYP1 and AtXYP2) in Arabidopsis show defects of vascular development (Motose ). There are two homologues of AtXYP1 in rice, OsLLA1 and OsLLA7, and two homologues of AtXYP2 in rice, OsLLA2 and OsLLA6, respectively (Fig. 5E). OsLLA1, OsLLA7, and AtXYP1 are similarly expressed in root, flower, and seed; OsLLA2, OsLLA6, and AtXYP2 are highly expressed in root. OsLLA1 and OsLLA7 are gene pairs from chromosomal duplication (Fig. 6). The redundancy of group I nsLTP-like AGPs will increase the difficulties in identifying their functions. OsFLA12 and OsFLA26 are gene pairs from chromosomal duplication; OsFLA12 show similar expression patterns to its Arabidopsis homologue, AtFLA4 (SOS5), but OsFLA26 is expressed in flower. OsFLA12 may be the rice homologue of SOS5, and OsFLA26 may change its function in the course of evolution.

Phytohormone- and abiotic stress-responsive AGP-encoding genes

ABA is an important hormone that regulates many downstream genes via ABREs (ABA-responsive elements), and is involved in response to abiotic stresses throughout the plant kingdom (Busk and Pages, 1998). Several classical AGPs (OsAGP1), AG peptides (OsAGP15), and eNod-like AGPs (OsELA3) are responsive to ABA and drought and salt stresses. The relative expression levels of OsELA3 in young seedlings under drought and salt treatments are >400 and 200 times those of non-treated seedlings, respectively, suggesting that it responds markedly to drought and salt stresses, and OsAGP1 and OsAGP15 were significantly up-regulated by drought and salt stresses (Fig. 8). Moreover, OsAGP1, OsAGP15, and OsELA3 are also up-regulated by ABA, indicating that they may be stress-inducible genes and participate in the ABA signalling pathway (Fig. 9A, B, D). The findings are consistent with those in Arabidopsis, such as the atagp30 mutant which shows suppression of the ABA-induced delay in germination and altered expression of some ABA-regulated genes (van Hengel and Roberts, 2003), and a non-classical AGP gene, AtAGP31, which showed a decreased mRNA level in response to ABA treatment (Liu and Mehdy, 2004). The molecular mechanism by which AGPs are involved in drought and salt responses remains to be elucidated. It is probable that the polysaccharide chains of AGPs which could be deglycosylated by glycosidases might be a source of oligosaccharides, and these oligosaccharides might increase the intracellular osmotic pressure, and reduce the speed of dehydration. The expression level of an AG peptide gene, OsAGP20, was down-regulated by drought and salt stresses, but up-regulated by cold stress. It had been reported that a tetrasaccharide with similar structural characters to the sugar chains of AGP might play an important role in both the development of anther and its response to chilling (Kawaguchi ), and AGPs that were induced by cold stresses, such as OsAGP20, might be a potential resource which could be degraded into tetrasaccharides. It had been previously reported that GA signalling was specifically inhibited by β-GlcY treatment in barley aleurone protoplasts (Mashiguchi ). Microarray analysis of β-GlcY-treated aleurone cells revealed that β-GlcY was largely effective in repressing GA-induced gene expression. It is also probable that AGPs are involved in the perception of stimuli causing defence responses. Five genes (AtAGP5, AtAGP11, OsFLA7, OsFLA11, and AtFLA21) were significantly up-regulated by GA (Fig. 11), indicating that GA plays important roles in regulation of the functions of AGPs. CsAGP1, a GA-responsive classical AGP, was isolated from cucumber hypocotyls (Park ). However, direct proof of how AGPs work as regulators that control the expression of GA-induced genes is still lacking.

Conclusion

The detailed structural, phylogenetic, and expression analyses in this study provide insights into the functions of AGPs in rice. For example, several AGP-encoding genes displayed specific expression during various stages of panicle and seed development and were induced by ABA, GA, and abiotic stresses, implying their important functions in the processes of reproductive development, ABA and GA signalling pathways, and abiotic stress responses. Moreover, genes of orthologous groups are similarly expressed in rice and Arabidopsis, suggesting conserved roles for AGP-encoding genes between monocots and dicots. In the future, reverse genetics methods such as RNAi and T-DNA insertion mutants could be used for functional analysis of rice AGP-encoding genes. However, gene redundancies should be considered, as several gene pairs with similar expression patterns and a high degree of homology might have overlapping functions. Therefore, specific RNAi constructs that silence two genes at a time could be used and sometimes double and triple mutants should be screened for a notable phenotype. Extensive studies will improve our understanding of the functions of AGPs in relation to various aspects of reproductive development and abiotic stresses in monocots.

Supplementary data

Supplementary data are available at JXB online. Primers used in real-time PCR. Proline, alanine, serine, and threonine (PAST) proportion of rice AGPs. EST expression profile analysis of rice AGP-encoding genes. MPSS analysis of rice AGP-encoding genes in various organs and tissues. Expression analysis of rice and Arabidopsis AGP-encoding genes in various developmental stages and under phytohormone and abiotic stress treatments.
  61 in total

1.  A relationship between seed development, Arabinogalactan-proteins (AGPs) and the AGP mediated promotion of somatic embryogenesis.

Authors:  Arjon J Van Hengel; Ab Van Kammen; Sacco C De Vries
Journal:  Physiol Plant       Date:  2002-04       Impact factor: 4.500

2.  Putative fasciclin-like arabinogalactan-proteins (FLA) in wheat (Triticum aestivum) and rice (Oryza sativa): identification and bioinformatic analyses.

Authors:  Ahmed Faik; Jaouad Abouzouhair; Fathey Sarhan
Journal:  Mol Genet Genomics       Date:  2006-08-31       Impact factor: 3.291

3.  Localization of an arabinogalactan protein epitope and the effects of Yariv phenylglycoside during zygotic embryo development of Arabidopsis thaliana.

Authors:  Ying Hu; Yuan Qin; Jie Zhao
Journal:  Protoplasma       Date:  2006-10-06       Impact factor: 3.356

4.  Synthetic genes for glycoprotein design and the elucidation of hydroxyproline-O-glycosylation codes.

Authors:  E Shpak; J F Leykam; M J Kieliszewski
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-21       Impact factor: 11.205

5.  A pollen tube growth stimulatory glycoprotein is deglycosylated by pollen tubes and displays a glycosylation gradient in the flower.

Authors:  H M Wu; H Wang; A Y Cheung
Journal:  Cell       Date:  1995-08-11       Impact factor: 41.582

6.  The classical arabinogalactan protein gene family of arabidopsis.

Authors:  C J Schultz; K L Johnson; G Currie; A Bacic
Journal:  Plant Cell       Date:  2000-09       Impact factor: 11.277

7.  Overexpression of tomato LeAGP-1 arabinogalactan-protein promotes lateral branching and hampers reproductive development.

Authors:  Wenxian Sun; Marcia J Kieliszewski; Allan M Showalter
Journal:  Plant J       Date:  2004-12       Impact factor: 6.417

8.  An attack of the plant parasite Cuscuta reflexa induces the expression of attAGP, an attachment protein of the host tomato.

Authors:  Markus Albert; Xana Belastegui-Macadam; Ralf Kaldenhoff
Journal:  Plant J       Date:  2006-11       Impact factor: 6.417

9.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

10.  Genome-wide identification, structure and expression studies, and mutant collection of 22 early nodulin-like protein genes in Arabidopsis.

Authors:  Kiyoshi Mashiguchi; Tadao Asami; Yoshihito Suzuki
Journal:  Biosci Biotechnol Biochem       Date:  2009-11-07       Impact factor: 2.043

View more
  68 in total

1.  Expression dynamics of metabolic and regulatory components across stages of panicle and seed development in indica rice.

Authors:  Rita Sharma; Pinky Agarwal; Swatismita Ray; Priyanka Deveshwar; Pooja Sharma; Niharika Sharma; Aashima Nijhawan; Mukesh Jain; Ashok Kumar Singh; Vijay Pal Singh; Jitendra Paul Khurana; Akhilesh Kumar Tyagi; Sanjay Kapoor
Journal:  Funct Integr Genomics       Date:  2012-03-31       Impact factor: 3.410

Review 2.  Arabinogalactan proteins in root and pollen-tube cells: distribution and functional aspects.

Authors:  Eric Nguema-Ona; Sílvia Coimbra; Maïté Vicré-Gibouin; Jean-Claude Mollet; Azeddine Driouich
Journal:  Ann Bot       Date:  2012-07       Impact factor: 4.357

3.  Pipeline to Identify Hydroxyproline-Rich Glycoproteins.

Authors:  Kim L Johnson; Andrew M Cassin; Andrew Lonsdale; Antony Bacic; Monika S Doblin; Carolyn J Schultz
Journal:  Plant Physiol       Date:  2017-04-26       Impact factor: 8.340

4.  Genome-wide identification, classification and expression analysis of genes encoding putative fasciclin-like arabinogalactan proteins in Chinese cabbage (Brassica rapa L.).

Authors:  Li Jun; Wu Xiaoming
Journal:  Mol Biol Rep       Date:  2012-10-09       Impact factor: 2.316

5.  Identification, duplication, evolution and expression analyses of caleosins in Brassica plants and Arabidopsis subspecies.

Authors:  Yue Shen; Mingzhe Liu; Lili Wang; Zhuowei Li; David C Taylor; Zhixi Li; Meng Zhang
Journal:  Mol Genet Genomics       Date:  2016-01-19       Impact factor: 3.291

6.  Abscisic acid- and stress-induced highly proline-rich glycoproteins regulate root growth in rice.

Authors:  I-Chieh Tseng; Chwan-Yang Hong; Su-May Yu; Tuan-Hua David Ho
Journal:  Plant Physiol       Date:  2013-07-25       Impact factor: 8.340

7.  Evaluation of the presence of arabinogalactan proteins and pectins during Quercus suber male gametogenesis.

Authors:  Mário Luís Costa; Rómulo Sobral; Maria Manuela Ribeiro Costa; Maria Isabel Amorim; Sílvia Coimbra
Journal:  Ann Bot       Date:  2014-11-30       Impact factor: 4.357

8.  Roles of arabinogalactan proteins in cotyledon formation and cell wall deposition during embryo development of Arabidopsis.

Authors:  Jing Zhong; YuJun Ren; Miao Yu; TengFei Ma; XueLian Zhang; Jie Zhao
Journal:  Protoplasma       Date:  2010-09-10       Impact factor: 3.356

9.  Machine learning approaches distinguish multiple stress conditions using stress-responsive genes and identify candidate genes for broad resistance in rice.

Authors:  Rafi Shaik; Wusirika Ramakrishna
Journal:  Plant Physiol       Date:  2013-11-14       Impact factor: 8.340

10.  The putative phytocyanin genes in Chinese cabbage (Brassica rapa L.): genome-wide identification, classification and expression analysis.

Authors:  Jun Li; Guizhen Gao; Tianyao Zhang; Xiaoming Wu
Journal:  Mol Genet Genomics       Date:  2012-12-02       Impact factor: 3.291

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.