Literature DB >> 35415619

Proteomic analysis of 2-chloroethanol extracts of rice (Oryza sativa L.) seeds.

Songlin Ruan1,2, Wenfei Xiao1, Jieren Qiu1, Weimin Hu3, Wu Ying1, Huizhe Chen2, Jianxin Tong1, Huasheng Ma1.   

Abstract

Ethanol-soluble proteins, including prolamins, are one of the most important seed proteins in rice (Oryza sativa L.). However, little is known about the proteomic profile of ethanol-soluble protein fraction extracted from rice grain. In this work, the differential profile of ethanol-soluble proteins extracted by 2-chloroethanol and ethanol has been documented. Proteome analysis utilizing LC-MS/MS identified a total of 64 unique proteins in the 2-chloroethanol extract of rice seeds. The majority of these proteins had low molecular weight ranging from 10 to 25 kD and isoelectric point (pI) in mid-acidic (pH 5-pH 7) and mid-basic (pH 7-pH 9) ranges. Database searches combined with transmembrane domain (TMD) analysis revealed that >70% of identified proteins were hydrophobic, i.e., had at least one TMD. Gene ontology classification and enrichment analysis showed that the identified proteins were involved in13 types of biological processes, 5 types of cell components, and 17 types of molecular functions. These results were significant based on the hyper p-value of <0.05. The most frequent categories of biological processes, cell components, and molecular functions were, respectively, type I hypersensitivity, extracellular space and extracellular region, and serine-type endopeptidase inhibitor activity. Interestingly, in addition to seed storage proteins such as prolamins and glutelins, certain allergen proteins, protease inhibitors, and lipid transfer proteins were identified in the extracts. Together, the collected data provide novel insights into the protein profile of 2-chloroethanol extract of rice seeds.
© 2020 The Authors.

Entities:  

Keywords:  Ethanol-soluble protein; Protease inhibitors; Proteomic analysis; Rice seed; Seed storage protein

Year:  2020        PMID: 35415619      PMCID: PMC8991595          DOI: 10.1016/j.fochms.2020.100002

Source DB:  PubMed          Journal:  Food Chem (Oxf)        ISSN: 2666-5662


Introduction

Rice (Oryza sativa L.) is one of the most important cereal crops worldwide, and a food staple in many countries (Fei et al., 2019; Wang et al., 2019). Besides starch, rice grains contain abundant storage proteins, accounting for 8–13% of their dry weight. Based on the solubility in various solvents, rice seed storage proteins are classified into three groups: acid/alkaline-soluble glutelins, alcohol-soluble prolamins, and saline-soluble α-globulin (Yamagata & Tanaka, 1986). They account for, respectively, 60%–80%, 20%–30%, 2%–8% of the total protein content of rice grain (Kim, Lee, Yoon, Lim, & Kim, 2013). Prolamins are the second most abundant class of endosperm proteins in rice and are stored mostly in type I of protein body (PB-I)derived from the ER, whereas glutelins and α-globulin are accumulated in irregularly-shaped PB-II bodies derived from the protein storage vacuole (PSV) (Kim et al., 2013). The prolamins in PB-I are mainly composed of 13 kD polypeptides, while 10 kD and 16 kD polypeptides are relatively rare. Although prolamins are difficult to be digested, they affect the textural and pasting properties of rice flour and nutritional quality of rice grain (Baxter, Blanchard, & Zhao, 2004; Kim et al., 2013). The commonly used solvents for rice prolamins are the mixtures of ethanol and water (e.g., 50–70% (v/v) ethanol) or 2-propanol and water (e.g., 55–70%(v/v) 2-propanol), with or without the addition of reducing agents (Gorinstein, Nue, & Arruda, 1991). Interestingly, pure 2-propanol is also an effective solvent for the extraction of prolamin from rice flour (Baxter et al., 2004). In other cereals, such as wheat and barley, 2-chloroethanol is frequently used for the extraction of prolamin and has been successfully used for cultivar identification (Cooke, 1984; Weiss, Postel, & Görg, 1991). However, 2-chloroethanol is rarely used in the extraction of rice prolamin. Ethanol-soluble cereal proteins are extremely difficult to purify and characterize due to their low solubility and tendency to polymerize. These proteins are most often separated by the reverse phase HPLC (RP-HPLC) due to the high resolution of this technique (Bietz, 1983; Paulis & Bietz, 1986; Pernollet et al., 1989). In addition to RP-HPLC, two types of 1-D polyacrylamide gel electrophoresis (PAGE), including SDS-PAGE and acid-PAGE, are also effective in separating ethanol-soluble proteins for the analysis of genetic diversity. In particular, acid-PAGE has been frequently used for cultivar identification (Bean & Lookhart, 2000; Cooke, 1984; Weiss et al., 1991). Interestingly, the combination of gel electrophoresis and chromatographic techniques led to the development of capillary zone electrophoresis (CZE) in free solution (Tsuji, 1991), which has been widely used as a powerful tool for the separation of ethanol-soluble seed proteins of cereal crops and tobacco. The broad application of CZE results from its higher resolution and shorter analysis time than that offered by high-performance liquid chromatography (HPLC) and gel electrophoresis (Deng, Lu, & Yan, 1998; Lookhart & Bean, 1995). Since the limited protein solubility can be overcome by the addition of ethanol and organic solvents to focusing buffer, isoelectric focusing (IEF) has been used to separate storage proteins from several cereals (Bean & Lookhart, 2000). Görg and coworkers developed a technique to generate immobilized pH gradients in the first dimension (IPG-Dalt) of 2-D electrophoresis (2-DE) (Görg, Postel, Baumer, & Weiss, 1992), which has been applied in the discrimination of barley cultivars with different malting grades. However, given its technical limitations, such as poor reproducibility, low sensitivity, and long duration, 2-DE is rarely used in the separation of ethanol-soluble proteins and cultivar identification in cereals. In cereals, ethanol-soluble protein fractions extracted from seeds were generally thought to represent storage proteins. However, in addition to storage proteins, seeds contain two other groups of proteins: structural and metabolic proteins, and protective proteins (Shewry & Halford, 2002). Therefore, other ethanol-soluble proteins, such as structural and metabolic proteins and protective proteins, may also be present in the ethanol-soluble protein fraction. For example, an earlier study demonstrated that besides avenins, the true prolamins, three low-molecular weight proteins, including two α-amylase/trypsin inhibitors and one novel protein, were also found in the ethanol-soluble protein fraction extracted from the oat grains (Pernollet et al., 1989). With the recent development of proteomic techniques, particularly with the significant improvement in the sensitivity, mass accuracy, and fragmentation in the field of mass spectrometry (MS), proteomic strategies became widely adopted (Walther & Mann, 2010) and accelerated the progress in seed proteomics research. Thus far, synthesis and processing of seed storage proteins have been a major focus of seed proteomics (Miernyk & Hajduch, 2011). Similarly, a significant amount of research has been devoted to the proteomic analysis of the development and germination of rice seeds (Cheng et al., 2017; Deng, Gong, & Wang, 2013; Koller et al., 2002; Komatsu, Konishi, Shen, & Yang, 2003; Wang, Liu, Song, & Møller, 2015; Yang et al., 2007). In addition, 2-DE and MALDI-TOF-MS identified 302 proteins in mature rice seeds (Yang et al., 2013). However, little is known about the proteomic profile of ethanol-soluble protein fraction extracted from rice seeds. Therefore, the present investigation was focused on the analysis of the proteomic profile of 2-chloroethanol extract of rice seeds using LC-MS/MS. First, the profile of ethanol-soluble proteins from rice seeds extracted with 2-chloroethanol/water (25% v/v) was compared with that obtained with ethanol/water (50% v/v) using SDS-PAGE. Subsequent proteomic analysis revealed that besides seed storage proteins, including prolamins and glutelins, many metabolic and functional proteins, particularly allergens and protease inhibitors, were identified among the extracted proteins. These findings provide new insights into the protein profile of 2-chloroethanol extract of rice seeds.

Materials and methods

Preparation of rice seeds and 2-chloroethanol extraction

Seeds of rice (O. sativa cv. Aichi asahi) were supplied by China National Rice Research Institute (Fuyang, Zhejiang province, China). Rice grains were de-hulled and manually ground into dry powder. A sample of 1 g of dry powder was transferred to a 50 ml centrifuge tube and mixed with 50 ml of 25% (v/v) 2-chloroethanol (Shanghai Guoyao Group Chemical Reagent Co., Ltd., China) or 50% (v/v) ethanol (Shanghai Guoyao Group Chemical Reagent Co., Ltd., China). The tube was rocked on a shaker (THZ-D, Jiansu, China) at 150 rpm overnight and then centrifuged for 15 min at 13,000g. The supernatant was transferred into another centrifuge tube, and a 12 μl sample was taken to determine the protein concentration using the Bradford method (Coomassie Protein Assay Kit, Bio-Rad, Hercules, CA, USA). Bovine serum albumin was used as the standard.

SDS-PAGE analysis

A sample of 20μg of ethanol-soluble proteins was loaded onto 12% SDS-PAGE gel. Molecular weight markers (Bio-Rad)were used to measure the size of bands corresponding to ethanol-soluble proteins. Constant voltage was set at 100 V, and the run time was approximately 1 h. The gel was stained overnight in 0.12% (w/v) colloidal Coomassie brilliant blue G-250, 10% (w/v) phosphoric acid, 10% (w/v) ammonium sulfate, and 20% (v/v) methanol. The gel was then destained in distilled water until clear bands appeared, and scanned to acquire gel image using a GS-800 imaging densitometer (Bio-Rad).

Protein digestion

Protein samples (100 μg) were added to 50 mM NH4HCO3 to the final volume of 150 μl. Protein samples were added to 100 mM dithiothreitol (DTT) and a final concentration of DTT was 10 mM, mixed at 600 rpm for 1 min, and incubated at 37 °C for 1 h. Iodoacetic acid (IAA, 0.5 M) was added to reach the final concentration of 50 mM, mixed at 600 rpm for 1 min, and stored at room temperature in the dark for 30 min. Protein mixture was filtered through a 10 kD cut-off membrane and centrifuged at 14,000g for 15 min at 4 °C. A total of 150 μl 50 mM NH4HCO3 was added, centrifuged at 14,000 ×g for 15 min; this step was repeated twice. A 100 μl aliquot of trypsin buffer (2 μg trypsin in 100 μl NH4HCO3 buffer) was added, and the sample was digested for 16–18 h at 37 °C. Filtrates were collected after centrifugation at 14,000 ×g for 10 min, addition of 100 μl NH4HCO3, and repeated centrifugation at 14,000 ×g for 10 min. The generated peptides were desalted using a C18 column (Thermo Fisher Scientific, Waltham, MA, USA).

LC-MS/MS

The mixture of tryptic peptides was desalted and separated by the Ettan™ MDLC system (GE Healthcare, Chicago, IL, USA), utilizing RP trap columns (Zorbax 300 SB C18, Agilent Technologies, Santa Clara, CA, USA) for desalting and RP columns (150 μm i.d., 100 mm length, Column Technology Inc., Fremont, CA, USA) for the separation. Mobile phase A consisted of 0.1% formic acid in HPLC-grade water, and mobile phase B consisted of 0.1% formic acid in acetonitrile. A sample containing 20 μg of tryptic peptide mixture was loaded onto the columns, and the separation was done at a flow rate of 2 μl/min using a linear gradient of 4–50% B for 120 min. A Finnigan™ LTQ™ linear ion trap MS (Thermo Electron, Madison, WI, USA) equipped with an electrospray interface was connected to the LC setup for the detection of eluted peptides. Data-dependent MS/MS spectra were obtained simultaneously. Each scan cycle consisted of one full MS scan in the profile mode followed by five MS/MS scans in the centroid mode with the following Dynamic Exclusion™ settings: repeat count 2, repeat duration 30 s, exclusion duration 90 s. Each sample was analyzed in triplicate. MS/MS spectra were automatically searched against the non-redundant protein database from the NCBI (version 20100612) using the BioworksBrowser rev. 3.1.The peptides were constrained to be tryptic, and up to two missed cleavages were allowed. Carbamidomethylation of cysteine residues was treated as a fixed modification, whereas oxidation of methionine residues was considered as a variable modification. The mass tolerance allowed for the precursor ions was 2.0 Da and 0.2 Da for fragment ions. The protein identification criteria were based on Delta CN (≥0.1) and cross-correlation scores (Xcorr, one charge≥1.9, two charges ≥2.2, three charges ≥3.75).

Bioinformatic analysis of the identified proteins

Transmembrane regions in all identified proteins were predicted using the TMpred program from the ExPASy site (http://www.ch.embnet.org/software/TMPRED_form.html). The algorithm employed is based on the statistical analysis of TM base, a database of naturally occurring transmembrane proteins as described by Hofmann and Stoffel (1993). The Gene Ontology (GO) enrichment analysis of the identified proteins was performed by searching the Rice Oligonucleotide Array Database (ROAD) (http://www.ricearray.org, version 20130122). To determine the confidence of the GO enrichment results, the hyper p-value of <0.05 was considered statistically significant. The GO classification of these proteins was illustrated using the Sigmaplot software, version 10.0 (Systat Software, San Jose, CA, USA). KEGG enrichment analysis of the identified proteins was performed in DAVID 6.8 (https://david.ncifcrf.gov/).

Results

Protein profile and identification of 2-chloroethanol extract of rice seeds

SDS-PAGE demonstrated that the protein profile of 2-chloroethanol extract of rice seeds was distributed in the molecular weight range of 10–66 kD. Most proteins were in the low molecular weight range, 10–25 kD. The differences in the profile of ethanol-soluble proteins between 2-chloroethanol/water (25% v/v) and ethanol/water (50% v/v) extracts were also compared. As shown in Fig. 1, under the same loading conditions, the 2-chloroethanol/water extract appeared to have a larger amount of ethanol-soluble proteins, particularly of low molecular weight (10–25 kD) than the ethanol/water extract. This finding indicates a different profile of ethanol-soluble proteins between 2-chloroethanol/water and ethanol/water extracts.
Fig. 1

Protein profile of 2-chloroethanol extract from rice seeds.

20μg of ethanol-soluble proteins2-chloroethanol extract from rice seeds were loaded onto 1-D SDS-PAGE gel (12% gel). Gels were stained overnight with colloidal Coomassie brilliant blue. Numbers on the left represent apparent molecular masses. CE and ET denote 2-chloroethanol/water (25% v/v) and ethanol/water (50% v/v), respectively.

Protein profile of 2-chloroethanol extract from rice seeds. 20μg of ethanol-soluble proteins2-chloroethanol extract from rice seeds were loaded onto 1-D SDS-PAGE gel (12% gel). Gels were stained overnight with colloidal Coomassie brilliant blue. Numbers on the left represent apparent molecular masses. CE and ET denote 2-chloroethanol/water (25% v/v) and ethanol/water (50% v/v), respectively. Identification of ethanol-soluble proteins in rice seeds by LC MS/MS. Accession no.: accession number in NCBI database. Locus ID: locus identification. MW(kD): match molecular weight. pI: Isoelectric point values. None: not available. Subsequently, proteins obtained by extraction with 2-chloroethanol were digested with trypsin to obtain a mixture of peptides. A total of 1940 unique peptides were detected using LC-MS/MS (Table 1S). By database search and match, 64 unique proteins were identified (Table 1).
Table 1

Identification of ethanol-soluble proteins in rice seeds by LC MS/MS.

No.Accession no.aRAP lociLocus IDbMW (kD)cpIdProtein name
1Q01882Os07g0214300LOC_Os07g11380.217,8658.06RAL4 - Seed allergenic protein RA5/RA14/RA17 precursor, expressed
2BAF21097.1Os07g0213800LOC_Os07g11330.117,2708.72RAL2 - Seed allergenic protein RA5/RA14/RA17 precursor, expressed
3P29421.2Os04g0526600LOC_Os04g44470.121,4188.66KUN1 - Kunitz-type trypsin inhibitor precursor, expressed
4BAF26923.1Os10g0505900LOC_Os10g36180.145,5824.74Expressed protein
5BAA01998.1NoneeNone17,8888.06Allergenic protein
6Q01883Os07g0214100LOC_Os07g11360.117,5686.92RAL3 - Seed allergenic protein RA5/RA14/RA17 precursor, expressed
7BAF21100.1Os07g0214600LOC_Os07g11410.116,9838.36RAL5 - Seed allergenic protein RA5/RA14/RA17 precursor, expressed
8BAF08359.1Os02g0250600LOC_Os02g15250.147,3156.4Late embryogenesis abundant domain-containing protein, putative, expressed
9BAA01996.1NoneNone17,1188.74Allergenic protein
10Q01881Os07g0215500LOC_Os07g11510.117,2938.36RAL6 - Seed allergenic protein RA5/RA14/RA17 precursor, expressed
11BAA00800.1NoneNone15,0815.92Copper/zinc-superoxide dismutase
12BAF22321.1Os07g0638300LOC_Os07g44430.124,0425.97Peroxiredoxin, putative
13BAF04641.1Os01g0276300LOC_Os01g16920.124,4856.07Embryonic protein DC-8, putative, expressed
14BAF25418.1Os09g0482600LOC_Os09g30439.164,4985.58Heat shock protein, putative
15BAF12746.1Os03g0663400LOC_Os03g46060.122,7667.37Thaumatin family domain containing protein, expressed
16BAF23472.1Os08g0327700LOC_Os08g23870.115,1639.1Late embryogenesis abundant group 1, putative, expressed
17BAA07772.1NoneNone12,3358.74Allergenic protein
18P29835.1Os05g0499100LOC_Os05g41970.121,0557.48SSA1 - 2S albumin seed storage family protein precursor, expressed
19BAF10944.1Os03g0159600LOC_Os03g06360.227,1904.95Late embryogenesis abundant protein D-34, putative, expressed
20BAF18109.1Os05g0542500LOC_Os05g46480.420,5145.89Late embryogenesis abundant protein, group 3, putative, expressed
21BAF11884.2Os03g0322900LOC_Os03g20680.136,7916.39Late embryogenesis abundant protein 1, putative, expressed
22BAF23070.1Os08g0189100LOC_Os08g08960.123,7126.4Cupin domain containing protein, expressed
23BAF12032.1Os03g0351500LOC_Os03g22810.115,2515.71Copper/zinc superoxide dismutase, putative, expressed
24BAF12732.2Os03g0659300LOC_Os03g45720.121,9308.88Glyoxalase family protein, putative, expressed
25EAZ29742.1NoneNone54,0029.06Hypothetical protein OsJ_013225
26EAZ10632.1NoneNone11,9987.72Hypothetical protein OsJ_000457
27P07728.1Os01g0762500LOC_Os01g55690.156,2479.09Glutelin, putative, expressed (Glutelin type I)
28BAF08683.1Os02g0453600LOC_Os02g25640.257,1208.96Glutelin, putative, expressed
29BAF22692.1Os08g0104400LOC_Os08g01370.110,4905.13Expressed protein
30BAA00462.1NoneNone56,3208.93Prepro-glutelin
31BAF12166.1Os03g0385400LOC_Os03g26820.116,8998.62LTPL52 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
32BAF21096.1Os07g0213600LOC_Os07g11310.115,8916.13LTPL166 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
33BAF21106.1Os07g0216700LOC_Os07g11650.116,4777.48LTPL164 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
34BAF24082.1Os08g0502400LOC_Os08g39270.128,3548.93Fasciclin domain containing protein, expressed
35P14323.1Os02g0249900None56,5519.26Glutelin type-B 1 precursor
36BAF28520.1Os11g0582400LOC_Os11g37270.147,3856.04AMBP1 - Antimicrobial peptide MBP-1 family protein precursor, expressed
37EAZ26837.1NoneNone61,4397.18Hypothetical protein OsJ_010320
38EAZ12939.1NoneNone8533.14.78Hypothetical protein OsJ_002764
39BAF21113.1Os07g0222000LOC_Os07g12080.115,7677.48LTPL169 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
40P30287.1Os01g0702500LOC_Os01g50700.123,2366.64Dehydrin family protein, expressed Dehydrin Rab25 (Drought-resistant protein 1) (rDRP1)
41P46520.1Os05g0349800LOC_Os05g28210.110,1655.57Embryonic abundant protein 1
42BAF16634.2Os05g0160200LOC_Os05g06770.119,94610.0140S ribosomal protein S27a, putative, expressed
43P93407.1Os08g0561700LOC_Os08g44770.221,3015.79Copper/zinc superoxide dismutase, putative, expressed
44Q0JR25Os01g0124401LOC_Os01g03360.127,7905.38BBTI5 - Bowman-Birk type bran trypsin inhibitor precursor, expressed
45EAZ13254.1NoneNone22,1329.14Hypothetical protein OsJ_003079
46BAF10787.1Os03g0133100LOC_Os03g04080.112,6315.2Expressed protein
47Q6ERU3Os02g0268100LOC_Os02g16820.156,8089Glutelin, putative, expressed
48P55142Os04g0508300LOC_Os04g42930.211,7745.77OsGrx_C2.2 - glutaredoxin subgroup I, expressed
49BAF22828.1Os08g0129200LOC_Os08g03520.118,6946.28Retrotransposon protein, putative, Ty1-copia subclass, expressed
50EAZ38338.1NoneNone33,7657.37Hypothetical protein OsJ_021821
51BAF12898.1Os03g0699000LOC_Os03g49190.117,22210.1Oleosin, putative, expressed
52BAF21105.1Os07g0216600LOC_Os07g11630.116,4067.48LTPL163 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
53BAF22842.1Os08g0131200LOC_Os08g03690.112,0488.1LTPL24 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
54BAF29006.1Os12g0115100LOC_Os12g02320.211,3459.58LTPL12 - Protease inhibitor/seed storage/LTP family protein precursor, expressed
55BAF24481.1Os09g0109600LOC_Os09g02180.17847.65.58Expressed protein
56BAF12070.1Os03g0360700LOC_Os03g24600.114,6726.81Peptide methionine sulfoxide reductase msrB, putative, expressed
57BAF22161.1Os07g0609000LOC_Os07g41820.126,5007.11Stress responsive A/B Barrel domain containing protein, expressed
58EAZ11000.1NoneNone23,5974.73Hypothetical protein OsJ_000825
59BAF17132.1Os05g0330600LOC_Os05g26620.116,8838.79PROLM14 - Prolamin precursor, putative, expressed
60BAF11001.1Os03g0168100LOC_Os03g07180.341,8608.26Embryonic protein DC-8, putative, expressed
61BAF14618.1Os04g0404400LOC_Os04g33150.132,1074.91Desiccation-related protein PCC13-62 precursor, putative, expressed
62BAF13030.1Os03g0723400LOC_Os03g51350.118,0546.44Expressed protein
63AAT99435.1NoneNone40,4884.7Lysm-domain GPI-anchored protein
64BAF22821.1Os08g0127900LOC_Os08g03410.158,0468.72Glutelin, putative, expressed

Accession no.: accession number in NCBI database.

Locus ID: locus identification.

MW(kD): match molecular weight.

pI: Isoelectric point values.

None: not available.

Biochemical properties of identified proteins

As shown in Table 1, the molecular weights of all identified proteins were distributed in the range from 8.5 to 64.5 kD, with most proteins in the range from 10 to 25 kD (Fig. 2A). This result was consistent with the SDS-PAGE analysis of the 2-chloroethanol extract of rice seeds. Similarly, the pI values of all identified proteins ranged from 4.7 to 10.1, with the majority of them having mid-acidic (5–7) and mid-basic (7–9) pI (Fig. 2B).
Fig. 2

Distribution of size and isoelectric point all identified proteins from 2-chloroethanol extract.

A. Size distribution of all identified proteins is in a range of 8.5–64.5 kD in terms of minimal and maximal protein molecular weight, the majority of which are in a range of 10–25 kD. B. Isoelectric point distribution of all identified proteins is in a range of pH 4.7–10.1 in terms of minimal and maximal protein pI, the majority of which are in mid-acidic (5–7) and mid-basic (7–9) pH ranges. pI represents isoelectric point.

Distribution of size and isoelectric point all identified proteins from 2-chloroethanol extract. A. Size distribution of all identified proteins is in a range of 8.5–64.5 kD in terms of minimal and maximal protein molecular weight, the majority of which are in a range of 10–25 kD. B. Isoelectric point distribution of all identified proteins is in a range of pH 4.7–10.1 in terms of minimal and maximal protein pI, the majority of which are in mid-acidic (5–7) and mid-basic (7–9) pH ranges. pI represents isoelectric point. Database searches combined with transmembrane domain (TMD) analysis revealed that over 70% of the identified proteins were hydrophobic and had at least one TMD (Fig. 3). Proteins containing only one TMD were the most abundant class, accounting for 46% of all identified proteins. The second most abundant class, 17%, was represented by proteins containing two TMDs. Proteins containing three or more TMDs constituted the smallest class, accounting only for 10% of all proteins.
Fig. 3

Analysis of transmembrane domains of all identified proteins from 2-chloroethanol extract.

Over 70% of all identified proteins are hydrophobic, which have at least one TMD. Proteins which contain only one TMD are the most abundant class, accounting for 46% of all identified proteins. TMDs represent transmembrane domains.

Analysis of transmembrane domains of all identified proteins from 2-chloroethanol extract. Over 70% of all identified proteins are hydrophobic, which have at least one TMD. Proteins which contain only one TMD are the most abundant class, accounting for 46% of all identified proteins. TMDs represent transmembrane domains. GO classification and enrichment analysis of the identified proteins from 2-chloroethanol extract. The identified proteins were classified as being involved in13 kinds of biological processes, 5 kinds of cell components, and 17 kinds of molecular functions, which were significantly in accordance with hyper p value which is <0.05. Of these, type I hypersensitivity, extracellular space and extracellular region, and serine-type endopeptidase inhibitor activity enrich the first largest class of proteins. GO represents gene ontology.

KEGG (Kyoto Encyclopedia of Genes and Genomes) functional classification of identified proteins

As shown in Fig. 4, the GO (Gene Ontology) classification and enrichment analysis of the identified proteins was performed in terms of cellular component, molecular function, and biological processes by ROAD searching. The proteins were classified as belonging to 18 types of cell components in terms of GO annotation (Table 2S-1), but only 5 types were significant (hyper p-value of <0.05) (Table 2S-1). These components included extracellular space, extracellular region, periplasmic space, lipid particle, cell outer membrane, and monolayer-surrounded lipid storage body (Table 2S-1 and Fig. 4). The two largest classes of proteins were located in the extracellular space and extracellular region (Fig. 4).
Fig. 4

GO classification and enrichment analysis of the identified proteins from 2-chloroethanol extract.

The identified proteins were classified as being involved in13 kinds of biological processes, 5 kinds of cell components, and 17 kinds of molecular functions, which were significantly in accordance with hyper p value which is <0.05. Of these, type I hypersensitivity, extracellular space and extracellular region, and serine-type endopeptidase inhibitor activity enrich the first largest class of proteins. GO represents gene ontology.

Similarly, according to the GO annotation, the identified proteins were classified as having 28 types of molecular functions (Table 3S-1), of which 17 types were significant (hyper p-value of <0.05) (Table 3S-1). These 17 types of molecular functions were related to serine-type endopeptidase inhibitor activity, nutrient reservoir activity, ice binding, metal ion binding, lipid binding, antioxidant activity, superoxide dismutase activity, peptidase inhibitor activity, unfolded protein binding, copper ion binding, α-amylase inhibitor activity, peptide-methionine-(S)-S-oxide reductase activity, peroxiredoxin activity, endopeptidase inhibitor activity, acyl-CoA dehydrogenase activity, nucleic acid binding, and protein disulfide oxidoreductase activity. The most frequent category of molecular function among the identified proteins was serine-type endopeptidase inhibitor activity (Fig. 4). In addition to the cell component and molecular function categories, the GO annotation classified the identified proteins as being involved in21 types of biological processes. However, only 13 of them were significant (hyper p-value of <0.05) (Table 4S-1). These processes included type I hypersensitivity, response to freezing, homoiothermy, lipid transport, response to stress, protein folding, superoxide metabolic process, cell redox homeostasis, response to water, response to acidity, response to the abscisic acid stimulus, and embryo development (Table 4S-1 and Fig. 4). The molecular process annotated most frequently to the identified proteins was the type I hypersensitivity (Fig. 4). KEGG analysis showed that the identified proteins were enriched in the peroxisome pathway (osa04146) only. P93407 (superoxide dismutase [Cu—Zn], chloroplastic), P28757 (superoxide dismutase [Cu—Zn] 2), and Q0DRV6 (superoxide dismutase [Cu—Zn] 1) represented the only 3 proteins identified as involved in the peroxisome pathway, which functions in the removal of free radicals (Table S5).

Discussion

The present investigation compared the difference between the profiles of ethanol-soluble proteins from rice seeds extracted using 2-chloroethanol/water (25% v/v) and ethanol/water (50% v/v). Although in other cereals, such as wheat and barley, 2-chloroethanol/water is commonly used for the extraction of ethanol-soluble proteins for cultivar identification (Cooke, 1984; Weiss et al., 1991), it is less frequently employed than ethanol/water. The current results demonstrated that a difference exists between profiles of proteins extracted using 2-chloroethanol/water and ethanol/water. This difference is predominantly apparent in low-molecular-weight proteins, ranging in size from 10 to 25 kD (Fig. 1). This range corresponds exactly to the size of ethanol-soluble proteins, including prolamins. The current work used the MS-based shotgun proteomics to identify all tryptic peptides obtained from ethanol-soluble proteins. Prior to the MS analysis, these peptides were separated by RP-HPLC. RP-HPLC is commonly used for the separation of ethanol-soluble proteins due to its high resolution and its applicability to hydrophobic molecules (Bietz, 1983; Paulis & Bietz, 1986; Pernollet et al., 1989). Therefore, the MS-based shotgun proteomics can be considered a proper tool for identifying ethanol-soluble proteins. Here, a total of 64 unique proteins were identified using LC-MS/MS (Table 1), and over 70% of them were hydrophobic (Fig. 4). Interestingly, besides PROLM14, a true prolamin, over 40 ethanol-soluble proteins, are also found in these hydrophobic ones, which is similar to the result that besides avenins, three low-Mr proteins also appeared in oat ethanol-soluble protein fraction (Pernollet et al., 1989). In addition to prolamin, some glutelins are also present in the ethanol-soluble fraction of proteins from rice grains. It is usually suggested that glutelins represent a type of acid/alkaline-soluble storage proteins in rice seeds. Although the existence of alcohol-soluble glutelins in rice seeds has not been reported yet, they are present in sorghum seeds (Smith & Smith, 1988). Therefore, we raise the possibility that alcohol-soluble glutelins may exist in rice grains as well. In addition, it should be noted that only 51 proteins could have been used for GO enrichment analysis by ROAD since the locus IDs are not available for the remaining proteins (Table 1). In the GO enrichment analysis, a considerable fraction of proteins have been not functionally classified yet due to the absence of GO annotation. The largest number of un-annotated proteins, 32, is in the cell component type (Table 2S-2, 3), the smallest number, 18, is in the molecular function type (Table 3S-2, 3), and an intermediate number, 23, is in the biological process type (Table 4S-2, 3). The presence of un-annotated proteins might result in a certain bias of the entire GO enrichment analysis, but its validity was confirmed in the present work by statistically significant results of the analysis in ROAD. Recent studies show that ethanol/water extracts of rice seeds have antifungal activity (Bretanha, Kupsk, Garda-Buffon, Badiale-Furlong, & Pagnussatt, 2013; Pagnussatt, Bretanha, Garda-Buffon, & Badiale-Furlong, 2011; Pagnussatt, Bretanha, Meza, Garda-Buffon, & Badiale-Furlong, 2013). This effect is due to the presence of enzyme inhibitors in rice seed extracts, in particular, amylase inhibitors which can effectively inhibit the activity of fungal α-amylase (Bretanha et al., 2013; Pagnussatt et al., 2011; Pagnussatt et al., 2013). The present study identified protease inhibitors, seed allergen proteins, and lipid transfer proteins in the 2-chloroethanol extract of rice seeds. NCBI blast results show that seed allergen proteins belong to a class of α-amylase inhibitors. In addition, several antimicrobial proteins, such as thaumatin family domain-containing protein, Cupin domain-containing protein, and AMBP1 (Antimicrobial peptide MBP-1 (maize basic peptide 1) family protein precursor) were also detected in the 2-chloroethanol extract. Together, the accumulated data provide valuable insights into the properties of ethanol extracts of rice seeds.

Conclusions

The study documented the differential profile of proteins extracted from rice seeds by 2-chloroethanol and ethanol. Subsequent proteomic analysis revealed that besides storage proteins, including prolamins and glutelins, the extracted proteins contained some allergens and protease inhibitors. These proteins may provide antimicrobial and insect resistance and their identification may help discover candidate genes for controlling plant disease and pest insects, contributing to the improvement of rice breeding. The following are the supplementary data related to this article.

Table 1S

Unique peptides identified by LC MS/MS.

Table 2S

Cell components. Table 2S-1 GO enrichment analysis results of identified proteins assigned to various kinds of cell components. Table 2S-2 GO terms mapped onto the query. Table 2S-3 Genes unmapped by GO annotation.

Table 3S

Molecular functions. Table 3S-1 GO enrichment analysis results of identified proteins implicated in various kinds of molecular functions. Table 3S-2 GO terms mapped onto the query. Table 3S-3 Genes unmapped by GO annotation.

Table 4S

Biological processes. Table 4S-1 GO enrichment analysis results of identified proteins involved in various kinds of biological processes. Table 4S-2 GO terms mapped onto the query. Table 4S-3 Genes unmapped by GO annotation.

Table S5

KEGG pathway analysis of the identified proteins.

Declaration of competing interest

The authors have declared no conflict of interest.
  17 in total

Review 1.  Cereal seed storage proteins: structures, properties and role in grain utilization.

Authors:  Peter R Shewry; Nigel G Halford
Journal:  J Exp Bot       Date:  2002-04       Impact factor: 6.992

2.  Identification of genes involved in rice seed priming in the early imbibition stage.

Authors:  J Cheng; L Wang; P Zeng; Y He; R Zhou; H Zhang; Z Wang
Journal:  Plant Biol (Stuttg)       Date:  2016-03-01       Impact factor: 3.081

3.  Separation of cereal proteins by reversed-phase high-performance liquid chromatography.

Authors:  J A Bietz
Journal:  J Chromatogr       Date:  1983-01-21

4.  Barley cultivar discrimination: II. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and isoelectric focusing with immobilized pH gradients.

Authors:  W Weiss; W Postel; A Görg
Journal:  Electrophoresis       Date:  1991-05       Impact factor: 3.535

5.  Two-dimensional polyacrylamide gel electrophoresis, with immobilized pH gradients in the first dimension, of barley seed proteins: discrimination of cultivars with different malting grades.

Authors:  A Görg; W Postel; M Baumer; W Weiss
Journal:  Electrophoresis       Date:  1992-04       Impact factor: 3.535

6.  Rice proteomics: a step toward functional analysis of the rice genome.

Authors:  Setsuko Komatsu; Hirosato Konishi; Shihua Shen; Guangxiao Yang
Journal:  Mol Cell Proteomics       Date:  2003-01       Impact factor: 5.911

7.  Variation in kafirin and alcohol-soluble glutelin chromatograms of sorghum inbred lines revealed by reversed-phase high-performance liquid chromatography.

Authors:  J S Smith; O S Smith
Journal:  Theor Appl Genet       Date:  1988-07       Impact factor: 5.699

8.  High-performance capillary electrophoresis of proteins. Sodium dodecyl sulphate-polyacrylamide gel-filled capillary column for the determination of recombinant biotechnology-derived proteins.

Authors:  K Tsuji
Journal:  J Chromatogr       Date:  1991-07-26

9.  Proteomic analysis of rice (Oryza sativa) seeds during germination.

Authors:  Pingfang Yang; Xiaojuan Li; Xiaoqin Wang; Hui Chen; Fan Chen; Shihua Shen
Journal:  Proteomics       Date:  2007-09       Impact factor: 3.984

Review 10.  Mass spectrometry-based proteomics in cell biology.

Authors:  Tobias C Walther; Matthias Mann
Journal:  J Cell Biol       Date:  2010-08-23       Impact factor: 10.539

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.