Literature DB >> 21492454

Deep sequencing reveals the complex and coordinated transcriptional regulation of genes related to grain quality in rice cultivars.

Rc Venu1, Mv Sreerekha, Kan Nobuta, André Beló, Yuese Ning, Gynheung An, Blake C Meyers, Guo-Liang Wang.   

Abstract

BACKGROUND: Milling yield and eating quality are two important grain quality traits in rice. To identify the genes involved in these two traits, we performed a deep transcriptional analysis of developing seeds using both massively parallel signature sequencing (MPSS) and sequencing-by-synthesis (SBS). Five MPSS and five SBS libraries were constructed from 6-day-old developing seeds of Cypress (high milling yield), LaGrue (low milling yield), Ilpumbyeo (high eating quality), YR15965 (low eating quality), and Nipponbare (control).
RESULTS: The transcriptomes revealed by MPSS and SBS had a high correlation co-efficient (0.81 to 0.90), and about 70% of the transcripts were commonly identified in both types of the libraries. SBS, however, identified 30% more transcripts than MPSS. Among the highly expressed genes in Cypress and Ilpumbyeo, over 100 conserved cis regulatory elements were identified. Numerous specifically expressed transcription factor (TF) genes were identified in Cypress (282), LaGrue (312), Ilpumbyeo (363), YR15965 (260), and Nipponbare (357). Many key grain quality-related genes (i.e., genes involved in starch metabolism, aspartate amino acid metabolism, storage and allergenic protein synthesis, and seed maturation) that were expressed at high levels underwent alternative splicing and produced antisense transcripts either in Cypress or Ilpumbyeo. Further, a time course RT-PCR analysis confirmed a higher expression level of genes involved in starch metabolism such as those encoding ADP glucose pyrophosphorylase (AGPase) and granule bound starch synthase I (GBSS I) in Cypress than that in LaGrue during early seed development.
CONCLUSION: This study represents the most comprehensive analysis of the developing seed transcriptome of rice available to date. Using two high throughput sequencing methods, we identified many differentially expressed genes that may affect milling yield or eating quality in rice. Many of the identified genes are involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins. Some of the differentially expressed genes could be useful for the development of molecular markers if they are located in a known QTL region for milling yield or eating quality in the rice genome. Therefore, our comprehensive and deep survey of the developing seed transcriptome in five rice cultivars has provided a rich genomic resource for further elucidating the molecular basis of grain quality in rice.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21492454      PMCID: PMC3098810          DOI: 10.1186/1471-2164-12-190

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Rice is the staple food crop of more than 50% of the global population, and development of high yielding and high quality rice varieties is essential. Rice grain quality is assessed by its appearance and by its milling, cooking, eating, and nutritional quality [1-3]. Milling yield (the percentage of whole grain remaining after removal of the hulls and bran layers from paddy rice) is a very important characteristic that greatly affects profits for rice farmers. Milling yield or milling efficiency is determined based on the quality of the paddy rice, the milling equipment used and the skill of the mill operator. Milling yield is influenced by grain hardness, chalky area of the grain, grain size and shape, depth of surface ridges, bran thickness, and milling efficiency [4-7]. Agronomic and field managements also affect grain breakage during milling [5,8,9]. Rice eating quality is important because it determines the price of rice in the market. Eating quality is determined by water, protein, starch, and fat content [10-14]. Eating quality is negatively correlated with protein content, stickiness, and hardness of rice [10,11]. The main factors affecting both eating and cooking quality of rice are amylose content, gel consistency and gelatinization temperature [12,13,15,16]. Cooked rice with high amylose content is flaky, dry, hard and non-sticky while rice with low amylose content is sticky, moist, tender and glossy [12,13]. Developing cultivars with high milling yield and eating quality have been the main objectives in rice breeding programs in the last few decades. Milling yield and eating quality are complex traits controlled by quantitative trait loci (QTLs) [17]. In the last several years, many QTLs for eating quality have been mapped in the rice genome. For example, using chromosome segment substitution lines (CSSLs), Wan et al. [18] identified a total of 25 QTLs for nine eating quality traits. Many QTLs affecting different quality traits are mapped in the same chromosomal regions. Six QTLs are non-environment-specific and could be used for marker-assisted selection in rice quality improvement. Recently, Hao et al. [19] constructed 154 CSSLs for QTL mapping of quality traits. In that study, 10 QTLs for rice appearance traits and eight QTLs concerned with physico-chemical traits were detected. QTLs related to glossiness of cooked rice were identified in different genomic regions in Ilpumbyeo, a high grain quality rice in Korea [20]. The amylose content of rice is governed by the waxy (Wx) locus and mapped to chromosome 6 [21-23]. In contrast to the advances in genetic analysis of eating quality, less progress has been made on the genetic analysis of milling quality because the trait has low heritability and is sensitive to environmental factors [24,25]. Another challenge for milling yield analysis is that many mapping populations for milling yield had varied kernel shape among the individual lines and heterogeneity in grain dimensions confounds the assessment of genetic effects [9,24,26-31]. Recently, a mapping study identified six QTLs responsible for head rice (milling) yield using recombinant inbred lines (RILs) derived from crosses of common parent Cypress (high milling) with RT0034 (low milling) and LaGrue (low milling) [9]. The molecular and biochemical basis of grain quality in cereals have been studied in the last decade, and the biochemical processes and many participating genes in the synthesis of starch [32-34], storage proteins [35-39], and lysine within the aspartate family amino acid pathway [40] have been characterized in rice and other cereals. However, how the expression of these genes is coordinated and regulated during grain filling is still poorly understood. Recently, Tian et al. [41] demonstrated that starch synthesis-related genes form a fine network to control eating and cooking qualities by regulating amylose content, gel consistency, and/or gelatinization temperature, and through genetic modification of any of these starch synthesis-related genes, eating and cooking quality can be improved in rice. The expression of 44 genes participating in three pathways (the synthesis of starch, storage proteins, and lysine) during rice grain filling were examined by RT-PCR in the maternal line 93-11 and in the super-hybrid rice line Liang-You-Pei-Jiu (LYP9) [3]. The analysis revealed diverse yet coordinated expression profiles of genes involved in the three pathways in developing seeds. These unique expression patterns of the quality-related genes may influence the final composition and property of starch, protein, and lysine synthesis in rice seeds. Tools for whole-genome expression analysis like microarrays, serial analysis of gene expression (SAGE) and massively parallel signature sequencing (MPSS) have been widely used for transcriptome analysis in plants in last 10 years [42]. The sequencing-by-synthesis (SBS) second-generation sequencing method has been recently used for transcriptome analysis in many organisms because of its low cost and large sequencing output [43]. In this study, we used both MPSS and SBS to analyze the transcriptome of the developing rice seeds in five cultivars that differed in milling yield and eating quality. Many differentially expressed novel transcripts and genes involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins were identified. Promoter analysis revealed the presence of hundreds of novel conserved patterns of cis regulatory elements in the up-regulated genes and putative co-expressed genes in the rice cultivars with high milling yield and good eating quality. Our comprehensive and deep survey of the developing seed transcriptome in five rice cultivars has provided an excellent starting material for further elucidating the molecular and biochemical basis of milling and eating quality in rice.

Results

Characteristics of the MPSS and SBS libraries and their matching to the rice genome and to EST and full-length cDNA databases

Both MPSS and SBS tags are short cDNA tags or digital gene expression tags, which are mainly derived from the 3' regions of a transcript [44]. About 1.0 to 1.3 million 17-base MPSS signatures and about 2.0 to 4.0 million 20-base SBS signatures were obtained in the 10 libraries (Table 1). These signatures were clustered and then processed with reliability and significance filters as described by Meyers et al. [45] (Additional File 1). For comparison of the expression levels across the libraries, the frequency of signatures in the individual libraries was normalized to one million (transcripts per million or TPM) [45]. The number of distinct signatures ranged from 12,000 to 18,000 in the MPSS libraries and from 77,000 to 165,000 in the SBS libraries. The SBS libraries contained two to three times more significant signatures (≥4 TPM) than the MPSS libraries. About 79 to 85% of the MPSS and 89 to 95% of the SBS significant signatures matched to the japonica (Nipponbare) genomic sequence (Table 1). The significant MPSS and SBS signatures from all five libraries were classified into seven classes based on their location on the annotated genes according to the method previously described by Meyers et al. [45] (Additional File 2).
Table 1

Characteristics of the MPSS and SBS libraries of developing rice seeds

ClassificationCypress (PSC)LaGrue (PSL)Ilpumbyeo (PSI)YR15965 (PSY)Nipponbare (PSN)

MPSSSBSMPSSSBSMPSSSBSMPSSSBSMPSSSBS
Number of reads1,266,7133,718,4641,082,0992,326,6631,201,5843,293,3941,190,2502,634,7911,207,9144,110,241
Distinct signatures12,660103,74118,297165,12917,78383,07112,37977,02316,499104,531
Significant signatures10,09930,57114,25336,33513,97125,91510,40223,30213,11634,165
Non-significant signatures2,56173,1704,044128,7943,81257,1561,97753,7213,38370,366
1-100 TPM10,705101,18716,280161,92915,60580,06910,44974,56414,486101,894
101-1,000 TPM1,7832,3121,8272,8961,9852,7041,7412,2261,8152,389
1,001-10,000 TPM157228177290183284176218186234
>10,000 TPM15141314101413151214
Total signatures matched to the Nipponbare genome10,94022,87215,81317,52115,27620,03210,67818,31314,37828,118
Significant signatures matched to the Nipponbare genome8,855 (80.9%)21,582 (94.3%)12,534 (79.2%)15,514 (88.5%)12,313 (80.6%)19,000 (94.8%)9,118 (85.3%)17,477 (95.4%)11,622 (80.8%)25,777 (91.6%)
Significant signatures specifically identified by either MPSS or SBS2,35922,8313,86925,9515,21917,1633,21616,1163,66924,718
Significant signatures identified by both MPSS and SBS7,740 (77% overlap)10,384 (73% overlap)8,752 (62% overlap)7,186 (70% overlap)9,447 (72% overlap)
Characteristics of the MPSS and SBS libraries of developing rice seeds

Correlation of the transcriptomic results generated by the MPSS and SBS technologies

From 62 to 77% of the significant signatures overlapped between the MPSS and SBS libraries (Table 1). Further, we used all the significant signatures in the MPSS and SBS libraries of the same cultivar for Pearson correlation coefficient analysis. The correlation coefficient was low when unfiltered MPSS and SBS data were used (Table 2). Removal of a small fraction of outliers (3-8, < 0.001% of the signatures) increased the correlation coefficient significantly in all five libraries (Table 2). For example, the correlation coefficient between the two YR15965 libraries was increased from 0.58 to 0.90 after removal of only four of 5,757 signatures.
Table 2

Correlation of the transcriptome results obtained by the MPSS and SBS technologies

CultivarCorrelation coefficient using all significant signatures*Correlation coefficient after removing few outliers
Cypress0.610.85 (removal of 3 out of 5,803 signatures)
LaGrue0.490.83 (removal of 6 out of 6,028 signatures)
Nipponbare0.530.87 (removal of 4 out of 7,815 signatures)
Ilpumbyeo0.390.81 (removal of 8 out of 6,985 signatures)
YR159650.580.90 (removal of 4 out of 5,757 signatures)

* Only genome matched significant signatures were used.

Correlation of the transcriptome results obtained by the MPSS and SBS technologies * Only genome matched significant signatures were used.

Expression patterns of grain quality-related genes in the cultivars with high milling yield and good eating quality

Data mining of the TIGR rice annotated genes (pseudomolecules version 5) identified 338 grain quality-related genes belonging to starch biosynthesis and degradation, seed storage protein synthesis (glutelin, globulin, and prolamins), seed maturation, seed allergen synthesis, seed development, and biosynthesis and degradation of aspartate family amino acids (aspartate, asparagine, threonine, isoleucine, methionine, and lysine). We examined the expression level of these genes in developing rice seeds of the five cultivars (Additional File 3). In both SBS and MPSS libraries, a total of 419 (16 grain-related genes) and 168 genes (3 grain-related genes) were ≥5-fold up- and down-regulated, respectively, in Cypress relative to both LaGrue and Nipponbare (Table 3). Similarly, 518 (8 grain-related genes) and 106 genes (4 grain-related genes) were ≥5-fold up- and down-regulated, respectively, in Ilpumbyeo relative to both YR15965 and Nipponbare (Table 3). The number of 5-fold up- and down-regulated antisense genes, genes with antisense transcripts, and genes encoding transcription factors (TFs) in Cypress (compared to both LaGrue and Nipponbare) and Ilpumbyeo (compared to both YR15965 and Nipponbare) are also listed in Table 3.
Table 3

The number of over five fold up- and down-regulated genes in Cypress in comparison to LaGrue and Nipponbare and in Ilpumbyeo in comparison to YR15965 and Nipponbare

Classification of genesCypressIlpumbyeo

MPSSSBSCommonMPSSSBSCommon
Total number of up-regulated genes1536 (44)*4,030 (70)419 (16)2,396 (39)3,339 (50)518 (8)
Total number of down regulated genes1409 (33)1,373 (14)168 (3)514 (11)2,119 (47)106 (4)
Total number of up-regulated antisense genes164 (12)1,266 (32)23 (5)256 (7)1196 (28)32 (1)
Total number of down regulated antisense genes53 (7)330 (4)6 (0)22 (5)372 (17)3 (1)
Total number of up-regulated genes with alternate transcripts17438233223059
Total number of down regulated genes with alternate transcripts131952571952
Total number of up-regulated transcription factor genes1252733718223514
Total number of down regulated transcription factor genes102905034515

*Inside parenthesis is the number of grain quality-related genes

The number of over five fold up- and down-regulated genes in Cypress in comparison to LaGrue and Nipponbare and in Ilpumbyeo in comparison to YR15965 and Nipponbare *Inside parenthesis is the number of grain quality-related genes

Genes involved in starch metabolism

Many genes involved in starch metabolism showed similar expression patterns in both SBS and MPSS libraries (Table 4 and Additional File 4A). For example, the genes encoding 1,4-α-glucan branching enzyme (Os02g32660), limit dextrinase (Os04g08270), 1,4-α-glucan branching enzyme (Os06g51084), and α-amylase (Os09g29404) were 5-fold up-regulated in Cypress compared to LaGrue and Nipponbare in both SBS and MPSS libraries (Table 4).
Table 4

List of grain quality genes with similar expression patterns in both SBS and MPSS libraries and up-regulated over five fold in Cypress (in comparison to LaGrue and Nipponbare) and in Ilpumbyeo (in comparison to YR15965 and Nipponbare)

Up-regulated in CypressFamilyTIGR Gene IDTIGR gene nameMPSS signatureSBS signatureMPSS Ratio PSC/PSLMPSS Ratio PSC/PSNSBS Ratio PSC01/PSL01SBS Ratio PSC01/PLN02
Starch synthesis and degradationOs04g08270Limit dextrinase, putative, expressed or alpha-amylaseGATCAGATACTCCTCACGATCAAATCTGAACCCATCA4916.388
Starch degradationOs09g29404Alpha-amylase activityGATCTCCTCCTTTTCCTGATCTGGAAGCGCGCCATTG252555
GlutelinOs02g15070Glutelin type-B 7 precursorGATCCATTGCACAAGAGGATCCAGCCACAAACCAATG221188
Starch synthesis and degradationOs06g510841,4-alpha-glucan branching enzyme, chloroplast precursorGATCAAGCAATGAATGCGATCAACCCATGCTCCACCC24241919
Seed specificOs03g58480Seed specific protein Bn15D14AGATCACATCGTCACAGCGATCTAGAATCTCCAGAGGG142877
Starch degradationOs02g32660Expressed 1,4-alpha-glucan branching enzymeGATCATGACTTTCAGCAGATCACAGAAGACACACTTC62488
Aspartate biosynthesis and degradation II + asparagine biosynthesis I and degradation IOs02g14110Nitrogen compound metabolismGATCTGTGAATTTGGCAGATCCAGAGAGAGATGCTAA666688
GlobulinOs03g46100Globulin-1 S allele precursorGATCATCCGCGCGTCGGGATCGTTTAGTTGGGAGTGG202088
Seed maturationOs09g10620seed maturation protein LEA 4GATCGAGTTGAGTGTGTGATCGACTTGTGTGAGTTGT161688

Up regulated in IlpumbyeoFamilyTIGR Gene IDTIGR gene nameMPSS signatureSBS signatureMPSS Ratio PSI/PSYMPSS Ratio PSI/PSNSBS Ratio PSI02/PSY02SBS Ratio PSI02/PSN02

Methionine degradation IOs01g22010Methionine adenosyltransferaseGATCCCGACTTCACATGGATCCTCGCGGCCGAAATGG35351414
Isoleucine biosynthesis IOs03g21080Acetolactate synthaseGATCGGAGCCTAGTTGCGATCCAGCACACATTCAAAA551010
Starch synthesisOs06g12450Soluble starch synthase 2-3, chloroplast precursorGATCTGGAAGTGAAATAGATCTGGAAGTGAAATATTT12122020
Seed allergenic/lectinOs07g11510Seed allergenic protein RA5 precursorGATCGCCTCGCACCTGCGATCACTTTAGTCTTTATAG15152424
Starch degradationOs02g32660Expressed 1,4-alpha-glucan branching enzymeGATCAGTGTTTTAAGTTGATCAAATTACATATTGCTG24.5495656
Starch synthesisOs06g04200Granule-bound starch synthase 1, chloroplast precursorGATCTTCCACAGCAACAGATCTTGGCAAGTCAATTAA661414
Isoleucine degradation IOs02g43720Enoyl-coA hydrataseGATCGTCTTGAAGGTCTGATCTACCTCCATGCCTTGA6.2251515
Aspartate biosynthesis and degradation II + asparagine biosynthesis I and degradation IOs01g55540Aspartate transaminaseGATCAAGTGGCTTTCATGATCGGCAAATACTCCTTAA34343232
List of grain quality genes with similar expression patterns in both SBS and MPSS libraries and up-regulated over five fold in Cypress (in comparison to LaGrue and Nipponbare) and in Ilpumbyeo (in comparison to YR15965 and Nipponbare) Interestingly, we found that genes encoding enzymes involved in the biosynthesis of starch underwent alternative splicing (Figure 1). For example, genes involved in the breakdown of long linear glucan leading to β-D-glucose-6-phosphate (Os03g55090 encoding phosphorylase and Os03g50480 encoding phosphoglucomutase) underwent alternative splicing in Ilpumbyeo and Cypress (Additional File 4A, 4B, 4C). Similarly, the genes encoding the α-amylases (Os09g29404/Os04g08270/Os04g33040/Os01g51754) and 1-4 α- glucan branching enzyme (Os06g51084) involved in the breakdown of short linear glucan leading to β-D-glucose underwent alternative splicing in Ilpumbyeo and Cypress (Additional File 4A, 4B, 4C). Some of the 5-fold up-regulated genes identified by either MPSS or SBS also had alternative splicing forms, and these included genes encoding glucose-1-phosphate adenylyltransferase large subunit 1 (also called AGPase) (Os01g44220) and 1,4-α-glucan branching enzyme (Os06g51084). Similarly, some of the 5-fold down-regulated genes identified by either MPSS or SBS produced alternative splicing forms, and these included genes encoding 1,4-α-glucan branching enzyme (Os06g51084) and phosphoglucomutase (Os03g50480) (Additional File 4A, 4B, 4C). These results showed the complexity of the transcription of quality-related genes in developing rice seeds.
Figure 1

Network of genes involved in starch biosynthesis and degradation, and in the biosynthesis of seed storage, seed maturation, and allergenic proteins . Only the genes with 5-fold up- or down-regulation in Cypress (PSC) or Ilpumbyeo (PSI) compared with that in LaGrue or YR15965 are shown. The positive number in parenthesis indicates up-regulation and the negative number in parenthesis indicates down-regulation. The first value in parenthesis shows the fold change in expression either in LaGrue or YR15965, and the second value shows the fold change in expression in Nipponbare. The italicized and underlined bold number before the parenthesis shows the MPSS/SBS signature class [45]. Green indicates that the gene was identified by SBS only. Red indicates that the gene was identified by MPSS only. Blue indicates that the gene was identified by both MPSS and SBS.

Network of genes involved in starch biosynthesis and degradation, and in the biosynthesis of seed storage, seed maturation, and allergenic proteins . Only the genes with 5-fold up- or down-regulation in Cypress (PSC) or Ilpumbyeo (PSI) compared with that in LaGrue or YR15965 are shown. The positive number in parenthesis indicates up-regulation and the negative number in parenthesis indicates down-regulation. The first value in parenthesis shows the fold change in expression either in LaGrue or YR15965, and the second value shows the fold change in expression in Nipponbare. The italicized and underlined bold number before the parenthesis shows the MPSS/SBS signature class [45]. Green indicates that the gene was identified by SBS only. Red indicates that the gene was identified by MPSS only. Blue indicates that the gene was identified by both MPSS and SBS. For validation of the MPSS data, two starch biosynthesis-related genes that showed differential expression in the grain libraries were selected for strand specific RT-PCR. These two genes encode AGPase (AK073146) and GBSS I (AK070431). Total RNA was isolated from the developing seeds of Cypress, LaGrue, Ilpumbyeo, YR15965 and Nipponbare at 3, 6, 9, 12 and 15 DAF (days after flowering). A time-course study of the AGPase and GBSS I genes indicated that expression levels were higher in the high milling Cypress than in the low milling LaGrue in the early stages (6 and 9 DAF) of seed development (Figure 2).
Figure 2

RT-PCR analysis of the genes encoding GBSS I and AGPase in developing rice seeds at 3, 6, 9, 12, and 15 days after anthesis in five rice cultivars.

RT-PCR analysis of the genes encoding GBSS I and AGPase in developing rice seeds at 3, 6, 9, 12, and 15 days after anthesis in five rice cultivars.

Genes encoding essential amino acids

The aspartate family pathway consists of five amino acids (asparagine, aspartate, lysine, methionine, and threonine), and is catalysed primarily by the enzymes aspartate kinase (AK) and dihydrodipicolinate synthase (DHPS). The regulatory network of the genes involved in the biosynthesis and degradation of aspartate family amino acids is plotted in Additional File 5. The genes involved in the metabolism of the aspartate family amino acids with 5-fold up- or down-regulation in Cypress and Ilpumbyeo compared to their controls (LaGrue, YR15965, and Nipponbare) are listed in Additional File 4A, 4B, 4C, and Additional File 5). Some of the important genes for amino acid biosynthesis showed similar expression patterns in both MPSS and SBS libraries (Table 4 and Additional File 4A, 4B, 4C). For example, the genes encoding aspartate transaminase (Os01g55540), methionine adenosyltransferase (Os01g22010), and acetolactate synthase (Os03g21080) were 5-fold up-regulated in Ilpumbyeo compared to YR15965 and Nipponbare in both SBS and MPSS libraries. In contrast, some of the genes involved in aspartate family amino acid biosynthesis were down-regulated, including those encoding threonine synthase (Os01g49890), aspartate kinase (Os03g63330), and malate dehydrogenase (Os10g33800) (Additional File 4A, 4B, 4C). In addition, many of the genes involved in the amino acid biosynthesis also underwent alternative splicing. Among them, some showed 5-fold up-regulation in Ilpumbyeo in either the MPSS or SBS libraries, and these included genes encoding L-3-cyanoalanine synthase (Os04g08350), methionine gamma-lyase (Os09g28050), and asparaginase (Os04g46370), which showed two, two, and three alternative splice forms, respectively (Additional File 4A, 4B, 4C).

Genes encoding seed-storage proteins

The major classes of storage proteins are glutelins, globulins, and prolamins. Some of the genes encoding these classes showed over 5-fold up-regulation in Cypress compared to LaGrue and Nipponbare, and these genes included those encoding glutelin type-B7 precursors (Os02g15070, Os02g15090), globulin-1 S allele precursor (Os03g46100), prolamin PPROL 17 precursor (Os06g31070), and 13 kDa prolamin precursor (Os07g10570) (Table 4 and Additional File 4A). Among the storage-protein genes with over 5-fold up-regulation in Cypress, some produced antisense transcripts like those encoding glutelin type-B7 precursor (Os02g15070, Os02g15090), prolamin PPROL 17 precursor (Os06g31070), and 13 kDa prolamin precursor (Os07g10570) (Additional File 4A). Many genes encoding glutelins and prolamins also underwent alternative splicing or termination in Cypress and Ilpumbyeo. The gene encoding glutelin type-A 2 (Os02g25640) produced 15 and 17 alternative splice forms in MPSS and SBS libraries, and most of them were up-regulated in Cypress but down-regulated in Ilpumbyeo. Among the prolamin-related genes, the prolamin precursor protein gene (Os07g10570) produced five and six alternative splice forms in MPSS and SBS libraries, respectively. The 5-fold induced or suppressed genes encoding globulin, prolamin, and glutelin storage proteins either in Cypress or Ilpumbyeo or both are listed in Additional File 4A and in Figure 1.

Genes encoding seed maturation and allergenic and seed-specific expression proteins

Some of the genes belonging to this group showed similar expression patterns in both MPSS and SBS libraries (Table 4 and Additional File 4A). For example, the genes encoding seed-specific protein Bn15D14A (Os03g58480) and seed-maturation protein LEA4 (Os09g10620) were > 5-fold up-regulated in Cypress compared to LaGrue and Nipponbare in both MPSS and SBS libraries. However, the seed-allergenic protein RA5 precursor gene (Os07g11510) was up-regulated 15-fold in Ilpumbyeo compared to YR15965 (Table 4; Additional File 4A).

Expression patterns of TF genes in cultivars with high milling and good eating quality

TFs were identified using homology search in the rice TF database http://plntfdb.bio.uni-potsdam.de/v3.0/. Clustering analysis was performed to identify TF genes up- and down-regulated in Cypress and Ilpumbyeo compared to the controls (Table 3; Additional File 6). A total of 37 and 14 TF genes showed 5-fold up-regulation in Cypress and Ilpumbyeo libraries, respectively, in both SBS and MPSS libraries (Additional File 6). Similarly, 50 and 5 TF genes were down-regulated in Cypress and Ilpumbyeo, respectively, in both libraries. Some TFs were specifically up-regulated in either Cypress or Ilpumbyeo compared to the controls in both libraries. These TF genes encode PHD-finger family protein (PHD family; Os01g65600), zinc finger CCCH type domain containing protein ZFN-like 2 (C3H family; Os01g68860), transfactor (G2-like; Os06g40710), and bZIP transcription factor family protein (bZIP family; Os06g45140) (Additional File 6).

Identification of the conserved cis motifs among the up-regulated genes in cultivars with high milling and good eating quality

The promoter sequences (1.0 kb before the ATG site) of the highly up-regulated genes (≥50-fold) in Cypress (compared to LaGrue and Nipponbare) and Ilpumbyeo (compared to YR15965 and Nipponbare) identified in both SBS and MPSS libraries were analyzed using the 'PLACE Signal Scan Search' software http://www.dna.affrc.go.jp/htdocs/PLACE/. Many conserved motifs were present in the up-regulated genes in Cypress and Ilpumbyeo, and these included CAATBOX1, WRKY71OS, GATABOX, EBOXBNNAPA, SEF4MOTIFGM7S, CGACGOSAMY3, WBOXHVISO1, CAREOSREP1, CANBNNAPA, AMYBOX1, AACACOREOSGLUB1, BOXIIPCCHS, 2SSEEDPROTBANAPA, ACGTABOX, AMYBOX2, ACGTCBOX, ACGTOSGLUB1, CEREGLUBOX2PSLEGA, and GADOWNAT (Additional File 7). Interestingly, many of the motifs have been reported to play a role in seed development and germination (Additional File 7) [46-70].

Discussion

Rice is a major source of nutrition for most people in the developing world. Although tremendous achievements have been made for the improvement of many agronomic traits in rice in the last three decades, much less progress has been obtained for quality traits due to the lack of simple and efficient selection methods in rice breeding. With rapid advancement in crop molecular breeding, marker-aided selection has been successfully applied in many crop plants. Similarly, new methods for genetic engineering of better crop plants have been reported in the last decade by overexpressing or gene silencing of candidate genes. Although several eating quality QTLs have been identified in previous studies [18,19], it is not clear whether these QTLs are useful for marker-aided selection or not because the genomic regions of these QTLs have not been further characterized. Recently Nelson et al. [9] identified six main-effect milling yield QTLs in the two RIL populations derived from crosses of common parent Cypress with RT0034, a low-milling yield japonica line and LaGrue, a low-milling yield japonica cultivar, respectively. In this study, we used two high throughput sequencing technologies to profile the transcriptome of five cultivars differing in milling yield and eating quality. Many genes specifically or commonly expressed in the high milling yield cultivar Cypress and the good eating quality cultivar Ilpumbyeo were identified from the MPSS and SBS libraries. These candidate genes are excellent starting materials for the development of molecular markers linked to milling quality in the US and eating quality in Korea for rice breeding. It is also possible that overexpression or silencing of some candidate genes will lead to the generation of transgenic rice plants with superior grain quality. During the rice seed development, sugars, amino acids, and other important metabolites are transported from source (primarily leaves) to sink (seeds). Once in the seeds, these metabolites are allocated to different biosynthetic pathways (primarily starch metabolism and storage protein biosynthesis) to produce mainly starch and proteins in precise quantities and ratios. Achieving such a defined composition of starch and proteins require the regulation and coordination of various pathways so that, at each developmental stage, the participating enzymes are present in appropriate amounts and in the correct cellular compartments [3]. AGPase and GBSS I play important roles during starch biosynthesis in rice [71]. The genes encoding for AGPase and GBSS I enzymes are highly expressed 7 to 28 days after flowering during grain development, and their expression is highly correlated with the increases in both starch content and grain weight. The AGPase gene is also highly expressed in the high-yield cultivars of both glutinous and non-glutinous rice [71]. In addition, AGPase (Os01g44220) undergoes alternative splicing similar to the AGPase small subunit gene in barley [72]. Duan and Sun [3] showed that a mutation in the GBSS I gene leads to a lower level of functional GBSS I mRNA and correspondingly to a lower level of GBSS I enzyme for amylose synthesis, which causes a reduction in amylose accumulation. During rice seed formation, the genes encoding AGPases are active 3 days before flowering and maintain an intermediate although declining level of activity during seed maturation [3]. Genetic variation survey showed that the polymorphism in the rice waxy gene encoding the GBSS enzyme explains much of the variation in apparent amylose content across 92 important long, medium and short grain US rice cultivars and 101 progeny of a cross between low-amylose and intermediate-amylose breeding lines [73,74]. The amylose content and the level of waxy protein in 31 rice cultivars from China were correlated with the ability of the cultivar to excise intron I from the leader sequence of the Wx transcript [75]. In this study, we found that the important starch biosynthesis related genes encoding AGPase (Os01g44220), 1,4-α-glucan branching enzyme (Os02g32660), limit dextrinase (Os04g08270), 1,4-α-glucan branching enzyme (Os06g51084), and α-amylase (Os09g29404) were up-regulated in Cypress compared to LaGrue and Nipponbare in six-days old developing seeds. Our time-course RT-PCR analysis also confirmed that expression of AGPase and GBSS I genes was higher in the high milling cultivar Cypress than in the low milling cultivar LaGrue early (6 and 9 DAF) in seed development. These results suggest that these two genes related to starch synthesis may greatly affect milling yield. Starch biosynthesis is also associated with complex genotypic-environmental interactions in maize endosperm [76]. Since the plants in this study were grown in the controlled environmental conditions (growth chambers), the effect of environmental factors on the expression of the starch biosynthesis genes should be tested in the field conditions. Cereal proteins are generally deficient in lysine, but lysine content might be increased with increased accumulation of the precursor molecules required for the enzymatic reactions involved in lysine metabolism. The key precursor molecules include lactate, acetyl CoA, malate, L-aspartate, L-asparagine, L-aspartate-semialdehyde, homoserine, homocysteine, 2-oxobutanoate, 2-aceto-1-hydroxybutyrate, and α-ketoglutarate, and the enzymes involved in their production are very important (Additional File 5). Enhancing the production of these precursor molecules will require the identification of the genes encoding these enzymes. In this study, we found that the genes encoding malate dehydrogenase (Os03g56280, Os01g46070) and aminotransferase (Os09g28050, Os03g18810) involved in the production of malate and aspartate in Cypress and Ilpumbyeo, respectively, were up-regulated compared to the controls. Genes encoding aspartate transaminase (Os01g55540) and enoyl-CoA hydratase (Os02g43720) enzymes, which are responsible for the production of acetyl CoA, were also up-regulated in Cypress compared to the controls. Similarly, the gene encoding lactoylglutathione lyase (Os05g07940), which is responsible for the production of lactate, was up-regulated in Cypress compared to the controls. As indicated, genetic manipulation of the expression levels of these precursors/enzymes may lead to an increased accumulation of lysine in the endosperm and thus an increased nutritional value of the rice seeds. In the last decade, oligoarrays, SAGE, MPSS, and SBS have been widely used for transcriptome profiling. MPSS and SBS have been recently used for whole-genome transcription analysis and have generated abundant expression data for many organisms [42,44,45]. In this study, both MPSS and SBS technologies were used to analyze the transcriptomes of the 6-days-old developing seeds in five rice cultivars. The number of redundant and non-redundant signatures generated in this study were similar to those in previous reports in rice and Arabidopsis [43,45,77]. Although MPSS generates large volume of data, its complicated library-construction procedure and high sequencing cost limit its use in individual laboratories. As the cost of the next-generation sequencing methods has significantly decreased in the last few years, SBS sequencing has become a popular method for transcriptome analysis because it costs 90% less than MPSS and can generate at least three times more transcripts. Furthermore, in the current study, about 30% more transcripts were found in the SBS library than in the MPSS library. Many of these additional signatures are low-copy transcripts, indicating that SBS is a powerful method for identifying rare transcripts [43]. The correlation coefficient is higher between MPSS and SBS than between RL-SAGE and microarray [78], or between RL-SAGE and MPSS or MPSS and microarrays as in previous studies [79]. Therefore, SBS will undoubtedly become the preferred high throughput sequencing method for deep transcriptome analysis in plants.

Conclusion

Breeding for milling yield and eating quality in rice has been a daunting task due to the low genetic inheritability of both traits and the lack of molecular markers linked to the phenotypes. Genetic mapping of the two traits is also challenging because the traits are easily affected by environmental factors in the field. Using two high throughput sequencing methods, we identified many differentially expressed genes in developing rice seeds that may affect milling yield or eating quality. Many of the identified genes are involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins. Some of these potential candidate genes could be used for the development of molecular markers for breeding programs or for the engineering of rice cultivars with high milling yield and eating quality. Our study provides a valuable genomic resource for both improvement of rice grain quality and for the characterization of grain quality pathways at the molecular and biochemical levels.

Methods

Plant materials, developing seeds harvest and growth conditions

Five rice cultivars including Cypress, LaGrue, Ilpumbyeo, YR15965, and Nipponbare were used in the study. Cypress (japonica cultivar) is a long grain cultivar with high yield and high milling quality released by Louisiana State University. Cypress dries down slowly in the field, avoiding grain fissuring, cracking and chalkiness that reduce milling quality http://agebb.missouri.edu/rice/research/99/pg5.htm[80-84]. LaGrue (japonica cultivar), a long grain variety released by the University of Arkansas in 1993, has low milling quality [80-84]. Both Cypress and LaGrue seeds were provided by Dr. Robert Fjellstrom, USDA-ARS Dale Bumpers National Rice Research Center, Stuttgart, Arkansas, USA. Ilpumbyeo (japonica cultivar) is a good eating quality cultivar with low amylose content [85-87]. YR15965 (japonica cultivar) is a low eating quality rice, derived from a cross between Hwayeongbyeo (temperate japonica variety) and Shennung 89-366 (sub-tropical japonica) [86]. Both Ilpumbyeo and YR15965 seeds were provided by Dr. Gynheung An, Crop Biotech Institute, Kyung Hee University, Korea. Nipponbare (japonica cultivar) was used as a control for milling and eating quality with Cypress, LaGrue, Ilpumbyeo and YR15965. All the five cultivars were grown in 3 replications in a Conviron growth chamber at 80% relative humidity with 12 h of light (500 μmol photons m-2 sec-1) at 26°C followed by 12 h of dark at 20°C. The spikelets were labeled on the day of anthesis to identify the age of developing seeds in a panicle. The developing seeds were harvested from the panicles at 3, 6, 9, 12 and 15 D after anthesis. The excised developing seeds from the panicle were freezed immediately in liquid nitrogen.

RNA isolation and RT-PCR

Total RNA was isolated from developing rice seeds harvested from Cypress, LaGrue, Ilpumbyeo, YR15965 and Nipponbare plants using Trizol reagent (Invitrogen). For removal of polysaccharides/polyglycons from the extract, the extracted RNA was purified twice by high salt precipitation according to the manufacturer's instructions. For the MPSS and SBS library construction, RNA isolated from the 6-days (D)-old developing seeds (intermediate stage of grain filling) was used. For the time-course RT-PCR validation experiments, RNA isolated at 3, 6, 9, 12 and 15 D old developing seeds was used. RT-PCR was performed as described previously [78].

MPSS and SBS library construction, sequencing, and bioinformatics

MPSS and SBS libraries were constructed using the RNA obtained from 6 days old developing seeds from Cypress (MPSS library-PSC; SBS library-SPSC), LaGrue (MPSS library-PSL; SBS library-SPSL), Ilpumbyeo (MPSS library-PSI; SBS library-SPSI), YR15965 (MPSS library PSY; SBS library-SPSY) and Nipponbare (MPSS library-PSN; SBS library-SPSN). MPSS and SBS library construction and sequencing were performed essentially as previously described [43,45,77]. Data analysis was carried out to identify the genes responsible for milling quality and eating quality. The expression profiles of Cypress were compared with that of LaGrue and Nipponbare to identify the genes responsible for milling quality. Similarly, the expression profiles of Ilpumbyeo were compared with that of YR15965 and Nipponbare to identify the genes responsible for eating quality. Bioinformatic analyses including identification of antisense transcripts, alternate transcripts, and TFs were conducted as previously described [43]. Gramene database http://www.gramene.org was used as a reference database for the identification of genes involved in starch metabolism, aspartate amino acid metabolism, storage and allergenic protein synthesis, and seed maturation [88]. The entire dataset is available at the NCBI's Gene Expression Omnibus (GEO) database through the accession number GSM629225 to GSM629233

Authors' contributions

RCV and MVS grew the plants, collected developing seeds, extracted RNA and performed RTPCR experiments, constructed SBS libraries, bioinformatic analysis of MPSS and SBS data and wrote the manuscript, KN and AB analyzed the data, GA, BCM and G-LW designed the experimental plan and revised the manuscript. The authors agreed on the contents of the paper.

Additional file 1

Filter results of the five MPSS and SBS libraries. A) A total of 39,288 distinct 17-base expressed signatures from the five MPSS libraries were processed according to three filters: significance, reliability, and genomic match. B) Similarly, 397,543 signatures from the five SBS libraries were also processed using these same filters as previously described by Meyers et al. [45]. Click here for file

Additional file 2

Classification of the MPSS and SBS signatures from the five libraries based on their location on the annotated gene (hits = 1) (See Meyers et al. 2004 [45]for details). Click here for file

Additional file 3

List of expressed grain quality related genes identified in 6 days old developing seeds by MPSS and SBS technologies. Click here for file

Additional file 4

List of five fold up and down-regulated genes, antisense and alternate transcripts. A: List of genes commonly identified by MPSS and SBS technologies. Five fold up- and down-regulated genes, antisense and alternate transcripts are presented. B: Genes identified by SBS technology. Five fold up- and down-regulated genes, antisense and alternate transcripts are listed. C: Genes identified by MPSS technology. Five fold up- and down-regulated genes, antisense and alternate transcripts are listed. Click here for file

Additional file 5

Network of lysine and aspartate family amino acid biosynthesis and degradation. http://www.gramene.org. Only the genes with 5-fold up- or down-regulation in Cypress (PSC) or Ilpumbyeo (PSI) compared with that in LaGrue or YR15965 are shown. The positive number in parenthesis indicates up-regulation and the negative number in parenthesis indicates down-regulation. The first value in parenthesis shows the fold change in expression either in LaGrue or YR15965, and the second value shows the fold change in expression in Nipponbare. The italicized and underlined bold number before the parenthesis shows the MPSS/SBS signature class [45]. Green indicates that the gene was identified by SBS only. Red indicates that the gene was identified by MPSS only. Blue indicates that the gene was identified by both MPSS and SBS. Click here for file

Additional file 6

Five fold up and down regulated transcription factors identified by MPSS, SBS and both. Click here for file

Additional file 7

Conserved . Click here for file
  50 in total

Review 1.  Cereal seed storage proteins: structures, properties and role in grain utilization.

Authors:  Peter R Shewry; Nigel G Halford
Journal:  J Exp Bot       Date:  2002-04       Impact factor: 6.992

2.  Two cis-acting elements necessary and sufficient for gibberellin-upregulated proteinase expression in rice seeds.

Authors:  Keita Sutoh; Daisuke Yamauchi
Journal:  Plant J       Date:  2003-06       Impact factor: 6.417

Review 3.  Short-read sequencing technologies for transcriptional analyses.

Authors:  Stacey A Simon; Jixian Zhai; Raja Sekhar Nandety; Kevin P McCormick; Jia Zeng; Diego Mejia; Blake C Meyers
Journal:  Annu Rev Plant Biol       Date:  2009       Impact factor: 26.379

4.  Sequences responsible for the tissue specific promoter activity of a pea legumin gene in tobacco.

Authors:  A Shirsat; N Wilford; R Croy; D Boulter
Journal:  Mol Gen Genet       Date:  1989-01

5.  The amylose content in rice endosperm is related to the post-transcriptional regulation of the waxy gene.

Authors:  Z Y Wang; F Q Zheng; G Z Shen; J P Gao; D P Snustad; M G Li; J L Zhang; M M Hong
Journal:  Plant J       Date:  1995-04       Impact factor: 6.417

6.  Ternary complex formation between HvMYBS3 and other factors involved in transcriptional control in barley seeds.

Authors:  I Rubio-Somoza; M Martinez; Z Abraham; I Diaz; P Carbonero
Journal:  Plant J       Date:  2006-06-07       Impact factor: 6.417

7.  Messenger RNA targeting of rice seed storage proteins to specific ER subdomains.

Authors:  S B Choi; C Wang; D G Muench; K Ozawa; V R Franceschi; Y Wu; T W Okita
Journal:  Nature       Date:  2000-10-12       Impact factor: 49.962

8.  Analysis of the genetic behavior of some starch properties in indica rice ( Oryza sativa L.): thermal properties, gel texture, swelling volume.

Authors:  J. S. Bao; M. Sun; H. Corke
Journal:  Theor Appl Genet       Date:  2002-02       Impact factor: 5.699

Review 9.  Starch synthesis in the cereal endosperm.

Authors:  Martha G James; Kay Denyer; Alan M Myers
Journal:  Curr Opin Plant Biol       Date:  2003-06       Impact factor: 7.834

10.  A novel WRKY transcription factor, SUSIBA2, participates in sugar signaling in barley by binding to the sugar-responsive elements of the iso1 promoter.

Authors:  Chuanxin Sun; Sara Palmqvist; Helena Olsson; Mats Borén; Staffan Ahlandsberg; Christer Jansson
Journal:  Plant Cell       Date:  2003-09       Impact factor: 11.277

View more
  13 in total

1.  The Function of DNA Demethylase Gene ROS1a Null Mutant on Seed Development in Rice (Oryza Sativa) Using the CRISPR/CAS9 System.

Authors:  Faiza Irshad; Chao Li; Hao-Yu Wu; Yan Yan; Jian-Hong Xu
Journal:  Int J Mol Sci       Date:  2022-06-07       Impact factor: 6.208

2.  Transcriptomic analysis of rice (Oryza sativa) endosperm using the RNA-Seq technique.

Authors:  Yi Gao; Hong Xu; Yanyue Shen; Jianbo Wang
Journal:  Plant Mol Biol       Date:  2013-01-16       Impact factor: 4.076

3.  Genome-wide analysis of the complex transcriptional networks of rice developing seeds.

Authors:  Liang-Jiao Xue; Jing-Jing Zhang; Hong-Wei Xue
Journal:  PLoS One       Date:  2012-02-17       Impact factor: 3.240

4.  Improvement of marker-based predictability of Apparent Amylose Content in japonica rice through GBSSI allele mining.

Authors:  Chiara Biselli; Daniela Cavalluzzo; Rosaria Perrini; Alberto Gianinetti; Paolo Bagnaresi; Simona Urso; Gabriele Orasen; Francesca Desiderio; Elisabetta Lupotto; Luigi Cattivelli; Giampiero Valè
Journal:  Rice (N Y)       Date:  2014-01-02       Impact factor: 4.783

5.  Deep sequencing transcriptional fingerprinting of rice kernels for dissecting grain quality traits.

Authors:  Chiara Biselli; Paolo Bagnaresi; Daniela Cavalluzzo; Simona Urso; Francesca Desiderio; Gabriele Orasen; Alberto Gianinetti; Federico Righettini; Massimo Gennaro; Rosaria Perrini; Manel Ben Hassen; Gian Attilio Sacchi; Luigi Cattivelli; Giampiero Valè
Journal:  BMC Genomics       Date:  2015-12-21       Impact factor: 3.969

6.  Identification and Characterization of Differentially Expressed Genes in Inferior and Superior Spikelets of Rice Cultivars with Contrasting Panicle-Compactness and Grain-Filling Properties.

Authors:  Sudhanshu Sekhar; Sachin Ashruba Gharat; Binay Bhushan Panda; Trupti Mohaptra; Kaushik Das; Ekamber Kariali; Pravat Kumar Mohapatra; Birendra Prasad Shaw
Journal:  PLoS One       Date:  2015-12-28       Impact factor: 3.240

7.  Identification of candidate genes associated with positive and negative heterosis in rice.

Authors:  R C Venu; Jianbing Ma; Yulin Jia; Guangjie Liu; Melissa H Jia; Kan Nobuta; M V Sreerekha; Karen Moldenhauer; Anna M McClung; Blake C Meyers; Guo-Liang Wang
Journal:  PLoS One       Date:  2014-04-17       Impact factor: 3.240

8.  QTL map meets population genomics: an application to rice.

Authors:  Jeffrey A Fawcett; Tomoyuki Kado; Eriko Sasaki; Shohei Takuno; Kentaro Yoshida; Ryuichi P Sugino; Shunichi Kosugi; Satoshi Natsume; Chikako Mitsuoka; Aiko Uemura; Hiroki Takagi; Akira Abe; Takashige Ishii; Ryohei Terauchi; Hideki Innan
Journal:  PLoS One       Date:  2013-12-23       Impact factor: 3.240

9.  WikiPathways for plants: a community pathway curation portal and a case study in rice and arabidopsis seed development networks.

Authors:  Mamatha Hanumappa; Justin Preece; Justin Elser; Denise Nemeth; Gina Bono; Kenny Wu; Pankaj Jaiswal
Journal:  Rice (N Y)       Date:  2013-05-29       Impact factor: 4.783

10.  Proteomic Dissection of Endosperm Starch Granule Associated Proteins Reveals a Network Coordinating Starch Biosynthesis and Amino Acid Metabolism and Glycolysis in Rice Endosperms.

Authors:  Huatao Yu; Tai Wang
Journal:  Front Plant Sci       Date:  2016-05-25       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.