| Literature DB >> 26697344 |
Rongshuang Lin1, Jane Glazebrook1, Fumiaki Katagiri1, James H Orf2, Susan I Gibson1.
Abstract
Soybean is a major source of protein and oil and a primary feedstock for biodiesel production. Research on soybean seed composition and yield has revealed that protein, oil and yield are controlled quantitatively and quantitative trait loci (QTL) have been identified for each of these traits. However, very limited information is available regarding the genetic mechanisms controlling seed composition and yield. To help address this deficiency, we used Affymetrix Soybean GeneChips® to identify genes that are differentially expressed between developing seeds of the Minsoy and Archer soybean cultivars, which differ in seed weight, yield, protein content and oil content. A total of 700 probe sets were found to be expressed at significantly different (defined as having an adjusted p-value below or equal to 0.05 and an at least 2-fold difference) levels between the two cultivars at one or more of the three developmental stages and in at least one of the two years assayed. Comparison of data from soybeans collected in two different years revealed that 97 probe sets were expressed at significantly different levels in both years. Functional annotations were assigned to 78% of these 97 probe sets based on the SoyBase Affymetrix™ GeneChip® Soybean Genome Array Annotation. Genes involved in receptor binding/activity and protein binding are overrepresented among the group of 97 probe sets that were differentially expressed in both years assayed. Probe sets involved in growth/development, signal transduction, transcription, defense/stress response and protein and lipid metabolism were also identified among the 97 probe sets and their possible implications in the regulation of agronomic traits are discussed. As the Minsoy and Archer soybean cultivars differ with respect to seed size, yield, protein content and lipid content, some of the differentially expressed probe sets identified in this study may thus play important roles in controlling these traits. Others of these probe sets may be involved in regulation of general seed development or metabolism. All microarray data and expression values after GCRMA are available at the Gene Expression Omnibus (GEO) at NCBI (http://www.ncbi.nlm.nih.gov/geo), under accession number GSE21598.Entities:
Keywords: Composition; FDR, false discovery rate; GCRMA, Guanine Cytosine Robust Multi-Array analysis; GEO, Gene Expression Omnibus; GO, Gene Ontology; GPI, glycosylphosphatidylinositol; HY, high yield; LY, low yield; QTL, quantitative trait locus; RCB, randomized complete block; RIL, recombinant inbred line; Seed; Soybean; TAG, triacylglycerol; Transcriptional profiling; Yield
Year: 2015 PMID: 26697344 PMCID: PMC4664686 DOI: 10.1016/j.gdata.2015.08.005
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Comparison of seed composition, weight and yield between the Minsoy and Archer cultivars and two recombinant inbred lines (RILs). Data for the two parental lines were taken from published sources [7], [9], [18] and data for the two RILs are based on combined data collected from plants grown during 2000 in Becker, MN and during 2001 in Waseca, MN (unpublished data).
| Trait | RIL 6049-273 | RIL 6049-32 | Archer | Minsoy |
|---|---|---|---|---|
| Yield (bu/a) | 66.1 | 33.5 | 49 | 29 |
| Oil (g/kg) | 166 | 162 | 187 | 178 |
| Protein (g/kg) | 344 | 366 | 340 | 353 |
| Seed weight (mg/seed) | 188 | 142 | 161 | 123 |
Comparison of number of differentially expressed transcripts between Minsoy and Archer and two recombinant inbred lines (RILs). The significant cutoff value is q-value = 0.05 and 2-fold change in expression values. The two RILs are LY (lower yield) line 6049–32 and HY (higher yield) line 6049-273.
| Affymetrix probe sets | Stage 1 | Stage 2 | Stage 3 | Total in all stages combined |
|---|---|---|---|---|
| Higher in Minsoy in 2007 | 117 | 120 | 167 | 273 |
| Higher in Archer in 2007 | 172 | 140 | 135 | 301 |
| Total different between Minsoy and Archer in 2007 | 289 | 260 | 302 | 574 |
| Higher in Minsoy in 2008 | 62 | 67 | 73 | 104 |
| Higher in Archer in 2008 | 89 | 76 | 97 | 128 |
| Total different between Minsoy and Archer in 2008 | 151 | 143 | 170 | 232 |
| Higher in LY RIL in 2007 | 74 | 6 | 0 | 78 |
| Higher in HY RIL in 2007 | 55 | 14 | 0 | 66 |
| Total different between LY and HY RILs in 2007 | 129 | 20 | 0 | 144 |
The total number of differentially expressed transcripts in all stages combined is less than the sum of the numbers of differentially expressed transcripts in each of the individual stages as some transcripts were differentially expressed at multiple developmental stages.
Fig. 1Comparison of differentially expressed probe sets between Minsoy and Archer. A comparison was made for each developmental stage to identify probe sets differentially expressed in both 2007 and 2008 (panels a, b, c); then the numbers from the three stages were combined in panel (d). In panels (a), (b) and (c), the number in the left circle represents probe sets unique to 2007, the number in the right circle represents probe sets unique to 2008, and the number in the center area represents probe sets common to 2007 and 2008. Panel (d) illustrates the distribution of probe sets common to 2007 and 2008 among the three stages.
Fig. 2Gene Ontology (GO)slim categorization of differentially expressed probe sets. Panel (a) describes the percentage of probe sets annotated to different GOslim functional categories. Panel (b) describes the percentage of probe sets annotated to different GOslim biological process categories The percentage of probe sets annotated to a GOslim category is equal to the number of probe sets annotated to terms in that GOslim category/total number of probe sets annotated to any term in this ontology times 100. The light-gray bars illustrate data from the 97 probe sets differentially expressed between Minsoy and Archer and the dark-gray bars represent all the other probe sets on the array whose expression was detected in 2007 and 2008. * indicates that the results of the 97 differentially expressed probe sets differed from those of all other detected probe sets on the array with a p-value < 0.1 and ** indicates the results differed with a p-value < 0.05 using Fisher's exact test.
Probe sets potentially involved in seed growth/development and signal transduction. Probe sets that were found to be expressed at significantly different levels between Minsoy and Archer in one or more developmental stages during 2007 and 2008 were searched for those predicted by their GO assignments to be involved in seed growth/development/signal transduction. Significant differences in transcript levels are defined as having a q-value ≤ 0.05 and fold difference ≥ 2. The developmental stage(s) and year(s) in which significant differences in transcript levels were detected for a particular gene are indicated. The cultivar that has higher expression values is also indicated. S1, S2, S3 represent developmental stage 1 (2 mm seeds), stage 2 (3.5 mm seeds) and stage 3 (5–6 mm seeds), respectively. (07) and (08) refer to 2007 and 2008, respectively.
| Affymetrix probe set ID | Gene description | Stage (year) | Higher expression |
|---|---|---|---|
| Gma.1471.2.S1_at | Suppressor of PHYA-105 1 (SPA1) | S1,2 (07);S1,2,3 (08) | Archer |
| Gma.12643.1.A1_at | Light repressible receptor protein kinase | S1 (07);S1,2,3 (08) | Minsoy |
| GmaAffx.88161.1.S1_at | Phytochrome (PHYA) | S1 (07);S1,2,3 (08) | Archer |
| Gma.7387.1.A1_at | Pseudo-response regulator 7 (PRR7) | S1,2,3 (07,08) | Minsoy |
| GmaAffx.77896.1.S1_at | Suppressor of auxin resistance 3 (SAR3) | S1,2,3 (07,08) | Archer |
| GmaAffx.90343.1.S1_at | GAST1 protein homolog 4 (GASA4) | S1,2 (07);S2,3 (08) | Minsoy |
| GmaAffx.90343.1.S1_s_at | GAST1 protein homolog 4 (GASA4) | S1,2 (07);S2,3 (08) | Minsoy |
| Gma.473.1.A1_at | Retinoblastoma-related 1 (RBR1) | S1 (07);S1,2,3 (08) | Archer |
| GmaAffx.21668.1.A1_at | COBRA (COB) | S1,2,3 (07,08) | Archer |
| GmaAffx.28349.1.S1_at | Ankyrin repeat_containing 2B (AKR2B) | S1,2 (07);S1,2,3 (08) | Archer |
| GmaAffx.7290.1.A1_at | alpha-N-acetylglucosaminidase family/NAGLU family | S3 (07,08) | Archer |
Probe sets potentially involved in defense and stress responses. Probe sets predicted to be involved in defense and/or stress response and that exhibit significant differences in transcript levels between Minsoy and Archer were identified. See Table 3 legend for additional information.
| Affymetrix probe set ID | Gene description | Stage (year) | Higher expression |
|---|---|---|---|
| Gma.1034.4.S1_s_at | Caffeoyl-CoA 3-O-methyltransferase, putative | S3 (07);S1,3 (08) | Minsoy |
| Gma.11082.2.A1_s_at | Caffeoyl-CoA 3-O-methyltransferase, putative | S3 (07);S1,3 (08) | Minsoy |
| Gma.11344.1.S1_at | Disease resistance gene (R gene) | S1,2,3 (07,08) | Archer |
| Gma.13012.1.A1_at | Disease resistance protein (TIR-NBS-LRR class), putative | S1,2,3 (07,08) | Archer |
| Gma.17871.1.S1_at | Disease resistance protein (TIR-NBS-LRR class), putative | S1 (07);S1,2,3 (08) | Archer |
| Gma.13036.1.S1_at | Inducer of CBF expression 1 (ICE1) | S1,2,3 (07,08) | Minsoy |
| Gma.1464.1.S1_at | Rotamase FKBP 1 (ROF1) | S1,2 (07);S1,2,3 (08) | Archer |
| Gma.1571.1.S1_at | ALDH10A8, 3-chloroallyl aldehyde dehydrogenase/oxidoreductase | S1,2 (07);S2,3 (08) | Minsoy |
| Gma.16586.1.S1_at | Harpin-induced 1 | S1,2,3 (07);S3 (08) | Minsoy |
| Gma.2628.1.A1_at | S1,2 (07,08) | Archer | |
| Gma.4155.1.S1_at | RAP2.12,ERF/AP2 transcription factor family | S1,2,3 (07,08) | Minsoy |
| Gma.4155.2.S1_s_at | RAP2.12,ERF/AP2 transcription factor family | S1,2,3 (07,08) | Minsoy |
| GmaAffx.14960.1.S1_s_at | RAP2.12,ERF/AP2 transcription factor family | S1 (07);S1,2,3 (08) | Minsoy |
| Gma.7716.1.A1_at | Zinc induced facilitator 1 (ZIF1) | S2 (07);S1,2,3 (08) | Minsoy |
| Gma.8522.1.S1_at | Major latex protein-related/MLP-related | S1,2 (07);S1,2,3 (08) | Archer |
| GmaAffx.21668.1.A1_at | COBRA (COB) | S1,2,3 (07,08) | Archer |
| GmaAffx.59014.1.S1_s_at | Dehydrin family protein | S1,2 (07);S1 (08) | Archer |
| GmaAffx.66290.1.S1_at | Disease resistance protein (CC-NBS class), putative | S1,2 (07);S1,2,3 (08) | Archer |
| GmaAffx.77896.1.S1_at | Suppressor of auxin resistance 3 (SAR3), nucleoporin | S1,2,3 (07,08) | Archer |
| GmaAffx.93642.1.S1_s_at | Secretory protein, putative | S2 (07);S1,2 (08) | Archer |
These probe sets may represent the same gene as they all show the highest match to the same soybean gene model.
Similarly, these two probe sets may represent the same gene.
Probe sets potentially related to protein metabolism. Probe sets predicted to be involved in protein metabolism and that exhibit significant differences in transcript levels between Minsoy and Archer were identified. See Table 3 legend for additional information.
| Affymetrix probe set ID | Gene description | Stage (year) | Higher expression |
|---|---|---|---|
| Gma.13070.2.A1_at | Ribosomal protein L25 | S1,2,3 (07,08) | Minsoy |
| Gma.2628.1.A1_at | S1,2 (07) | Archer | |
| Gma.7719.1.A1_at | TIF3H1, translation initiation factor | S1 (07);S1,2,3 (08) | Archer |
| GmaAffx.16360.1.S1_at | Glyoxylate/hydroxypyruvate reductase | S2 (07);S1,2,3 (08) | Archer |
| GmaAffx.20624.1.S1_at | PUB14, ubiquitin-protein ligase | S1 (07,08) | Archer |
| GmaAffx.93183.1.S1_at | S-adenosylmethionine decarboxylase (SAMDC) | S1,2 (07);S1,2,3 (08) | Archer |
Probe sets potentially involved in lipid metabolism. Probe sets predicted to be involved in lipid metabolism and that exhibit significant differences in transcript levels between Minsoy and Archer were identified. Gene function was predicted both on the basis of SoyBase Affymetrix ™ GeneChip® Soybean Genome Array Annotation and the Arabidopsis lipid gene database [14] (http://www.plantbiology.msu.edu/lipids/genesurvey/index.htm). See Table 3 legend for additional information.
| AffyID | Gene description | Stage (year) | Higher expression |
|---|---|---|---|
| Gma.13043.1.S1_at | S1,2,3 (07);S3 (08) | Minsoy | |
| Gma.13216.2.S1_at | Ceramidase family protein | S1,2,3 (07,08) | Archer |
| Gma.4488.1.A1_s_at | Ceramidase family protein | S1,2,3 (07,08) | Archer |
| Gma.6802.1.A1_at | GLIP5, carboxylesterase/lipase | S1,3 (07);S1,2,3 (08) | Minsoy |
| GmaAffx.18905.1.S1_at | Esterase/lipase/thioesterase | S1,2 (07,08) | Minsoy |
| GmaAffx.37987.1.S1_at | Diacylglycerol acyltransferase family | S1 (07,08) | Archer |
| GmaAffx.6790.1.S1_at | Epoxide hydrolase, putative | S1,2,3 (07);S1 (08) | Minsoy |
These two probe sets may represent the same gene as they both show the highest match to the same soybean gene model.
| Specifications table | |
|---|---|
| Subject area | Biology |
| More specific subject area | Plant biology, agriculture |
| Type of data | Table of expression values after GCRMA, CEL files with raw GeneChip data |
| How data was acquired | Affymetrix Soybean GeneChips® |
| Data format | Raw data and expression values after GCRMA |
| Experimental factors | None |
| Experimental features | Microarray expression profiling to identify genes that are differentially expressed in developing seeds of the Minsoy and Archer soybean varieties or in two recombinant inbred lines derived from a Minsoy X Archer cross. |
| Data source location | Saint Paul, MN, USA. |
| Data accessibility | All microarray data and expression values after GCRMA are available at the Gene Expression Omnibus (GEO) at NCBI ( |