| Literature DB >> 36119631 |
Yuki Monden1, Hirona Tanaka2, Ryota Funakoshi3, Seiya Sunayama3, Kiyotaka Yabe3, Eri Kimoto4, Kentaro Matsumiya4, Takanori Yoshikawa4.
Abstract
Rice is the most important crop species in the world, being staple food of more than 80% of people in Asia. About 80% of rice grain is composed of carbohydrates (starch), with its protein content as low as 7-8%. Therefore, increasing the protein content of rice offers way to create a stable protein source that contributes to improving malnutrition and health problems worldwide. We detected two rice lines harboring a significantly higher protein content (namely, HP5-7 and HP7-5) in the EG4 population. The EG4 strain of rice is a unique material in that the transposon mPing has high transpositional activity and high copy numbers under natural conditions. Other research indicated that mPing is abundant in the gene-rich euchromatic regions, suggesting that mPing amplification should create new allelic variants, novel regulatory networks, and phenotypic changes in the EG4 population. Here, we aimed to identify the candidate genes and/or mPing insertion sites causing high protein content by comprehensively identifying the mPing insertion sites and carrying out an RNA-seq-based transcriptome analysis. By utilizing the next-generation sequencing (NGS)-based methods, ca. 570 mPing insertion sites were identified per line in the EG4 population. Our results also indicated that mPing apparently has a preference for inserting itself in the region near a gene, with 38 genes in total found to contain the mPing insertion in the HP lines, of which 21 and 17 genes were specific to HP5-7 and HP7-5, respectively. Transcriptome analysis revealed that most of the genes related to protein synthesis (encoding glutelin, prolamin, and globulin) were up-regulated in HP lines relative to the control line. Interestingly, the differentially expressed gene (DEG) analysis revealed that the expression levels of many genes related to photosynthesis decreased in both HP lines; this suggests the amount of starch may have decreased, indirectly contributing to the increased protein content. The high-protein lines studied here are expected to contribute to the development of high protein-content rice by introducing valuable phenotypic traits such as high and stable yield, disease resistance, and abundant nutrients.Entities:
Keywords: RNA-seq; amplicon sequencing; protein; rice; starch; transposon
Year: 2022 PMID: 36119631 PMCID: PMC9479144 DOI: 10.3389/fpls.2022.969582
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Figure 1Average protein content in the selected EG4 lines. Protein content was evaluated using three independent plants per line. Data shown are the mean ± SD of three replicates (n = 3). A statistically significant difference between the mean values was inferred from Student’s t-test (***p < 0.001).
The identified mPing insertion sites in the three lines of rice.
| Subject | C3-1 | HP5-7 | HP7-5 |
|---|---|---|---|
| No. of | 581 | 589 | 495 |
| No. of | 617 | 653 | 538 |
| No. of common insertion sites | 552 (95.0%) | 572 (97.1%) | 478 (96.6%) |
| No. of | 18 | 8 | 7 |
|
|
|
|
|
Figure 2Venn diagram showing the number of mPing insertion sites and their overlap among the three lines.
Distribution of mPing insertion sites in the rice genome.
| Line | Inside gene | Near gene | Intergenic region | Total | |||
|---|---|---|---|---|---|---|---|
| Number | Proportion (%) | Number | Proportion (%) | Number | Proportion (%) | ||
| C3-1 | 83 | 14.6 | 306 | 53.7 | 181 | 31.8 | 570 |
| HP5-7 | 81 | 14.0 | 323 | 55.7 | 176 | 30.3 | 580 |
| HP7-5 | 70 | 14.4 | 256 | 52.8 | 159 | 32.8 | 485 |
Figure 3Results of principal component analysis (PCA) based on the expression levels of 37,871 genes. Principal component scores of seed storage protein genes are plotted as green dots.
Figure 4Expression levels of the genes encoding glutelin, prolamin, and globulin, based on the RNA-seq results. The log2[fold-change] value was calculated by comparison with the expression levels of C3-1 (the control).
Figure 5Validation of expression patterns of the selected three genes by qRT-PCR. The expression level of each gene was calculated relative to that of the Actin gene that served as an internal standard. Data shown are the mean ± SD of three replicates. A statistically significant difference between the mean values was inferred from Student’s t-test (*p < 0.05, **p < 0.01, and ***p < 0.001).
Figure 6The enriched Gene Ontology (GO) terms of the identified 1,278 DEGs commonly detected in the HP lines. The top-ranked GO terms for the down-regulated DEGs in HP lines compared with C3-1.