Literature DB >> 34220257

Prediction of gene expression under drought stress in spring wheat using codon usage pattern.

Meshal M Almutairi1, Abdullah A Alrajhi1.   

Abstract

Spring wheat (Triticum aestivum) is a staple food providing sources of essential proteins for human. In fact, gene expressions of wheat play an important role in growth and productivity that are affected by drought stress. The objective of this work focused on analysis gene feature on spring wheat represented by nucleotide and gene expressions under drought stress. It was found that the higher codon adaptation index was in both wheat root and L-galactono-1, 4-lactone dehydrogenase. It was also found that guanine and cytosine content were high (55.56%) in wheat root. Whereas, guanine and cytosine content were low (41.28%) in L-galactono-1, 4-lactone dehydrogenase. Moreover, the higher relative synonymous codon usage value was observed in codon CAA (1.20), GAA (1.33), GAT (1.00), and ATG (1.00) in wheat root and thus about 62.95% of the total variation in relative synonymous codon was explained by principal component analysis. Additionally, high averages frequency number of codon were (above 15.76) in Met, Lys, Ala, Gly, Phe, Asp, Glu, His, and Tyr; whereas, low averages were in remaining amino acids and majority (90%) of modified relative codon bias values was between 0.40 and 0.90. Shortly, calculations and analysis of codon usage pattern under drought stress would help for genetic engineering, molecular evolution, and gene prediction in wheat studies for developing varieties that associate with drought tolerance.
© 2021 The Author(s).

Entities:  

Keywords:  Codon adaptation index (CAI); Modified relative codon bias (MRCBS); Principal component analysis (PCA); Relative synonymous codon usage (RSCU); Spring wheat (Triticum aestivum)

Year:  2021        PMID: 34220257      PMCID: PMC8241893          DOI: 10.1016/j.sjbs.2021.04.015

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 2213-7106            Impact factor:   4.219


Introduction

Spring wheat that belongs to common wheat (Triticum aestivum L., 2n = 6x = 42 = AABBDD genomes) is a staple food providing sources of proteins and dietary energy for human nutrition and is the main target for genome sequences. The AABBDD genomes is hexaploid and contains 21 pairs of chromosomes that are grouped into seven homologous and orthologous genes response to many characters of common wheat (Sleper and Poehlman, 2006). In addition, fusion protein can be developed by homologous recombination through DNA repair mechanism for introducing modified genes. However, wheat has a larger gene pool showing difficult to improve its characters. Genetic transfers from A and D genomes occur through direct hybridization and homologous recombination during breeding program following by selection progress (Sleper and Poehlman, 2006). Spring wheat has shown to acclimate under drought stress through different mechanisms such as early maturity, deep root system, and ability to close stomata during period of drought stress. Some of important genes under drought stress have selected for purpose of this study. In fact, it is important to explain the functions of these selected genes in this study as following. MYB3R gene known as the cellular morphogenesis, secondary metabolism and reflecting important role in controlling cell process (Matsui et al., 2008). The novel allele of GLDH gene is essential for the ascorbic acid AsA and accumulation of respiratory complex in plant (Zhang et al., 2016). TaDREB gene is extraction from shoot and root during 10 days of growing seedlings under drought stress. Transcriptional and post-transcription factor such as nuclear factor (NFYA5) gene is importance for dehydration stresses that confer drought tolerance in oxidative stress (Jones-Rhoades and Bartel, 2004). Moreover, wheat under drought stress has different mechanisms such as osmotic adjustment (proline, betaine, and glycine) and protective metabolites. These mechanisms play important roles for maintaining normal photosynthesis under drought stress (Ma et al., 2016). Amino acids are important assessments of potential growth because of building the protein structures and functions (Tian et al., 2020). Thus, amino acids are manifest the cornerstone template for genetic information. Moreover, most amino acids are encoded by more than one codon that could be revealed by a synonymous codon (Hershberg, 2016). The triple codon (nucleotide) is the basic unit of mRNA encoding that reflects vital roles during a generation of organism. This basic unit encodes amino acids that reflect important genetic and evolution system. Furthermore, codon adaptation index (CAI) is one of the most algorithm to demonstrate a difference in the frequency of the synonymous codon for the mRNA sequence. This CAI helps to understand the gene expression, new genes discovery, and molecular mechanism for genes evolution contribution (Hershberg, 2016). Thus, the theoretical basis for genetic studied such as genetic engineering, gene prediction, and molecular studies in wheat is critical concept by analysis the codon usage pattern (Yu et al., 2015). This CAI has influence by several factors such as nucleotide composition, tRNA abundance, and structure of protein, translation processes, and more (Carbone et al., 2003, Lithwick and Margalit, 2005). At the same amino acid, genetic code associates with sibling codons that may occur more frequently resulting from several factors such as GC content (Grantham et al., 1980, Wada et al., 1990). The objective of this paper was to analyze gene feature in spring wheat under drought stress represented by nucleotide, used codon adaptation index, relative synonymous codon usage, synonymous codon and amino acid usage, modified relative codon bias, and Principal component analysis.

Material and methods

Retrieval of sequences:

Five mRNA sequences of Chinese Spring wheat (Triticum aestivum) were retrieved from the National Center for Biotechnology NCBI (www.ncbi.nlm.nih.gov) Gen-Bank and the accession number are BU672415, ES466900, HV510735, KU695148, and HQ236494 (Table 1). All these sequences were studied under drought stress at different levels during root wheat, MYB gene, enzyme named L-galactono-1,4-lactone dehydrogenase located in mitochondrion, and transcription factor for drought resistance.
Table 1

Codon adaptation index (CAI) and GC Content under drought stress for root and genes.

AccessionRoot and genesLength bpCAIGC%Authors
BU672415Wheat root23700.7455.56Anderson et al. (2011)
ES466900TaDREB gene6450.7351.47Sadeghi et al. (2007)
KU695148GLDH gene55260.7441.28Zhang et al. (2016)
HQ236494MYB3R gene23700.7346.29Cai et al. (2011)
HV510735NFYA5 gene7290.6755.56Li and Zhu. (2011)
Codon adaptation index (CAI) and GC Content under drought stress for root and genes.

Measures of codon usage patterns

A variety of analyses tool were used to measure the codon usage patterns such as codon adaptation index (CAI) given by sharp and Li. (1987), relative synonymous codon usage (RSCU) given by Yu, et al. (2015). GC content, relative codon bias strength (RCBS), and modified relative codon bias (MRCBS) applied in this study (Shoo et al., 2019). Some of codons were excluded form calculation such as AUG (methionine), UGG (tryptophan), and three stop codons such as UAA, UAG, and UGA. Therefore, these codons for all mRNA sequences were normalized (Zhou et al., 2005).

Codon adaptation index

CAI is represented by the formula: CAI = . Where N is the number of codons in the gene and relativeness is described as where is the frequency of the ith codon and is maximum frequency of the codons most often used for encoding amino acid aa.

Relative synonymous codon usage

The RSCU can be calculated by way of following formula: where xij shows the frequency of codon j coding for the ith amino acid and ni represented the number of synonymous codon encoding the ith amino acids. The interpretation for the RSCU is that RSCU greater than one indicates more frequency while RSCU small than 1 indicates less frequency in a gene for the particular codon. Whereas RSCU > 1.6 indicates over represented codon for the corresponding amino acid (Wong et al., 2010).

Modified relative codon bias

MRCSB measures the expression level of a gene (Shoo et al., 2019). MRCSBs were computed through this formula: MRCBS = where where whereas is the normalized codon frequency of a codon and (m) is the normalized frequency of base m at codon position in a gene. The reflects the maximum value of RCBS of codon encoding the same amino acid aa. The score of MRCBS ranges from zero to one.

Statistical analysis

Calculations were done through Microsoft Excel and codonW software (Peden, 2000). Moreover, two ways analyses of variances were computed using PROC GLM procedure at significate level of p-value (0.001) in SAS computer packages version 9.2 for windows (SAS Institute Inc.). Analysis of principal component helps to re-set data into a few data set and was computed using PROC PRINCOMP in SAS in order to analysis total variations.

Results

The CAI and GC content

At early stages of development in spring wheat, randomly selected transcriptional factors are known to various functions. The overall result of the codon adaptation index was 0.722 and the highest GC content was 50.03 in the wheat root while the lowest GC content was 41.28 in the GLDH gene (Table 1). The ANOVA analysis reflecting significant differences for the variables under study such as genes, amino acids, and codons with p-value < 0.0003, <0.0001, and <0.0001 respectively (Table 2).
Table 2

The ANOVA analysis for spring wheat at different growth level, amino acids, and codons.

SourceDFSSMSF-ValuePr > F
Root and genes41.430.365.530.0003
Amino acids195.970.314.62<0.0001
Codons6314.400.233.54<0.0001
The ANOVA analysis for spring wheat at different growth level, amino acids, and codons. Codon adaptation index (CAI) helps to measure the synonymous codon usage bias toward codons that highly expressed genes. The high CAI shows in both the wheat root and GLDH gene. We found that GC content in the wheat root, TaDREB gene, and NFYA5 gene had the higher CAI value excluded the NFYA5 gene. The relative adaptiveness of a codon in the CAI calculation is computed as the ratio between the frequency in the reference and the largest frequency among synonymous. The Length of the mRNA sequence did not influence the GC contact and CAI. The overall frequency of codon used was demonstrated in Fig. 1. In this figure, the black line represents the average (15.76) across all amino acid. This average divides amino acids into two groups. The First group was above average that were lysine, alanine, glycine, phenylalanine, aspartic acid, asparagine, glutamate, histidine, and tyrosine. The second group was below the average that was serine, proline, isoleucine, valine, leucine, threonine, cysteine, and arginine. This result showed the importance of amino acid metabolism under drought stress that is robust gene expression. Moreover, amino acids are more disturbed by drought stress during the transcriptional and metabolic processes.
Fig. 1

Overall frequency of number of codon used obtained from genes in this study under drought stress.

Overall frequency of number of codon used obtained from genes in this study under drought stress.

The RSCU and MRCBS

More computational tools such as RSCU and MRCBS were used to explain more about the codon usage patterns and determine gene expression as presenting in Fig. 2. RSCU stands to the ratio between the frequency of synonymous and the expected frequency of codon. The majority of genes about 90% that had MRCBS values between 0.40 and 0.90 with mean, median, and standard deviation values were 0.623, 0.627, and 0.339 respectively. RSCU greater than or equal to 1 implies high frequency whereas RSCU small than 1 means less frequency usage. When RSCU equal to 0 implies no bias codon usage existing. The higher RSCU value was observed from codon CAA (1.20), GAA (1.33), GAT (1.00), and ATG (1.00) in the wheat root, MYB3R, and TaDREB gene respectively. Besides, Pearson correlation coefficients analysis showed that the high correlation is between MRCBS and RSCU 0.64. While the second correlation is between MRCBS and RCBS 0.41 with a p-value < 0.0001.
Fig. 2

Frequency of MRCBS for all codons in all traits for Wheat.

Frequency of MRCBS for all codons in all traits for Wheat.

The PCA

The principal component analysis (PCA) was able to distinguish the genetic variance based on RSCU, RCBS, and MRCBS (see Fig. 3). PCA illustrated the most important factor that explained 62.95% of the total variation. The most effective method for codon usage bias in spring wheat genes in the first component was RSCU with eigenvalue 1.88 and proportion 0.62 while the second component was MRCBS with eigenvalue 0.77 and proportion 0.25.
Fig. 3

The principal component analysis (PCA) using relative synonymous codon usage (RSCU), relative codon bias strength (RCBS), and modified relative codon bias (MRCBS).

The principal component analysis (PCA) using relative synonymous codon usage (RSCU), relative codon bias strength (RCBS), and modified relative codon bias (MRCBS).

Discussion

Spring wheat (T. aestivum) is a staple food providing sources of essential proteins but its production is drastically impacted by the drought stress. Thus, development cultivars that drought tolerance is a major objective for wheat breeding programs. Analyzed codon patterns are significant aspect because of connotation with the genome evolution concerning nucleotide and gene expression. These nucleotide and gene expression could benefit the translation efficiency of gene population (Das et al., 2006). This analysis is also importance to increase our understanding of wheat response and its mechanism under drought stress (Sanders et al., 2009). To enhance drought tolerance, it is importance to analysis codon to ensure high expression levels with removing repeated and unnecessary digestion sites during synthesized. Furthermore, The GC contents were used for estimated codon usage in a gene and these contents help breeders to predict the conserved amino acid patterns (Araus et al., 2008). However, this GC index is more sensitive for both rare amino acids and short CDS and thus it is poorly informative that could not provide details for codon usage. Unless analysis small gene that is less 100 codons could be used in GC index to be more informative. Several studies have argued that GC contents may reflect the primary influence on codon usage patterns that change the expression profile. These results imply variation in GC contents that might be due to genetic composition and mating behavior (Mazumdar et al., 2017). It was found that Arginine and proline metabolism, alanine, aspartate, glutamate were significantly disturbed by drought stress (You et al., 2019). The content of proline could be consider as main key in plant consistency with drought stress. In Arabidopsis, gene named Atlpk2α encoding IPK2 has several functions in plant development such as root growth and pollen germination association with drought stress and involving in abscisic acid (ABA) (Wang et al., 2017, Yang et al., 2018). This gene, Atlpk2α, could be benefit for enhancing drought tolerance of wheat through strategy of genetic engineering by a synthetic gene (Shu-juan et al., 2020). Furthermore, it is a useful purpose of wheat breeding program to determine gene feature and its mechanisms associated with drought stress providing ideas for genetic manipulation (Salehe Naderia et al., 2020).

Conclusion

In this study, the relative synonymous codon usage, relative codon bias strength, and modified relative codon bias may be useful tools predicting amino acids frequency and gene expression during growth stages of spring wheat under drought stress. More study on codon usage pattern for spring wheat is necessarily to identification of expressed genes. Thus, this study was importance to demonstrate significant heterogeneity in codon usage among genes in spring wheat. The overall result of codon adaptation index was 0.722, which indicates highest genes expressed represented bias of these genes. Thus, result of codon adaption index can help for more confidence of predictions. In addition, the overall of GC content was 50.03 implies that all codons are used uniformly, which is valuable for gene expression in wheat. The idea of support our method and finding is based on largely responsible of codon usage pattern for regulation gene translations that occur during protein translation. In addition, the relative synonymous codon usage is the ratio of observed frequency of codon to expected frequency at all synonymous codons. This relative synonymous codon usage was normalized by removing methionine (ATG), tryptophan (TGG), and three stop codons. The higher RSCU value was observed in following codons: CAA, GAA, GAT, and ATG in wheat root, MYB3R, and TaDREB gene respectively indicating high frequency in these codons. Further investigation in spring wheat genes would enhance our understanding of metabolism. You et al., (2019) Found that these amino acids: arginine and proline metabolism, alanine, aspartate, and glutamate were significantly disturbed by drought stress. Lastly, PCA is used to illustrate the most important factor because PCA provides the predication accuracy by eliminating random error and redundancy. PCA explained 62.95% of the total variation and the most effect method for codon usage bias was RSCU with eigenvalue 1.88 and proportion 0.62 in spring wheat genes (Ghosh and Barman, 2016).

Funding

This paper did not receive any funding resources or specific grant from agencies in the public, commercial, or any other sectors.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  23 in total

1.  Codon adaptation index as a measure of dominating codon bias.

Authors:  A Carbone; A Zinovyev; F Képès
Journal:  Bioinformatics       Date:  2003-11-01       Impact factor: 6.937

2.  Heterogeneity in codon usages of sobemovirus genes.

Authors:  H Zhou; H Wang; L F Huang; M Naylor; P Clifford
Journal:  Arch Virol       Date:  2005-04-18       Impact factor: 2.574

3.  Tolerance responses in wheat landrace Bolani are related to enhanced metabolic adjustments under drought stress.

Authors:  Salehe Naderi; Barat-Ali Fakheri; Reza Maali-Amiri; Nafiseh Mahdinezhad
Journal:  Plant Physiol Biochem       Date:  2020-03-05       Impact factor: 4.270

4.  Genome-wide survey of the amino acid transporter gene family in wheat (Triticum aestivum L.): Identification, expression analysis and response to abiotic stress.

Authors:  Ruizheng Tian; Yang Yang; Maohua Chen
Journal:  Int J Biol Macromol       Date:  2020-08-08       Impact factor: 6.953

5.  Identification of a MYB3R gene involved in drought, salt and cold stress in wheat (Triticum aestivum L.).

Authors:  Hongsheng Cai; Shan Tian; Changlai Liu; Hansong Dong
Journal:  Gene       Date:  2011-06-30       Impact factor: 3.688

6.  The Role of Arabidopsis Inositol Polyphosphate Kinase AtIPK2β in Glucose Suppression of Seed Germination and Seedling Development.

Authors:  Qiaofeng Yang; Sihong Sang; Yao Chen; Zhaoyun Wei; Peng Wang
Journal:  Plant Cell Physiol       Date:  2018-02-01       Impact factor: 4.927

7.  Codon Usage Influences the Local Rate of Translation Elongation to Regulate Co-translational Protein Folding.

Authors:  Chien-Hung Yu; Yunkun Dang; Zhipeng Zhou; Cheng Wu; Fangzhou Zhao; Matthew S Sachs; Yi Liu
Journal:  Mol Cell       Date:  2015-08-27       Impact factor: 17.970

8.  Computational identification of plant microRNAs and their targets, including a stress-induced miRNA.

Authors:  Matthew W Jones-Rhoades; David P Bartel
Journal:  Mol Cell       Date:  2004-06-18       Impact factor: 17.970

9.  Relative predicted protein levels of functionally associated proteins are conserved across organisms.

Authors:  Gila Lithwick; Hanah Margalit
Journal:  Nucleic Acids Res       Date:  2005-02-17       Impact factor: 16.971

10.  A novel allele of L-galactono-1,4-lactone dehydrogenase is associated with enhanced drought tolerance through affecting stomatal aperture in common wheat.

Authors:  Juncheng Zhang; Bin Li; Yanping Yang; Peiyuan Mu; Weiqiang Qian; Lingli Dong; Kunpu Zhang; Xin Liu; Huanju Qin; Hongqing Ling; Daowen Wang
Journal:  Sci Rep       Date:  2016-07-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.