| Literature DB >> 23251043 |
Rajkumari Sanjukta1, Mohammad Samir Farooqi, Naveen Sharma, Anil Rai, Dwijesh Chandra Mishra, Dhananjaya P Singh.
Abstract
Chromohalobacter salexigens, a Gammaproteobacterium belonging to the family Halomonadaceae, shows a broad salinity range for growth. In order to reveal the factors influencing architecture of protein coding genes in C. salexigens, pattern of synonymous codon usage bias has been investigated. Overall codon usage analysis of the microorganism revealed that C and G ending codons are predominantly used in all the genes which are indicative of mutational bias. Multivariate statistical analysis showed that the genes are separated along the first major explanatory axis according to their expression levels and their genomic GC content at the synonymous third positions of the codons. Both NC plot and correspondence analysis on Relative Synonymous Codon Usage (RSCU) indicates that the variation in codon usage among the genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. Gene length and the hydrophobicity of the encoded protein also influence the codon usage variation of genes to some extent. A comparison of the relative synonymous codon usage between 10% each of highly and lowly expressed genes determines 23 optimal codons, which are statistically over represented in the former group of genes and may provide useful information for salt-stressed gene prediction and gene-transformation. Furthermore, genes for regulatory functions; mobile and extrachromosomal element functions; and cell envelope are observed to be highly expressed. The study could provide insight into the gene expression response of halophilic bacteria and facilitate establishment of effective strategies to develop salt-tolerant crops of agronomic value.Entities:
Keywords: Codon usage pattern; Correspondence analysis; Halophilic bacteria; Mutational bias; Relative synonymous codon usage
Year: 2012 PMID: 23251043 PMCID: PMC3523223 DOI: 10.6026/97320630081087
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Nc plot of 2147 genes of C. salexigens. The continuous curve between GC3s and Nc under random condon usage.
Figure 2(A) Frequency of GC3s; (B) Frequency of Nc
Figure 3(A) Plot of Nc versus gene length; (B) Plot of GC3s versus gene length; (C) Plot of gene position on axis1 versus gene length; (D) Correspondence analysis of Relative Synonymous Codon Usage values of C. salexigens genes; (E) Scatter plot of gene position on axis 1 and Nc values; (F) Scatter plot of gene position on axis 1 and GC3s values; (G) Scatter plot of gene position on axis 1 and CAI values.
Figure 4Expression level of functional genes of C. salixigens on the basis of Axis 1 based on RSCU values. AA= Amino acid biosynthesis; BCP=Biosynthesis of cofactor, Prosthetic groups and carriers; CE=Cell envelope; CP=Cellular processes; CIM=Central intermediary metabolism; DM=DNA metabolism; EM=Energy metabolism; FPM=Fatty acid & phospholipid metabolism; MEF=Mobile & extrachromosomal element functions; PF=Protein fate; PS=Protein synthesis; PPN=Purines, pyrimidines, nucleosides & nucleotides; RF=Regulatory functions; ST=Signal transduction; T=Transcription; TBP=Transport & binding protein