| Literature DB >> 19131380 |
Uttam Roymondal1, Shibsankar Das, Satyabrata Sahoo.
Abstract
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful measure of an extent of the codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length indicating natural selection in favour of the shorter genes to be expressed at higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19131380 PMCID: PMC2646356 DOI: 10.1093/dnares/dsn029
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Distribution of RCBS for all coding genes in the genome of E. coli.
Figure 2RCBS plotted against CAI for E. coli genes.
Figure 3RCBS plotted against E(g)[18] for E. coli genes.
Figure 4RCBS plotted against the length of 4174 genes from the E. coli genome.
Figure 5RCBS (+) and E(g) (*) plotted against relative molecular abundance of 96 genes from E. coli genome.[18] RMB denotes relative molecular abundance. X-axis is taken in logarithmic scale.
Figure 6CAI (+) and RCBS (*) plotted against protein concentration of 45 genes from the E. coli genome.[24] X-axis is taken in logarithmic scale.
Figure 7Radioactive data (+) and microarray data (*)[35] plotted against RCBS for E. coli genes. Y-axis is taken in logarithmic scale.
RCBS of the highly expressed genes of different functional class in the E. coli genome
| Functional class | Gene | RCBS | Gene | RCBS | Gene | RCBS | Gene | RCBS |
|---|---|---|---|---|---|---|---|---|
| Ribosomal | 0.50496 | 0.74635 | 0.87367 | 1.08922 | ||||
| 0.56061 | 0.75111 | 0.88011 | 1.09439 | |||||
| 0.60728 | 0.75859 | 0.90076 | 1.16165 | |||||
| 0.61255 | 0.76302 | 0.90877 | 1.24694 | |||||
| 0.62318 | 0.79227 | 0.91121 | 1.2494 | |||||
| 0.62913 | 0.79299 | 0.92341 | 1.24983 | |||||
| 0.67119 | 0.80176 | 0.92858 | 1.3063 | |||||
| 0.67126 | 0.80995 | 0.9446 | 1.3222 | |||||
| 0.67627 | 0.81499 | 0.95959 | 1.32324 | |||||
| 0.7021 | 0.82165 | 1.00068 | 1.49921 | |||||
| 0.71682 | 0.84223 | 1.00368 | 1.60846 | |||||
| 0.7302 | 0.84341 | 1.03424 | 1.66876 | |||||
| 0.7344 | 0.84538 | 1.04844 | 1.77046 | |||||
| 0.73917 | 0.85618 | 1.05606 | – | – | ||||
| Translational | 0.70878 | 0.50131 | 1.03184 | 0.70761 | ||||
| 1.31636 | 1.11799 | 1.02752 | 0.85208 | |||||
| 0.77909 | 1.03184 | 1.11995 | 0.94012 | |||||
| 0.72335 | 1.11995 | 1.11995 | 0.86312 | |||||
| 0.7532 | 1.11995 | 1.06128 | 0.52763 | |||||
| Transcriptional | 0.64494 | 0.81972 | 0.71495 | 0.874 | ||||
| 0.91144 | 0.61192 | 0.77923 | 0.66054 | |||||
| 0.68275 | 0.52545 | 0.68232 | 0.53879 | |||||
| 1.2802 | 0.88747 | 0.54866 | 0.59593 | |||||
| 1.12974 | 0.73934 | 0.65143 | 0.60395 | |||||
| 0.87402 | 0.5234 | 0.53467 | 0.53095 | |||||
| 0.62977 | 0.66651 | 0.66692 | 0.60661 | |||||
| 0.58028 | 0.62894 | 0.53475 | 0.6079 | |||||
| 0.504 | 0.55743 | 0.51287 | – | – | ||||
| CH and folding | 0.81384 | 0.90549 | 0.62208 | 0.66081 | ||||
| 0.5747 | 0.82021 | 0.66931 | 0.85476 | |||||
| 0.65259 | 0.62877 | 0.73575 | 0.60592 | |||||
| 0.59085 | 0.51531 | 0.51362 | 0.74434 | |||||
| 0.63123 | 0.49623 | 0.50787 | 0.79986 | |||||
| 0.55943 | 0.5791 | 0.65291 | – | – | ||||
| 0.51531 | 0.56129 | 0.70111 | – | – | ||||
| 0.51569 | 0.51315 | 0.96923 | – | – | ||||
| Outer membrane | 0.73214 | 1.03758 | 0.59077 | 0.69853 | ||||
| 0.52949 | 0.63223 | 0.58718 | – | – | ||||
| 0.51413 | 0.90683 | 0.57797 | – | – | ||||
| 0.79079 | 0.50225 | 0.53974 | – | – | ||||
| Post-translational | 0.50362 | 0.50521 | 0.65324 | 0.66442 | ||||
| DNA repair/replication/recombination | 0.49781 | 0.70777 | 0.58392 | 0.53058 | ||||
| 0.66454 | 0.97108 | 0.58088 | 0.71106 | |||||
| 0.57421 | 0.74465 | 0.51482 | 0.865 | |||||
| 0.93575 | 0.55962 | 0.60858 | 0.59001 | |||||
| RNA modification | 0.55764 | 0.59733 | 0.62977 | 0.72106 | ||||
| DNA degradation | 0.53058 | 0.865 | – | – | – | |||
| Degradation of Proteins/peptides/glycopeptides | 0.4998 | 0.51382 | 0.53736 | 0.55166 | ||||
| Degradation of small molecules | 0.58128 | 0.57401 | 1.33277 | – | ||||
| Nucleoprotein and basic protein | 0.51407 | 0.73934 | 0.85476 | 1.29474 | ||||
| 0.55438 | 0.74434 | 0.93575 | – | – | ||||
| 0.58392 | 0.74465 | 0.97108 | – | – | ||||
| Aminoacyl tRNA synthase | 0.52912 | 0.54138 | 2.38353 | 0.52017 | ||||
| 0.5786 | – | – | – | – | – | – | ||
| Energy metabolism | ||||||||
| Glycolysis | 0.99727 | 0.87498 | 0.67783 | 0.62056 | ||||
| 0.7547 | 0.65413 | 0.76595 | 0.80293 | |||||
| TCA cycle | 0.55763 | 0.51856 | 0.50409 | 0.62233 | ||||
| Pentose phosphate pathway | 0.58526 | 0.63261 | ||||||
| ATP synthase | 0.64784 | 0.51365 | 0.64873 | 1.08527 | ||||
| 0.60762 | ||||||||
| Pyruvate dehydronage | 0.57263 | 0.55269 | 0.56421 | |||||
| Aerobic respiration | 0.53164 | 0.56129 | 0.54378 | 0.61103 | ||||
| 0.61485 | 0.70885 | 0.59343 | ||||||
| Anaerobic respiration | 0.73468 | 0.62208 | 0.60086 | 0.75126 | ||||
| 0.72395 | 0.69364 | 0.52986 | 0.52458 | |||||
| 0.54693 | 0.67865 | 0.50787 | ||||||
| 0.51315 | 0.56922 | 0.87609 | ||||||
| Electron transport | 0.61336 | 0.61409 | 0.60624 | 0.56769 | ||||
| Flagellum biogenesis | 0.54626 | 0.67522 | 0.52105 | 0.51569 | ||||
| 0.66739 | 0.5854 | |||||||
| Transport of small molecules | 0.50273 | 0.51092 | 0.58718 | |||||
| Salvage of nucleocides and nucleotides | 0.73291 | 0.63634 | 0.51826 | 0.69492 | ||||
| 0.55136 | 0.57449 | 0.56649 | ||||||
| Central intermediary metabolism | 0.59133 | 0.51347 | 0.76667 | 0.52297 | ||||
| 0.51485 | 0.63455 | 0.52318 | 0.72458 | |||||
| 0.60213 | ||||||||
| Carbohydrate metabolism | 0.62187 | 0.50361 | 0.51605 | 0.57269 | ||||
| 0.53522 | 0.56421 | 0.52297 | 0.59595 | |||||
| 0.60215 | ||||||||
| Phosphorus metabolism | 0.51705 | 0.5871 | 0.6365 | 0.66563 | ||||
| 0.5443 | ||||||||
| Nitrogen metabolism | 0.53274 | 0.65458 | ||||||
| Sulphur metabolism | 0.51334 | |||||||
| Amines metabolism | 0.57934 | |||||||
| Amino acid biosynthesis | 0.51962 | 0.54244 | 1.32851 | 0.5234 | ||||
| 0.51627 | 0.596 | 1.51982 | 2.8411 | |||||
| 0.51995 | 0.57258 | 0.84298 | 0.62785 | |||||
| 0.5114 | 1.99822 | 1.76046 | 1.7054 | |||||
| 0.81972 | 0.54397 | 1.93311 | 3.41556 | |||||
| 0.60479 | ||||||||
| Fatty acid biosynthesis | 0.57451 | 0.55757 | 0.54893 | 0.60055 | ||||
| 0.55661 | 0.67664 | 0.58465 | ||||||
| Nucleotide biosynthesis | 0.76156 | 0.79214 | 0.5899 | 1.1651 | ||||
| 0.58481 | 0.53711 | |||||||
| Cofactor and small molecule biosynthesis | 0.87498 | 0.50538 | 0.58446 | 0.52458 | ||||
| 0.57258 | 0.50171 | 0.59736 | ||||||
| 0.60086 | 0.61154 | 0.78241 | ||||||
| Macromolecule biosynthesis | 0.55326 | 0.55757 | 0.79395 | 0.52949 | ||||
| 0.82199 | 0.57714 | 0.62205 | 0.66531 | |||||
| 0.81384 | 0.89234 | 0.50455 | 0.50225 | |||||
| 0.56769 | 0.55662 | 1.632 | 0.75124 | |||||
| 0.72071 | ||||||||
| Inner membrane | 0.81384 | 0.53708 | 0.58505 | 0.53297 | ||||
| 0.53164 | 0.57954 | 0.55456 | 0.51556 | |||||
| 0.61485 | 0.52986 | 0.50064 | 0.50746 | |||||
| 0.55757 | 0.54378 | 0.71217 | 0.54162 | |||||
| 0.73468 | 0.61103 | 0.62392 | 0.68009 | |||||
| 0.72395 | 0.50273 | 0.74646 | 0.7111 | |||||
| 0.596 | 0.86696 | 0.62656 | 0.60738 | |||||
| 1.632 | 0.7921 | 0.65847 | ||||||
| 0.61263 | 0.55105 | 0.64364 | ||||||
| Transport | 0.76533 | 0.54244 | 0.93025 | 0.54377 | ||||
| 0.5484 | 0.596 | 0.51092 | 0.75473 | |||||
| 0.51995 | 0.57954 | 0.5388 | 0.62148 | |||||
| 0.53164 | 0.51943 | 0.54592 | 0.69682 | |||||
| 0.53708 | 0.74374 | 0.57401 | 0.72924 | |||||
| 0.56475 | 0.61263 | 0.70188 | 0.71983 | |||||
| 0.76102 | 0.55214 | 0.50339 | 0.51334 | |||||
| 0.53522 | 0.65397 | 0.5262 | 0.66442 | |||||
| 0.54627 | 0.59343 | 1.13838 | 0.62785 | |||||
| 0.6849 | 0.50273 | 0.61754 | ||||||
| Regulator | 0.57732 | 0.51672 | 0.62205 | 0.5528 | ||||
| 0.50596 | 1.78721 | 1.34529 | 0.51628 | |||||
| 0.73214 | 0.75282 | 0.58559 | 0.54534 | |||||
| 0.83793 | 0.86637 | 0.5642 | 0.62229 | |||||
Predicted expression levels of highly expressed prophage genes
| Gene | Description | RCBS |
|---|---|---|
| CP4-44 prophage; predicted protein | 0.76113 | |
| CP4-57 prophage; DNA-binding transcriptional activator | 0.64494 | |
| CP4-57 prophage; predicted inner membrane protein | 0.7551 | |
| CP4-57 prophage; predicted inner membrane protein | 1.07646 | |
| CP4-57 prophage; predicted protein | 0.56069 | |
| CP4-6 prophage; antitoxin of the YkfI–YafW toxin–antitoxin system | 0.54248 | |
| CPS-53 (KpLE1) prophage; conserved protein | 0.60714 | |
| CPS-53 (KpLE1) prophage; predicted protein | 0.54524 | |
| CPS-53 (KpLE1) prophage; predicted protein | 0.59437 | |
| CPZ-55 prophage; predicted protein | 0.72955 | |
| DLP12 prophage; conserved protein | 0.61069 | |
| DLP12 prophage; endonuclease RUS | 0.53058 | |
| DLP12 prophage; multidrug resistance protein | 0.65874 | |
| DLP12 prophage; predicted lipoprotein | 0.50128 | |
| DLP12 prophage; predicted lipoprotein | 0.98537 | |
| DLP12 prophage; predicted phage lysis protein | 0.77232 | |
| DLP12 prophage; predicted protein | 0.56517 | |
| DLP12 prophage; predicted protein | 0.67154 | |
| DLP12 prophage; predicted protein | 1.05554 | |
| e14 prophage; 5-methylcytosine-specific restriction endonuclease B | 0.67815 | |
| e14 prophage; predicted DNA-binding transcriptional regulator | 0.79718 | |
| e14 prophage; predicted inner membrane protein | 0.50282 | |
| e14 prophage; predicted integrase | 0.55764 | |
| e14 prophage; predicted protein | 1.3517 | |
| hypothetical protein, DLP12 prophage | 1.56134 | |
| KpLE2 phage-like element; iron-dicitrate transporter subunit | 0.6455 | |
| KpLE2 phage-like element; IS1 repressor protein InsA | 0.52239 | |
| KpLE2 phage-like element; IS2 insertion element repressor InsA | 0.69853 | |
| KpLE2 phage-like element; IS30 transposase | 0.6955 | |
| Qin prophage; bifunctional antitoxin of the RelE–RelB toxin–antitoxin system/transcriptional repressor | 0.68232 | |
| Qin prophage; cell division inhibition protein | 0.66801 | |
| Qin prophage; cold shock protein | 0.52261 | |
| Qin prophage; cold shock protein | 0.5891 | |
| Qin prophage; cold shock protein | 0.80085 | |
| Qin prophage; DNA-binding transcriptional regulator for DicB | 0.69275 | |
| Qin prophage; predicted DNA-binding transcriptional regulator | 0.50987 | |
| Qin prophage; predicted protein | 0.69704 | |
| Qin prophage; predicted protein | 0.82038 | |
| Qin prophage; predicted protein | 0.83742 | |
| Qin prophage; predicted protein | 0.95351 | |
| Qin prophage; predicted protein | 1.34218 | |
| Qin prophage; predicted S lysis protein | 0.62869 | |
| Qin prophage; small toxic polypeptide | 0.75743 | |
| Qin prophage; toxin of the RelE–RelB toxin–antitoxin system | 0.54866 |
Predicted expression levels of tRNA genes
| Gene | RCBS | Gene | RCBS | Gene | RCBS | Gene | RCBS |
|---|---|---|---|---|---|---|---|
| 1.35584 | 1.96033 | 1.06805 | 1.15723 | ||||
| 1.35584 | 1.96033 | 1.18771 | 1.32755 | ||||
| 1.5556 | 1.85009 | 1.23093 | 1.45877 | ||||
| 1.5556 | 1.85009 | 1.3515 | 1.45877 | ||||
| 1.5556 | 1.85009 | 1.91913 | 1.175 | ||||
| 1.40468 | 1.85009 | 1.91913 | 1.27061 | ||||
| 1.67244 | 1.32551 | 1.91913 | 1.27325 | ||||
| 1.76167 | 1.32551 | 1.91913 | 1.7256 | ||||
| 1.76167 | 1.32551 | 1.91913 | 1.62046 | ||||
| 1.76167 | 1.32551 | 1.91913 | 1.00445 | ||||
| 1.76167 | 1.33638 | 1.22225 | 1.0433 | ||||
| 1.99759 | 1.47125 | 1.32682 | 1.0433 | ||||
| 1.87865 | 1.21868 | 1.32682 | 1.37166 | ||||
| 1.87865 | 1.41462 | 1.32682 | 1.37566 | ||||
| 1.87865 | 1.42883 | 1.36722 | 1.37566 | ||||
| 1.87865 | 1.42883 | 1.36722 | 1.37566 | ||||
| 1.38539 | 1.42883 | 1.38483 | 1.37566 | ||||
| 1.38539 | 1.45397 | 1.38483 | 1.37566 | ||||
| 1.38539 | 1.02415 | 1.26942 | 1.6125 | ||||
| 1.35851 | 1.03107 | 1.38923 | 1.28639 | ||||
| 1.65127 | 1.03107 | 1.44416 | – | ||||
| 1.65127 | 1.03107 | 1.14888 | – |
Predicted expression levels of highly expressed hypothetical protein genes
| Gene | RCBS | Gene | RCBS | Gene | RCBS |
|---|---|---|---|---|---|
| 0.51055 | 0.77343 | 1.09738 | |||
| 0.51884 | 0.84734 | 1.15141 | |||
| 0.58644 | 0.85155 | 1.48121 | |||
| 0.63276 | 0.92206 | 1.86114 | |||
| 0.63348 | 1.07903 | 1.56134 |