Literature DB >> 12034849

Conserved codon composition of ribosomal protein coding genes in Escherichia coli, Mycobacterium tuberculosis and Saccharomyces cerevisiae: lessons from supervised machine learning in functional genomics.

Kui Lin1, Yuyu Kuang, Jeremiah S Joseph, Prasanna R Kolatkar.   

Abstract

Genomics projects have resulted in a flood of sequence data. Functional annotation currently relies almost exclusively on inter-species sequence comparison and is restricted in cases of limited data from related species and widely divergent sequences with no known homologs. Here, we demonstrate that codon composition, a fusion of codon usage bias and amino acid composition signals, can accurately discriminate, in the absence of sequence homology information, cytoplasmic ribosomal protein genes from all other genes of known function in Saccharomyces cerevisiae, Escherichia coli and Mycobacterium tuberculosis using an implementation of support vector machines, SVM(light). Analysis of these codon composition signals is instructive in determining features that confer individuality to ribosomal protein genes. Each of the sets of positively charged, negatively charged and small hydrophobic residues, as well as codon bias, contribute to their distinctive codon composition profile. The representation of all these signals is sensitively detected, combined and augmented by the SVMs to perform an accurate classification. Of special mention is an obvious outlier, yeast gene RPL22B, highly homologous to RPL22A but employing very different codon usage, perhaps indicating a non-ribosomal function. Finally, we propose that codon composition be used in combination with other attributes in gene/protein classification by supervised machine learning algorithms.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12034849      PMCID: PMC117187          DOI: 10.1093/nar/30.11.2599

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  30 in total

Review 1.  The ribosome in focus.

Authors:  B A Maguire; R A Zimmermann
Journal:  Cell       Date:  2001-03-23       Impact factor: 41.582

Review 2.  The ribosome at atomic resolution.

Authors:  P B Moore
Journal:  Biochemistry       Date:  2001-03-20       Impact factor: 3.162

Review 3.  Whole-genome expression analysis: challenges beyond clustering.

Authors:  R B Altman; S Raychaudhuri
Journal:  Curr Opin Struct Biol       Date:  2001-06       Impact factor: 6.809

4.  Multifunctional DNA-binding proteins mediate concerted transcription activation of yeast ribosomal protein genes.

Authors:  W H Mager; R J Planta
Journal:  Biochim Biophys Acta       Date:  1990-08-27

5.  The selection-mutation-drift theory of synonymous codon usage.

Authors:  M Bulmer
Journal:  Genetics       Date:  1991-11       Impact factor: 4.562

Review 6.  Synonymous codon usage in Saccharomyces cerevisiae.

Authors:  P M Sharp; E Cowe
Journal:  Yeast       Date:  1991-10       Impact factor: 3.239

7.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes.

Authors:  P M Sharp; T M Tuohy; K R Mosurski
Journal:  Nucleic Acids Res       Date:  1986-07-11       Impact factor: 16.971

8.  Codon usage in bacteria: correlation with gene expressivity.

Authors:  M Gouy; C Gautier
Journal:  Nucleic Acids Res       Date:  1982-11-25       Impact factor: 16.971

Review 9.  Codon usage and tRNA content in unicellular and multicellular organisms.

Authors:  T Ikemura
Journal:  Mol Biol Evol       Date:  1985-01       Impact factor: 16.240

10.  The small nucleolar RNP protein NOP1 (fibrillarin) is required for pre-rRNA processing in yeast.

Authors:  D Tollervey; H Lehtonen; M Carmo-Fonseca; E C Hurt
Journal:  EMBO J       Date:  1991-03       Impact factor: 11.598

View more
  20 in total

1.  Use and misuse of correspondence analysis in codon usage studies.

Authors:  Guy Perrière; Jean Thioulouse
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

2.  Gene expression, nucleotide composition and codon usage bias of genes associated with human Y chromosome.

Authors:  Monisha Nath Choudhury; Arif Uddin; Supriyo Chakraborty
Journal:  Genetica       Date:  2017-04-18       Impact factor: 1.082

3.  Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.

Authors:  Jainendra Pathak; Vinod K Kannaujiya; Shailendra P Singh; Rajeshwar P Sinha
Journal:  3 Biotech       Date:  2017-06-29       Impact factor: 2.406

4.  Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants.

Authors:  Liangjiang Wang; Marilyn J Roossinck
Journal:  Plant Mol Biol       Date:  2006-07       Impact factor: 4.076

Review 5.  DNA array-based gene profiling: from surgical specimen to the molecular portrait of cancer.

Authors:  Simone Mocellin; Maurizio Provenzano; Carlo Riccardo Rossi; Pierluigi Pilati; Donato Nitti; Mario Lise
Journal:  Ann Surg       Date:  2005-01       Impact factor: 12.969

6.  Analysis of synonymous codon usage in Zea mays.

Authors:  Hanmei Liu; Rui He; Huaiyu Zhang; Yubi Huang; Mengliang Tian; Junjie Zhang
Journal:  Mol Biol Rep       Date:  2009-03-29       Impact factor: 2.316

7.  Evolutionary and genetic analyses of mitochondrial translation initiation factors identify the missing mitochondrial IF3 in S. cerevisiae.

Authors:  Gemma C Atkinson; Anton Kuzmenko; Piotr Kamenski; Mikhail Y Vysokikh; Valentina Lakunina; Stoyan Tankov; Ekaterina Smirnova; Aksel Soosaar; Tanel Tenson; Vasili Hauryliuk
Journal:  Nucleic Acids Res       Date:  2012-03-28       Impact factor: 16.971

8.  Use of a multi-way method to analyze the amino acid composition of a conserved group of orthologous proteins in prokaryotes.

Authors:  Alberto Pasamontes; Santiago Garcia-Vallve
Journal:  BMC Bioinformatics       Date:  2006-05-18       Impact factor: 3.169

9.  Tissue-specific differences in human transfer RNA expression.

Authors:  Kimberly A Dittmar; Jeffrey M Goodenbour; Tao Pan
Journal:  PLoS Genet       Date:  2006-11-13       Impact factor: 5.917

10.  On the limitations of using ribosomal genes as references for the study of codon usage: a rebuttal.

Authors:  Ruth Hershberg; Dmitri A Petrov
Journal:  PLoS One       Date:  2012-12-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.