BACKGROUND: Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. RESULTS: Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. CONCLUSIONS: Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment.
BACKGROUND: Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. RESULTS: Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. CONCLUSIONS: Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment.
Authors: Hidehiro Toh; Brian L Weiss; Sarah A H Perkin; Atsushi Yamashita; Kenshiro Oshima; Masahira Hattori; Serap Aksoy Journal: Genome Res Date: 2005-12-19 Impact factor: 9.043
Authors: F R Blattner; G Plunkett; C A Bloch; N T Perna; V Burland; M Riley; J Collado-Vides; J D Glasner; C K Rode; G F Mayhew; J Gregor; N W Davis; H A Kirkpatrick; M A Goeden; D J Rose; B Mau; Y Shao Journal: Science Date: 1997-09-05 Impact factor: 47.728
Authors: N T Perna; G Plunkett; V Burland; B Mau; J D Glasner; D J Rose; G F Mayhew; P S Evans; J Gregor; H A Kirkpatrick; G Pósfai; J Hackett; S Klink; A Boutin; Y Shao; L Miller; E J Grotbeck; N W Davis; A Lim; E T Dimalanta; K D Potamousis; J Apodaca; T S Anantharaman; J Lin; G Yen; D C Schwartz; R A Welch; F R Blattner Journal: Nature Date: 2001-01-25 Impact factor: 49.962
Authors: S D Bentley; K F Chater; A-M Cerdeño-Tárraga; G L Challis; N R Thomson; K D James; D E Harris; M A Quail; H Kieser; D Harper; A Bateman; S Brown; G Chandra; C W Chen; M Collins; A Cronin; A Fraser; A Goble; J Hidalgo; T Hornsby; S Howarth; C-H Huang; T Kieser; L Larke; L Murphy; K Oliver; S O'Neil; E Rabbinowitsch; M-A Rajandream; K Rutherford; S Rutter; K Seeger; D Saunders; S Sharp; R Squares; S Squares; K Taylor; T Warren; A Wietzorrek; J Woodward; B G Barrell; J Parkhill; D A Hopwood Journal: Nature Date: 2002-05-09 Impact factor: 49.962
Authors: Shawn R Starkenburg; Frank W Larimer; Lisa Y Stein; Martin G Klotz; Patrick S G Chain; Luis A Sayavedra-Soto; Amisha T Poret-Peterson; Mira E Gentry; Daniel J Arp; Bess Ward; Peter J Bottomley Journal: Appl Environ Microbiol Date: 2008-03-07 Impact factor: 4.792
Authors: Troels T Marstrand; Jes Frellsen; Ida Moltke; Martin Thiim; Eivind Valen; Dorota Retelska; Anders Krogh Journal: PLoS One Date: 2008-02-20 Impact factor: 3.240
Authors: Shelley Campeau Miller; Stephen F Porcella; Sandra J Raffel; Tom G Schwan; Alan G Barbour Journal: J Bacteriol Date: 2013-06-07 Impact factor: 3.490
Authors: Madhan R Tirumalai; Rajat Rastogi; Nader Zamani; Elisha O'Bryant Williams; Shamail Allen; Fatma Diouf; Sharon Kwende; George M Weinstock; Kasthuri J Venkateswaran; George E Fox Journal: PLoS One Date: 2013-06-14 Impact factor: 3.240
Authors: Tina Paradzik; Nives Ivic; Zelimira Filic; Babu A Manjasetty; Paul Herron; Marija Luic; Dusica Vujaklija Journal: Nucleic Acids Res Date: 2013-02-07 Impact factor: 16.971