
Comparative genomics using data mining tools.

Tannistha Nandi, Chandrika B-Rao, Srinivasan Ramachandran.

Abstract

We have analysed the genomes of representatives of three kingdoms of life, namely archaea, eubacteria and eukaryota, using data mining tools based on compositional analyses of the protein sequences. The representatives chosen for this analysis were Methanococcus jannaschii, Haemophilus influenzae and Saccharomyces cerevisiae. We have identified features of the protein evolution patterns that are shared among the three genomes and features that differ between them. M. jannaschii has a greater number of proteins rich in charged amino acids, whereas S. cerevisiae has a greater number of hydrophilic proteins. Despite these differences in intrinsic compositional characteristics between the proteins of the different genomes, we have also identified certain common characteristics. We carried out exploratory Principal Component Analysis of the multivariate data on the proteins of each organism in an effort to classify the proteins into clusters. Interestingly, we found that most of the proteins in each organism cluster closely together, but there are a few 'outliers'. We focus on these outliers for functional investigation, which may help reveal unique features of the biology of the respective organisms.
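
The kind of analysis the abstract describes (compositional features per protein, exploratory Principal Component Analysis, then attention to outliers) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the abstract does not state which compositional variables were used, so the 20 amino acid frequencies, the use of two components, and the distance-based outlier rule are all assumptions, and the sequences are synthetic toy data rather than proteins from any of the three genomes.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def composition(seq):
    """Fraction of each of the 20 standard amino acids in a sequence."""
    seq = seq.upper()
    counts = np.array([seq.count(a) for a in AMINO_ACIDS], dtype=float)
    total = counts.sum()
    return counts / total if total > 0 else counts

def pca_scores(X, n_components=2):
    """Project rows of X onto the leading principal components (via SVD)."""
    Xc = X - X.mean(axis=0)                      # centre each feature
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T

def outliers(scores, z=2.0):
    """Indices of points unusually far from the centroid in PC space.

    The cutoff (mean distance + z standard deviations) is an assumed,
    illustrative rule, not one taken from the paper.
    """
    d = np.linalg.norm(scores - scores.mean(axis=0), axis=1)
    return np.where(d > d.mean() + z * d.std())[0]

# Synthetic toy data: 30 sequences of roughly uniform composition
# plus one made only of charged residues (lysine and glutamate).
rng = np.random.default_rng(0)
typical = ["".join(rng.choice(list(AMINO_ACIDS), size=200)) for _ in range(30)]
unusual = "K" * 100 + "E" * 100
proteins = typical + [unusual]

X = np.array([composition(s) for s in proteins])
scores = pca_scores(X)
flagged = outliers(scores)
print(flagged)
```

On real data one would read the protein sequences of a genome from a FASTA file and might add derived variables such as the fraction of charged or hydrophilic residues; the proteins flagged at the end would then be candidates for the functional follow-up the abstract mentions.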

Year:  2002        PMID: 11927774     DOI: 10.1007/BF02703680

Source DB:  PubMed          Journal:  J Biosci        ISSN: 0250-5991            Impact factor:   1.826


