Literature DB >> 18288471

Homology and phylogeny and their automated inference.

Georg Fuellen1.   

Abstract

The analysis of the ever-increasing amount of biological and biomedical data can be pushed forward by comparing the data within and among species. For example, an integrative analysis of data from the genome sequencing projects for various species traces the evolution of the genomes and identifies conserved and innovative parts. Here, I review the foundations and advantages of this "historical" approach and evaluate recent attempts at automating such analyses. Biological data is comparable if a common origin exists (homology), as is the case for members of a gene family originating via duplication of an ancestral gene. If the family has relatives in other species, we can assume that the ancestral gene was present in the ancestral species from which all the other species evolved. In particular, describing the relationships among the duplicated biological sequences found in the various species is often possible by a phylogeny, which is more informative than homology statements. Detecting and elaborating on common origins may answer how certain biological sequences developed, and predict what sequences are in a particular species and what their function is. Such knowledge transfer from sequences in one species to the homologous sequences of the other is based on the principle of 'my closest relative looks and behaves like I do', often referred to as 'guilt by association'. To enable knowledge transfer on a large scale, several automated 'phylogenomics pipelines' have been developed in recent years, and seven of these will be described and compared. Overall, the examples in this review demonstrate that homology and phylogeny analyses, done on a large (and automated) scale, can give insights into function in biology and biomedicine.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18288471     DOI: 10.1007/s00114-008-0348-1

Source DB:  PubMed          Journal:  Naturwissenschaften        ISSN: 0028-1042


  85 in total

1.  Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption.

Authors:  M Y Galperin; E V Koonin
Journal:  In Silico Biol       Date:  1998

2.  Intrinsic errors in genome annotation.

Authors:  D Devos; A Valencia
Journal:  Trends Genet       Date:  2001-08       Impact factor: 11.639

Review 3.  Phylogenomic inference of protein molecular function: advances and challenges.

Authors:  Kimmen Sjölander
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

4.  An accurate, sensitive, and scalable method to identify functional sites in protein structures.

Authors:  Hui Yao; David M Kristensen; Ivana Mihalek; Mathew E Sowa; Chad Shaw; Marek Kimmel; Lydia Kavraki; Olivier Lichtarge
Journal:  J Mol Biol       Date:  2003-02-07       Impact factor: 5.469

5.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

6.  Protein sequence similarity searches using patterns as seeds.

Authors:  Z Zhang; A A Schäffer; W Miller; T L Madden; D J Lipman; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  1998-09-01       Impact factor: 16.971

Review 7.  Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis.

Authors:  J A Eisen
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

8.  Evolution of the nuclear receptor superfamily: early diversification from an ancestral orphan receptor.

Authors:  V Laudet
Journal:  J Mol Endocrinol       Date:  1997-12       Impact factor: 5.098

9.  Distinguishing homologous from analogous proteins.

Authors:  W M Fitch
Journal:  Syst Zool       Date:  1970-06

10.  Orthologs and paralogs - we need to get it right.

Authors:  R A Jensen
Journal:  Genome Biol       Date:  2001-08-03       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.