Towards a covering set of protein family profiles.

A Heger, L Holm.

Abstract

Evolutionary classification leads to an economical description of the protein sequence universe because attributes of function and structure are inherited in protein families. Efficient strategies of functional and structural genomics therefore target one representative from each family. Enumerating all families and establishing family membership consistently based on sequence similarities are nontrivial computational problems. Emerging concepts and caveats of global sequence clustering are reviewed. Explicit multiple alignments coupled with neighbourhood analysis lead to domain segmentation, and hierarchical unification helps to resolve conflicts and validate clusters. Eventually, every part of every sequence will be assigned to a domain family which is uniquely associated with a fold and a molecular function.

Year:  2000        PMID: 11063778     DOI: 10.1016/s0079-6107(00)00013-4

Source DB:  PubMed          Journal:  Prog Biophys Mol Biol        ISSN: 0079-6107            Impact factor:   3.667


  12 in total

1.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

2.  Sequence space and the ongoing expansion of the protein universe.

Authors:  Inna S Povolotskaya; Fyodor A Kondrashov
Journal:  Nature       Date:  2010-05-19       Impact factor: 49.962

3.  A limited universe of membrane protein families and folds.

Authors:  Amit Oberai; Yungok Ihm; Sanguk Kim; James U Bowie
Journal:  Protein Sci       Date:  2006-07       Impact factor: 6.725

4.  Oxidative opening of the aromatic ring: Tracing the natural history of a large superfamily of dioxygenase domains and their relatives.

Authors:  A Maxwell Burroughs; Margaret E Glasner; Kevin P Barry; Erika A Taylor; L Aravind
Journal:  J Biol Chem       Date:  2019-05-15       Impact factor: 5.157

5.  Visualizing sequence similarity of protein families.

Authors:  Vamsi Veeramachaneni; Wojciech Makałowski
Journal:  Genome Res       Date:  2004-05-12       Impact factor: 9.043

6.  INVHOGEN: a database of homologous invertebrate genes.

Authors:  Ingo Paulsen; Arndt von Haeseler
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  Hierarchical clustering algorithm for comprehensive orthologous-domain classification in multiple genomes.

Authors:  Ikuo Uchiyama
Journal:  Nucleic Acids Res       Date:  2006-01-25       Impact factor: 16.971

8.  Family classification without domain chaining.

Authors:  Jacob M Joseph; Dannie Durand
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

9.  TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences.

Authors:  Yujun Han; James M Burnette; Susan R Wessler
Journal:  Nucleic Acids Res       Date:  2009-05-08       Impact factor: 16.971

10.  PairsDB atlas of protein sequence space.

Authors:  Andreas Heger; Eija Korpelainen; Taavi Hupponen; Kimmo Mattila; Vesa Ollikainen; Liisa Holm
Journal:  Nucleic Acids Res       Date:  2007-11-05       Impact factor: 16.971
