Literature DB >> 10687950

The quest to deduce protein function from sequence: the role of pattern databases.

T K Attwood1.   

Abstract

In the wake of the numerous now-fruitful genome projects, we have witnessed a 'tsunami' of sequence data and with it the birth of the field of bioinformatics. Bioinformatics involves the application of information technology to the management and analysis of biological data. For many of us, this means that databases and their search tools have become an essential part of the research environment. However, the rate of sequence generation and the haphazard proliferation of databases have made it difficult to keep pace with developments, even for the cognoscenti. Moreover, increasing amounts of sequence information do not necessarily equate with an increase in knowledge, and in the panic to automate the route from raw data to biological insight, we may be generating and propagating innumerable errors in our precious databases. In the genome era upon us, researchers want rapid, easy-to-use, reliable tools for functional characterisation of newly determined sequences. For the pharmaceutical industry in particular, the Pandora's box of bioinformatics harbours an information-rich nugget, ripe with potential drug targets and possible new avenues for the development of therapeutic agents. This review outlines the current status of the major pattern databases now used routinely in the analysis of protein sequences. The review is divided into three main sections. In the first, commonly used terms are defined and the methods behind the databases are briefly described; in the second, the structure and content of the principal pattern databases are discussed; and in the final part, several alignment databases, which are frequently confused with pattern databases, are mentioned. For the new-comer, the array of resources, the range of methods behind them and the different tools required to search them can be confusing. The review therefore also briefly mentions a current international endeavour to integrate the diverse databases, which effort should facilitate sequence analysis in the future. This is particularly important for target-discovery programmes, where the challenge is to rationalise the enormous numbers of potential targets generated by sequence database searches. This problem may be addressed, at least in part, by reducing search outputs to the more focused and manageable subsets suggested by searches of integrated groups of family-specific pattern databases.

Mesh:

Substances:

Year:  2000        PMID: 10687950     DOI: 10.1016/s1357-2725(99)00106-5

Source DB:  PubMed          Journal:  Int J Biochem Cell Biol        ISSN: 1357-2725            Impact factor:   5.085


  8 in total

1.  Prediction of protein functional domains from sequences using artificial neural networks.

Authors:  J Murvai; K Vlahovicek; C Szepesvári; S Pongor
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

2.  Validating annotations for uncharacterized proteins in Shewanella oneidensis.

Authors:  Brenton Louie; Peter Tarczy-Hornoch; Roger Higdon; Eugene Kolker
Journal:  OMICS       Date:  2008-09

3.  Oct-2 DNA binding transcription factor: functional consequences of phosphorylation and glycosylation.

Authors:  Ishtiaq Ahmad; Daniel C Hoessli; Evelyne Walker-Nasir; Saleem M Rafik; Abdul R Shakoori
Journal:  Nucleic Acids Res       Date:  2006-01-08       Impact factor: 16.971

4.  Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins.

Authors:  Shuryo Nakai; Eunice C Y Li-Chan; Jinglie Dou
Journal:  BMC Biochem       Date:  2005-05-18       Impact factor: 4.059

5.  Current awareness on comparative and functional genomics.

Authors: 
Journal:  Yeast       Date:  2000-09-30       Impact factor: 3.239

6.  Genetic Variation and Its Reflection on Posttranslational Modifications in Frequency Clock and Mating Type a-1 Proteins in Sordaria fimicola.

Authors:  Rabia Arif; Faiza Akram; Tazeen Jamil; Hamid Mukhtar; Siu Fai Lee; Muhammad Saleem
Journal:  Biomed Res Int       Date:  2017-06-21       Impact factor: 3.411

7.  Accurate prediction of protein enzymatic class by N-to-1 Neural Networks.

Authors:  Viola Volpato; Alessandro Adelfio; Gianluca Pollastri
Journal:  BMC Bioinformatics       Date:  2013-01-14       Impact factor: 3.169

8.  Similar genomic patterns of clinical infective endocarditis and oral isolates of Streptococcus sanguinis and Streptococcus gordonii.

Authors:  Katrine Højholt Iversen; Louise Hesselbjerg Rasmussen; Kosai Al-Nakeeb; Jose Juan Almagro Armenteros; Christian Salgård Jensen; Rimtas Dargis; Oksana Lukjancenko; Ulrik Stenz Justesen; Claus Moser; Flemming S Rosenvinge; Xiaohui Chen Nielsen; Jens Jørgen Christensen; Simon Rasmussen
Journal:  Sci Rep       Date:  2020-02-17       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.