Literature DB >> 15961438

MagicMatch--cross-referencing sequence identifiers across databases.

Mike Smith1, Victor Kunin, Leon Goldovsky, Anton J Enright, Christos A Ouzounis.   

Abstract

MOTIVATION: At present, mapping of sequence identifiers across databases is a daunting, time-consuming and computationally expensive process, usually achieved by sequence similarity searches with strict threshold values.
SUMMARY: We present a rapid and efficient method to map sequence identifiers across databases. The method uses the MD5 checksum algorithm for message integrity to generate sequence fingerprints and uses these fingerprints as hash strings to map sequences across databases. The program, called MagicMatch, is able to cross-link any of the major sequence databases within a few seconds on a modest desktop computer.

Mesh:

Substances:

Year:  2005        PMID: 15961438     DOI: 10.1093/bioinformatics/bti548

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

Review 1.  Data integration for dynamic and sustainable systems biology resources: challenges and lessons learned.

Authors:  Daniel E Sullivan; Joseph L Gabbard; Maulik Shukla; Bruno Sobral
Journal:  Chem Biodivers       Date:  2010-05       Impact factor: 2.408

2.  The M5nr: a novel non-redundant database containing protein sequences and annotations from multiple sources and associated tools.

Authors:  Andreas Wilke; Travis Harrison; Jared Wilkening; Dawn Field; Elizabeth M Glass; Nikos Kyrpides; Konstantinos Mavrommatis; Folker Meyer
Journal:  BMC Bioinformatics       Date:  2012-06-21       Impact factor: 3.169

3.  The Booly aliasing resource: a database of grouped biological identifiers.

Authors:  Long Hoang Do; Ethan Bier
Journal:  Bioinformation       Date:  2011-03-26

4.  Computational approaches to selecting and optimising targets for structural biology.

Authors:  Ian M Overton; Geoffrey J Barton
Journal:  Methods       Date:  2011-08-27       Impact factor: 3.608

5.  Expansion of the BioCyc collection of pathway/genome databases to 160 genomes.

Authors:  Peter D Karp; Christos A Ouzounis; Caroline Moore-Kochlacs; Leon Goldovsky; Pallavi Kaipa; Dag Ahrén; Sophia Tsoka; Nikos Darzentas; Victor Kunin; Núria López-Bigas
Journal:  Nucleic Acids Res       Date:  2005-10-24       Impact factor: 16.971

6.  CATH FunFHMMer web server: protein functional annotations using functional family assignments.

Authors:  Sayoni Das; Ian Sillitoe; David Lee; Jonathan G Lees; Natalie L Dawson; John Ward; Christine A Orengo
Journal:  Nucleic Acids Res       Date:  2015-05-11       Impact factor: 16.971

7.  Detection of genomic idiosyncrasies using fuzzy phylogenetic profiles.

Authors:  Fotis E Psomopoulos; Pericles A Mitkas; Christos A Ouzounis
Journal:  PLoS One       Date:  2013-01-14       Impact factor: 3.240

8.  PIPs: human protein-protein interaction prediction database.

Authors:  Mark D McDowall; Michelle S Scott; Geoffrey J Barton
Journal:  Nucleic Acids Res       Date:  2008-11-06       Impact factor: 16.971

9.  iRefIndex: a consolidated protein interaction database with provenance.

Authors:  Sabry Razick; George Magklaras; Ian M Donaldson
Journal:  BMC Bioinformatics       Date:  2008-09-30       Impact factor: 3.169

10.  Matching curated genome databases: a non trivial task.

Authors:  Stéphane Descorps-Declère; Matthieu Barba; Bernard Labedan
Journal:  BMC Genomics       Date:  2008-10-24       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.