Literature DB >> 18025043

The Molecular Biology Database Collection: 2008 update.

Michael Y Galperin1.   

Abstract

The Nucleic Acids Research online Molecular Biology Database Collection is a public repository that lists more than 1000 databases described in this and previous Nucleic Acids Research annual database issues, as well as a selection of molecular biology databases described in other journals. All databases included in this Collection are freely available to the public. The 2008 update includes 1078 databases, 110 more than the previous one. The links to more than 80 databases have been updated and 25 obsolete databases have been removed from the list. The complete database list and summaries are available online at the Nucleic Acids Research web site, http://nar.oxfordjournals.org/.

Entities:  

Mesh:

Year:  2007        PMID: 18025043      PMCID: PMC2238887          DOI: 10.1093/nar/gkm1037

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


COMMENTARY

The current issue of the Nucleic Acids Research features 100 new databases and 82 updates for databases that were previously described in NAR or other journals. Inclusion of these databases into the Nucleic Acids Research online Molecular Biology Database Collection (http://www.oxfordjournals.org/nar/database/a/, see Supplementary Table S1) brought the total count to more than 1000 databases for the first time in the 12-year history of this list. Several more databases, recently described in Bioinformatics and other journals, were added to the list. The database collection now includes databases from two more countries, Chile and Tunisia [AlterORF (1), a database of alternate open reading frames in prokaryotic genomes, http://www.cienciavida.cl/alterorf/, no. 1044 in the NAR Database Collection, and BACTIBASE (2), a database of bacteriocins, natural antimicrobial peptides, http://www.pfba-lab.org/bactibase, no. 1167, respectively]. A year after the only Norwegian database, tinyGRAP, was dropped from the list, there is again a database from Norway, SuperCAT [http://mlstoslo.uio.no/, no. 1135, (3)]. Some of the new databases come with remarkably creative names, such as eggNOG (4), a database of evolutionary genealogy of genes: Non-supervised Orthologous Groups (http://eggnog.embl.de, no. 1068), GEISHA (5), a database of Gallus expression in situ hybridization analysis (http://geisha.arizona.edu, no. 1173), PhosPhAt (6), a database of protein phosphorylation sites in Arabidopsis thaliana (http://www.plantenergy.uwa.edu.au/applications/phosphat/, no. 1112), and STITCH (7), a search tool for interactions of chemicals (http://stitch.embl.de/, no. 1134). It was interesting to see researchers in different countries coming up with databases covering very similar areas. Examples include MethyCancer [http://methycancer.genomics.org.cn/, no. 1096, (8)] and PubMeth [http://matrix.ugent.be/pubmeth/, no. 1125, (9)] that both examine the links between DNA methylation levels and cancerogenesis; microRNA.org [http://www.microrna.org/, no. 1097, (10)] and miRGator [http://genome.ewha.ac.kr/miRGator/miRGator.html, no. 1098, (11)] that both deal with prediction of microRNA targets and gene expression data; Greglist [http://tubic.tju.edu.cn/greglist/, no. 1082, (12)] and QuadBase, [http://quadbase.igib.res.in/, no. 1126, (13)] that both search for G-quadruplex motifs in the promoters of potentially G-quadruplex regulated genes. Independent creation of these databases shows the importance of the respective topics and ensures friendly competition that should keep these databases in a good shape. Another pair of databases deals with the genomes of pea aphid Acyrthosiphon pisum [AphidBase, http://www.aphidbase.com/, no 1152, (14)] and its bacterial endosymbiont Buchnera sp. APS [BuchneraBase, http://www.buchnera.org/, no. 1153, (15)]. The combination of the two should provide further insights in the mechanisms of this interesting symbiotic relationship. One of the most remarkable new databases is Déjà vu (http://spore.swmed.edu/dejavu/, no. 1171), a database that uses the eTBLAST tool, recently described in the NAR Web Server issue (16), to find highly similar abstracts in bibliographic databases, including MEDLINE. Most of the citations retrieved that way appear to be relatively benign duplicate publications of the same data by the same authors (including my own comments to the 2006 and 2007 releases of the NAR Database Collection). Some highly similar publications, however, come from different authors and look extremely suspicious. Almost two dozen databases that have been featured in the previous release of the NAR database collection but are no longer maintained have been dropped from the list. One of such casualties was the popular listing of ongoing microbial genome sequencing projects on the TIGR web site (former no. 99). However, this task is carried out by much more comprehensive listings in the GOLD database [http://www.genomesonline.org/, no. 75, (17)] and Entrez Genomes web site [no. 458, (18)]. The recently created diArk database [http://www.diark.org/diark/, no. 1172, (19)] lists eukaryotic genome sequencing projects. Several more databases have been superseded by more advanced or more comprehensive databases. For example, the famous Families of Structurally Similar Proteins (FSSP) database [no. 469, (20)] was superseded by the Dali database [http://ekhidna.biocenter.helsinki.fi/dali/start, no. 442, (21)], while the BayGenomics database [http://baygenomics.ucsf.edu/, no 416, (22)] was superseded by the International Gene Trap Consortium database [http://www.genetrap.org/, no 827, (23)]. Two EBI database projects, the Alternative Splicing Database [ASD, (24)], and the Alternative Transcript Diversity Database [ATD, (25)] have been combined into a single Alternative Splicing and Transcript Diversity database (ASTD, no. 28). In addition, two important and widely used databases had to be removed from the list because they limited access only to the registered users: the Human Gene Mutation Database (HGMD®, http://www.hgmd.cf.ac.uk/, no. 133) and Unified Medical Language System® (UMLS, http://umlsks.nlm.nih.gov/, no. 317). As a result of all these changes, the current release of the NAR online Molecular Biology Database Collection includes 1078 databases, 110 more than the previous one. It is probably useful to reiterate that this listing is by no means exhaustive; it was never intended to represent all molecular biology databases, or even all publicly available ones. Rather, the NAR Collection is a very selective list, chosen among numerous molecular biology-related databases available all over the world. Most of the databases in the list come from the annual NAR database issues and therefore reflect the choice of the editors of these issues, based on scrupulous peer review. Additional databases are selected from publications in other journals, such as Bioinformatics or BMC Bioinformatics. Finally, this collection includes a relatively small number of databases that have been submitted to a NAR database issue, received some positive reviews, but were not accepted for publication because of the highly specialized data and/or narrow target audience. All these databases are subsequently vetted for continuity and those that are not being maintained or updated are being gradually removed from the collection. Summing up, NAR online Molecular Biology Database Collection can itself be viewed as a curated database: each of its entries has been looked at, evaluated and deemed useful for the community.
  25 in total

1.  GEISHA: an in situ hybridization gene expression resource for the chicken embryo.

Authors:  D K Darnell; S Kaur; S Stanislaw; S Davey; J H Konieczka; T A Yatskievych; P B Antin
Journal:  Cytogenet Genome Res       Date:  2007       Impact factor: 1.636

2.  Touring protein fold space with Dali/FSSP.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

3.  AphidBase: a database for aphid genomic resources.

Authors:  Jean-Pierre Gauthier; Fabrice Legeai; Alain Zasadzinski; Claude Rispe; Denis Tagu
Journal:  Bioinformatics       Date:  2007-01-19       Impact factor: 6.937

4.  BayGenomics: a resource of insertional mutations in mouse embryonic stem cells.

Authors:  Doug Stryke; Michiko Kawamoto; Conrad C Huang; Susan J Johns; Leslie A King; Courtney A Harper; Elaine C Meng; Roy E Lee; Alice Yee; Larry L'Italien; Pao-Tien Chuang; Stephen G Young; William C Skarnes; Patricia C Babbitt; Thomas E Ferrin
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

5.  The Genomes On Line Database (GOLD) v.2: a monitor of genome projects worldwide.

Authors:  Konstantinos Liolios; Nektarios Tavernarakis; Philip Hugenholtz; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

6.  ASD: a bioinformatics resource on alternative splicing.

Authors:  Stefan Stamm; Jean-Jack Riethoven; Vincent Le Texier; Chellappa Gopalakrishnan; Vasudev Kumanduri; Yesheng Tang; Nuno L Barbosa-Morais; Thangavel Alphonse Thanaraj
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  diArk--a resource for eukaryotic genome research.

Authors:  Florian Odronitz; Marcel Hellkamp; Martin Kollmar
Journal:  BMC Genomics       Date:  2007-04-17       Impact factor: 3.969

8.  Database resources of the National Center for Biotechnology Information.

Authors:  David L Wheeler; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael DiCuccio; Ron Edgar; Scott Federhen; Lewis Y Geer; Yuri Kapustin; Oleg Khovayko; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; James Ostell; Vadim Miller; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Steven T Sherry; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Roman L Tatusov; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko
Journal:  Nucleic Acids Res       Date:  2006-12-14       Impact factor: 16.971

9.  eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications.

Authors:  Mounir Errami; Jonathan D Wren; Justin M Hicks; Harold R Garner
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

10.  eggNOG: automated construction and annotation of orthologous groups of genes.

Authors:  Lars Juhl Jensen; Philippe Julien; Michael Kuhn; Christian von Mering; Jean Muller; Tobias Doerks; Peer Bork
Journal:  Nucleic Acids Res       Date:  2007-10-16       Impact factor: 16.971

View more
  27 in total

Review 1.  The ACGT Master Ontology and its applications--towards an ontology-driven cancer research and management system.

Authors:  Mathias Brochhausen; Andrew D Spear; Cristian Cocos; Gabriele Weiler; Luis Martín; Alberto Anguita; Holger Stenzhorn; Evangelia Daskalaki; Fatima Schera; Ulf Schwarz; Stelios Sfakianakis; Stephan Kiefer; Martin Dörr; Norbert Graf; Manolis Tsiknakis
Journal:  J Biomed Inform       Date:  2010-05-11       Impact factor: 6.317

2.  Supporting retrieval of diverse biomedical data using evidence-aware queries.

Authors:  Eithon Cadag; Peter Tarczy-Hornoch
Journal:  J Biomed Inform       Date:  2010-07-17       Impact factor: 6.317

3.  Toward understanding the mechanism of action of the yeast multidrug resistance transporter Pdr5p: a molecular modeling study.

Authors:  Robert M Rutledge; Lothar Esser; Jichun Ma; Di Xia
Journal:  J Struct Biol       Date:  2010-10-27       Impact factor: 2.867

4.  Animal trait ontology: The importance and usefulness of a unified trait vocabulary for animal species.

Authors:  L M Hughes; J Bao; Z-L Hu; V Honavar; J M Reecy
Journal:  J Anim Sci       Date:  2008-02-13       Impact factor: 3.159

Review 5.  Bioinformatics resources for the study of gene regulation in bacteria.

Authors:  Julio Collado-Vides; Heladia Salgado; Enrique Morett; Socorro Gama-Castro; Verónica Jiménez-Jacinto; Irma Martínez-Flores; Alejandra Medina-Rivera; Luis Muñiz-Rascado; Martín Peralta-Gil; Alberto Santos-Zavaleta
Journal:  J Bacteriol       Date:  2008-10-31       Impact factor: 3.490

6.  BIRI: a new approach for automatically discovering and indexing available public bioinformatics resources from the literature.

Authors:  Guillermo de la Calle; Miguel García-Remesal; Stefano Chiesa; Diana de la Iglesia; Victor Maojo
Journal:  BMC Bioinformatics       Date:  2009-10-07       Impact factor: 3.169

Review 7.  Data-driven methods to discover molecular determinants of serious adverse drug events.

Authors:  A P Chiang; A J Butte
Journal:  Clin Pharmacol Ther       Date:  2009-01-28       Impact factor: 6.875

Review 8.  Chemical databases for environmental health and clinical research.

Authors:  Carolyn J Mattingly
Journal:  Toxicol Lett       Date:  2008-10-18       Impact factor: 4.372

9.  XML-based approaches for the integration of heterogeneous bio-molecular data.

Authors:  Marco Mesiti; Ernesto Jiménez-Ruiz; Ismael Sanz; Rafael Berlanga-Llavori; Paolo Perlasca; Giorgio Valentini; David Manset
Journal:  BMC Bioinformatics       Date:  2009-10-15       Impact factor: 3.169

10.  Biomedical discovery acceleration, with applications to craniofacial development.

Authors:  Sonia M Leach; Hannah Tipney; Weiguo Feng; William A Baumgartner; Priyanka Kasliwal; Ronald P Schuyler; Trevor Williams; Richard A Spritz; Lawrence Hunter
Journal:  PLoS Comput Biol       Date:  2009-03-27       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.