| Literature DB >> 20627863 |
Damian Smedley1, Paul Schofield, Chao-Kung Chen, Vassilis Aidinis, Chrysanthi Ainali, Jonathan Bard, Rudi Balling, Ewan Birney, Andrew Blake, Erik Bongcam-Rudloff, Anthony J Brookes, Gianni Cesareni, Christina Chandras, Janan Eppig, Paul Flicek, Georgios Gkoutos, Simon Greenaway, Michael Gruenberger, Jean-Karim Hériché, Andrew Lyall, Ann-Marie Mallon, Dawn Muddyman, Florian Reisinger, Martin Ringwald, Nadia Rosenthal, Klaus Schughart, Morris Swertz, Gudmundur A Thorisson, Michael Zouberakis, John M Hancock.
Abstract
The recent explosion of biological data and the concomitant proliferation of distributed databases make it challenging for biologists and bioinformaticians to discover the best data resources for their needs, and the most efficient way to access and use them. Despite a rapid acceleration in uptake of syntactic and semantic standards for interoperability, it is still difficult for users to find which databases support the standards and interfaces that they need. To solve these problems, several groups are developing registries of databases that capture key metadata describing the biological scope, utility, accessibility, ease-of-use and existence of web services allowing interoperability between resources. Here, we describe some of these initiatives including a novel formalism, the Database Description Framework, for describing database operations and functionality and encouraging good database practise. We expect such approaches will result in improved discovery, uptake and utilization of data resources. Database URL: http://www.casimir.org.uk/casimir_ddf.Entities:
Mesh:
Year: 2010 PMID: 20627863 PMCID: PMC2911849 DOI: 10.1093/database/baq014
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
The CASIMIR Database Description Framework (DDF)
| Category | Level 1 | Level 2 | Level 3 |
|---|---|---|---|
| Quality and Consistency | No explicit process for assuring consistency | Process for assuring consistency, automatic curation only | Process for assuring consistency with manual curation |
| Currency | Closed legacy database or last update more than a year ago | Updates or versions more than once a year | Updates or versions more than once a month |
| Accessibility | Access via browser only | Access via browser and database reports or database dumps | Access via browser and programmatic access (well defined API, SQL access or web services) |
| Output formats | HTML or similar to browser only | HTML or similar to browser and sparse standard file formats, e.g. FASTA | HTML or similar to browser and rich standard file formats, e.g. XML, SBML (Systems Biology Markup Language) |
| Technical documentation | Written text only | Written text and formal structured description, e.g. automatically generated API docs (JavaDoc), DDL (Data Description Language), DTD (Document Type Definition), UML (Unified Modelling Language), etc. | Written text and formal structured description and tutorials or demonstrations on how to use them |
| Data representation standards | Data coded by local formalism only | Some data coded by a recognised controlled vocabulary, ontology or use of minimal information standards (MIBBI) | General use of both recognised vocabularies or ontologies, and minimal information standards (MIBBI) |
| Data structure standards | Data structured with local model only | Data structured with formal model, e.g. an XML schema | Use of recognised standard model, e.g. FUGE |
| User support | User documentation only | User documentation and Email/web form help desk function | User documentation as well as a personal contact help desk function/training |
| Versioning | No provision | Previous version of database available but no tracking of entities between versions | Previous version of database available and tracking of entities between versions |
Figure 1.The DDF query and annotation tool. This tool allows any user to browse a set of resources that have been annotated using the DDF categories. Searches for resources by DDF category and level are also possible. In addition, resource maintainers can log-in and edit their existing annotations or annotate a new resource using a simple web form. This tool is freely available and easy to install for other communities that wish to create their own registry of resources.