| Literature DB >> 21969471 |
Andrea Splendiani1, Michaela Gündel, Jonathan M Austyn, Duccio Cavalieri, Ciro Scognamiglio, Marco Brandizi.
Abstract
Biomedical research relies increasingly on large collections of data sets and knowledge whose generation, representation and analysis often require large collaborative and interdisciplinary efforts. This dimension of 'big data' research calls for the development of computational tools to manage such a vast amount of data, as well as tools that can improve communication and access to information from collaborating researchers and from the wider community. Whenever research projects have a defined temporal scope, an additional issue of data management arises, namely how the knowledge generated within the project can be made available beyond its boundaries and life-time. DC-THERA is a European 'Network of Excellence' (NoE) that spawned a very large collaborative and interdisciplinary research community, focusing on the development of novel immunotherapies derived from fundamental research in dendritic cell immunobiology. In this article we introduce the DC-THERA Directory, which is an information system designed to support knowledge management for this research community and beyond. We present how the use of metadata and Semantic Web technologies can effectively help to organize the knowledge generated by modern collaborative research, how these technologies can enable effective data management solutions during and beyond the project lifecycle, and how resources such as the DC-THERA Directory fit into the larger context of e-science.Entities:
Mesh:
Year: 2011 PMID: 21969471 PMCID: PMC3220873 DOI: 10.1093/bib/bbr051
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
The list of ontologies that are included in the DC-THERA Ontology
| Ontology | Domain | Usage in DC-THERA Directory |
|---|---|---|
| Ontology for biomedical investigations (OBI) [ | Meta-information for Biomedical experiments | Biomaterials, protocols, data sets, documents, tools and methods |
| Dendritic cell ontology [ | Cell-type annotation | Biomaterials, data sets, protocols |
| Cell-type ontology (CL) [ | Cell-type annotation | Biomaterials, data sets, protocols |
| Experimental factor ontology (EFO) [ | Meta-information for microarray and -omics experiments | data sets |
| Microarray experimental conditions (MGED) [ | Meta-information for microarray experiments | data sets |
| Chemical entities of biological interest (CHEBI) [ | Annotation of bio-molecules and administered compounds/drugs | Biomaterials, data sets |
| Foundational model of anatomy (FMA) [ | Annotation of biomaterials | Biomaterials, data sets, protocols |
| NCBI taxonomy [ | Classification of organisms | Biomaterials, data sets, protocols |
Figure 1:Extract from the DC-THERA Ontology. Some of the top-classes and relationships that are part of the DC-THERA Ontology are represented. The diagram makes use of labels in place of identifiers for readability. ‘DC-THERA Resource’ encompasses research assets available within DC-THERA, while ‘Public Resource’ is a more generic class that includes Participants and Persons (both of these classes are at a level of abstraction that is above what is presented to the user).
Figure 2:An example of a ‘resource-centric’ view in the DC-THERA Directory. The information page shown corresponds to the resource ‘DC-ATLAS’ (URL: http://dc-research.eu/tool/101). The classification of this resource as a tool is shown in the upper part of the page. Relationships with other resources in the Directory (context) are shown in the left bar. Note that the category ‘Tools’ is highlighted: this is an indication of the current context, and searches via the search box are performed on tools and related resources.
Figure 3:Web access data for DC-RESEARCH.EU from 1st December 2010 to 14th January 2011. Total figures are reported in blue, returning users in orange and new users in green. The figures show a drop in access towards the end of the DC-THERA project, and a slow and steady reprise afterwards, arguably corresponding to a shift in usage from a project-oriented tool to a generic web resource. Reported values exclude computational access via a SPARQL end-point and access through a replicated platform (cf. ‘Portability and long-term persistence’ section).
Figure 5:Comparison of web-traffic of the DC-THERA Directory and other related resources. The image reports traffic information for some of the information resources presented in the discussion section, as collected from the alexa.com web traffic monitoring service over the period April to June 2011. The x-axis reports the Alexa rank, which is a measure of web-traffic. Numbers are log scaled and range from 703 100 (most visited) for the Alzheimer research forum to 14 009 573 (least viewed) for Eagle-i. The y-axis reports the percentage of the top seven search queries that are relevant to the content of the website. Terms not evidently pertaining to the content of the website have been double checked with Google queries for the term, with scope limited to the website domain. If in doubt, terms have been considered pertinent. ‘Pertinence’ is not related to the performance of individual sites, but rather to the specificity with which a generic query on the web can reach them. Finally, the size of the dots indicated the number of resources linking to the corresponding web resource. All measures from http://alexa.com are derived from a panel of users of which the suitability for the purpose of this study cannot be assessed. These measures should only be considered as indicative.
Figure 4:An example of the use of RelFinder to find relationships, in the Directory, between a researcher and a given data set. This search modality goes consistently beyond a single text search, and allows one to find a ‘contact point’ for a resource of interest. The results reported in the figure can be reproduced by accessing the URL http://tiny.cc/dcdrfdemo.
DC-THERA project history and user feedback
| Release | Features |
|---|---|
| Oct 2009 | Search/Browse functionality Main contents |
| Feb 2009 | Backend with editing functions, available to selected users Contents from all DC-THERA Scientific reports included by curators |
| Jul 2009 | Editing back-end available to all users External services integrated (e.g. |
| Autumn 2009 | Standard Operating Procedures (SOP) added as protocols Graphical look improved Tooltips for categories and acronyms added |
| Mid 2010 | RDF/RDFa/SPARQL export Relfinder integration Links between protocols and bio-materials used were added Contents and their classification reviewed |
| Mid 2011 | Content updates RDF dump loaded in Talis |
The table summarizes the features introduced in the Directory over time. The Directory development followed an iterative approach and at each release user feedback was gathered to plan and prioritize next developments. Reported in italic are the features requested by users, and not originally planned by the steering committee.