| Literature DB >> 27570670 |
Christiana McMahon1, Spiros Denaxas1.
Abstract
Metadata are critical in epidemiological and public health research. However, a lack of biomedical metadata quality frameworks and limited awareness of the implications of poor quality metadata renders data analyses problematic. In this study, we created and evaluated a novel framework to assess metadata quality of epidemiological and public health research datasets. We performed a literature review and surveyed stakeholders to enhance our understanding of biomedical metadata quality assessment. The review identified 11 studies and nine quality dimensions; none of which were specifically aimed at biomedical metadata. 96 individuals completed the survey; of those who submitted data, most only assessed metadata quality sometimes, and eight did not at all. Our framework has four sections: a) general information; b) tools and technologies; c) usability; and d) management and curation. We evaluated the framework using three test cases and sought expert feedback. The framework can assess biomedical metadata quality systematically and robustly.Entities:
Year: 2016 PMID: 27570670 PMCID: PMC5001774
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Dimensions of metadata quality for public health and epidemiological data as identified by the international stakeholder survey.
| Dimensions of metadata quality | Responses | ||
|---|---|---|---|
| N | Percent | CI | |
| Accuracy (correctness of the metadata) | 35 | 14.9% | 10-20 |
| Accessibility (extent to which the metadata can be accessed) | 34 | 14.5% | 10-20 |
| Discoverability (how visible the metadata are - can it be easily found) | 33 | 14.0% | 10-19 |
| Appropriateness (extent to which the metadata are relevant) | 24 | 10.2% | 7-15 |
| Comprehensiveness (extent to which the metadata are complete) | 24 | 10.2% | 7-15 |
| Interoperability (extent to which metadata can be exchanged and used without problems) | 21 | 8.9% | 6-13 |
| Timeliness (is the metadata current, inclusion of temporal information) | 20 | 8.5% | 5-13 |
| Versionability (extent to which a new version may be easily created) | 18 | 7.7% | 5-12 |
| Extendibility (extent to which the metadata may be easily extended) | 14 | 6.0% | 3-10 |
| Meta-metadata (metadata about the metadata) | 10 | 4.3% | 2-8 |
| Other | 2 | 0.9% | 0-3 |
| Total | 235 | 100.0% | |
Figure 1.Sections of the high level quality assessment framework for metadata in epidemiological and public health research settings