| Literature DB >> 29228196 |
Alejandra N Gonzalez-Beltran1, John Campbell2, Patrick Dunn2, Diana Guijarro3, Sanda Ionescu4, Hyeoneui Kim3, Jared Lyle4, Jeffrey Wiser2, Susanna-Assunta Sansone1, Philippe Rocca-Serra1.
Abstract
The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web. DATS underlies DataMed, the National Institutes of Health Big Data to Knowledge Data Discovery Index prototype, which aims to provide a "PubMed for datasets." The experience gained while indexing a heterogeneous range of >60 repositories in DataMed helped in evaluating DATS's entities, attributes, and scope. In this work, 3 additional exemplary and diverse data sources were mapped to DATS by their representatives or experts, offering a deep scan of DATS fitness against a new set of existing data. The procedure, including feedback from users and implementers, resulted in DATS implementation guidelines and best practices, and identification of a path for evolving and optimizing the model. Finally, the work exposed additional needs when defining datasets for indexing, especially in the context of clinical and observational information.Entities:
Keywords: data discovery; data model; metadata; search engine
Mesh:
Year: 2018 PMID: 29228196 PMCID: PMC6481379 DOI: 10.1093/jamia/ocx119
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Mapping of 12 ICPSR key metadata fields to DATS descriptor core elements
| ICPSR field | DATS dataset entity attributes |
|---|---|
| Study number | identifierInformation |
| Study title/dataset title | Title |
| Summary | Description |
| Kind of data/data type | DataType |
| Distributor | StoredIn |
| Terms of use | License |
| Download URL | doi |
| Investigator | Creator |
| Time period, collection date, release date, date updated | date_info |
| Version | Version |
| File size | Size |
.Section of the DATS model involving dimension
Mapping between DATS dimension and DDI variable
| DATS dimension | DDI variable |
|---|---|
| Identifier | ID |
| Name | Name |
| Types | interval/nature/format |
| DataType | Kind of data |
| PartOf | (Study) title |
| Description | Descriptive text/ question text |
| Values | Valid values range |
| Unit | Measurement unit |
| isAbout | Concept |
| extraProperties | Notes |