| Literature DB >> 29202203 |
Ram Dixit1, Deevakar Rogith1, Vidya Narayana1, Mandana Salimi1, Anupama Gururaj1, Lucila Ohno-Machado2, Hua Xu1, Todd R Johnson1.
Abstract
OBJECTIVE: To present user needs and usability evaluations of DataMed, a Data Discovery Index (DDI) that allows searching for biomedical data from multiple sources.Entities:
Keywords: data discovery; information retrieval; metadata; usability; user needs
Year: 2018 PMID: 29202203 PMCID: PMC7378884 DOI: 10.1093/jamia/ocx134
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Diagram showing UCD process for DataMed. Phase 1 research was conducted prior to the development of DataMed; Phase 2 evaluations were conducted on versions 0.5 and 2.0 of DataMed.
Characteristics of participants in the Phase 1 user needs analysis for DataMed
| Research Domain | Position | Count |
|---|---|---|
| Clinical Translational Science | Professor | 1 |
| Cardiology | Professor | 1 |
| Genomics | Postdoctoral Researcher | 1 |
| Biomedical Informatics | Professor | 4 |
| Postdoctoral Researcher | 1 | |
| Molecular Biology | Postdoctoral Researcher | 1 |
| Neuroscience | Professor | 1 |
| Mobile Health | Postdoctoral Researcher | 1 |
| Public Health | PhD Student | 1 |
| Anesthesiology | Professor | 1 |
| 13 | ||
Characteristics of participants in Phase 2 usability evaluation of DataMed versions 0.5 and 2.0
| DataMed Version | Research Domain | Position | Count |
|---|---|---|---|
| 0.5 | Molecular Biology | Postdoctoral Researcher | 2 |
| Data-related Professional | 1 | ||
| PhD Student | 1 | ||
| Chemistry | Professor | 1 | |
| Biomedical Informatics | PhD Student | 1 | |
| Library Science | Data-related Professional | 2 | |
| 8 | |||
| 2.0 | Cancer Biology and Genetics | MD, PhD Student | 1 |
| PhD Student | 1 | ||
| Cancer Genomics | PhD Student | 1 | |
| Public Health | Professor | 1 | |
| PhD Student | 1 | ||
| Genetic Epidemiology | Professor | 1 | |
| Systems Biology | Postdoctoral Researcher | 2 | |
| Data Curation | Data-related Professional | 1 | |
| Medical Library | Medical Librarian | 1 | |
| Neuroscience | Postdoctoral Researcher | 1 | |
| 11 | |||
Summary of user needs analysis for biomedical data discovery
| Topic | Difficulties | User Needs |
|---|---|---|
| Searching for Data | Time and effort spent finding relevant data for research purposes | Centralized source for available data and tools for finding research-related data |
| Poor documentation and protocols for accessing data | Standard documentation and protocols for data access | |
| Metadata | Assessing validity and utility of dataset for secondary use | Standard metadata, vocabularies, and documentation of datasets |
| Incomplete, inconsistent, and poor-quality metadata | Tools and guidelines for authors to create metadata | |
| Data Format | Data wrangling and compatibility with analytic methods | Documentation of data provenance |
| Availability of data at various degrees of processing: raw to summarized | Availability of data for compatibility with analytic tools | |
| Visualization | Manual work required for creating custom overviews of data | Online visualization of datasets |
| Limitation of current methods for visualizing and exploring large datasets | New techniques for representing and exploring large datasets |
Figure 2.The homepage of DataMed version 2.0 as of May 8, 2017.
Figure 3.An example of search results for the query “MRI patients Parkinsons” in DataMed version 2.0.
Summary and examples of participants’ expressed metadata needs in searching DataMed
| Metadata Field | Examples |
|---|---|
| Biomedical Concepts | De Novo Acute Myeloid Leukemia |
| Data Type | Gene Expression, Clinical Outcomes |
| Data Collection Technique | Survey, Magnetic Resonance Imaging |
| Data Format | Text, Comma-separated Values, Digital Imaging and Communications in Medicine |
| Data Processing | Raw Data, Abstracted Data, Secondary Data |
| Sample Description | Number of Samples, Species, Population |
| Intervention/Study Design | Case-Control, Cohort |
| Date of Collection | January 2010 to January 2015 |
| Variables | Cell Lines, Hormone Levels, Gene Knockouts |
| Instructions for Data Usage | Data Processing Tools, Algorithms, Tutorials |
| Permissions and Ownership | Protected Health Information, Institutional Review Board, Commercial or Academic Research |
| Research Organization and Principal Investigator | University, Private Institute, International Data |
| Publications Based on Data | Citations, Papers, Related Items |