| Literature DB >> 31984327 |
Philip R O Payne1, Elmer V Bernstam2, Justin B Starren3.
Abstract
There are an ever-increasing number of reports and commentaries that describe the challenges and opportunities associated with the use of big data and data science (DS) in the context of biomedical education, research, and practice. These publications argue that there are substantial benefits resulting from the use of data-centric approaches to solve complex biomedical problems, including an acceleration in the rate of scientific discovery, improved clinical decision making, and the ability to promote healthy behaviors at a population level. In addition, there is an aligned and emerging body of literature that describes the ethical, legal, and social issues that must be addressed to responsibly use big data in such contexts. At the same time, there has been growing recognition that the challenges and opportunities being attributed to the expansion in DS often parallel those experienced by the biomedical informatics community. Indeed, many informaticians would consider some of these issues relevant to the core theories and methods incumbent to the field of biomedical informatics science and practice. In response to this topic area, during the 2016 American College of Medical Informatics Winter Symposium, a series of presentations and focus group discussions intended to define the current state and identify future directions for interaction and collaboration between people who identify themselves as working on big data, DS, and biomedical informatics were conducted. We provide a perspective concerning these discussions and the outcomes of that meeting, and also present a set of recommendations that we have generated in response to a thematic analysis of those same outcomes. Ultimately, this report is intended to: (1) summarize the key issues currently being discussed by the biomedical informatics community as it seeks to better understand how to constructively interact with the emerging biomedical big data and DS fields; and (2) propose a framework and agenda that can serve to advance this type of constructive interaction, with mutual benefit accruing to both fields.Entities:
Keywords: big data; biomedical informatics; data science
Year: 2018 PMID: 31984327 PMCID: PMC6951903 DOI: 10.1093/jamiaopen/ooy032
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Web search activity as measured via Google Trends for the terms “biomedical informatics”, “big data”, “data science”, “informatics”, and “bioinformatics” (2004–Present).
Description of ACMI Fellows participating in 2016 Winter Symposium, stratified by professional setting (academic, industry, or other areas such as non-profit or government entities) and geographic setting (United States and international)
| Professional setting | Academic | Industry | Other (non-profit, government) |
|---|---|---|---|
| Number of participants at 2016 Winter Symposium ( | 40 (82% of meeting participants) | 7 (14% of meeting participants) | 2 (4% of meeting participants) |
| Geographic setting | United States | International | |
| Number of participants at 2016 Winter Symposium ( | 42 (85% of meeting participants) | 7 (15% of meeting participants) | |
Note: Of note, the participants at this meeting represented approximately 16% of all ACMI Fellows at the time of the event.
Figure 2.Overview of an integrated biomedical data, information, and knowledge lifecycle, showing the contributions of theories and methods associated with Big Data, data science, data analytics, and BMI. For each phase of the model, exemplary contributions to such theory and practice as have been generated by the BMI community over the last several decades are shown. BMI: biomedical informatics.
Overview of findings and recommendations generate by breakout groups during the 2016 ACMI Winter Symposium, focusing on synergies and distinctions between BMI and DS
| Question 1: How should the BMI and DS communities engage and communicate with each other so as to coordinate effectively? |
A focus of such efforts should be to explain the relationships between DS and BMI to internal and external community members; and Such explanations must be accessible and use easily understandable examples. |
This effort should ensure that we learn from our history with other differentiated groups that cross-over over otherwise intersect with BMI; A primary dissemination vehicle for such a “view” should be via panels and/or thematic tracks at scientific meetings, so as to discuss such lessons learned and next steps, as well as to highlight projects that effectively implement said models and advance biomedicine; and It will be critical to ensure that relevant publication venues demonstrative inclusiveness to engage diverse stakeholders from both the BMI and DS communities. |
| Question 2: What are the curricular and workforce development needs incumbent to realizing potential synergies between BMI and DS? What are the distinctions between the two fields in this regard? |
Achieving this goal will require the creation of better pipeline for trainees who can enter the two field; and It will be important to engage potential employers to ensure that such curricula are harmonized with workforce development needs. |
| Question 3: How does an increasing emphasis on DS in biomedicine impact the types of shared resources and capabilities commonly found in biomedical research enterprises as are regularly overseen by BMI academic or operational units? |
Data storage Data “wrangling” Computational methods Quantitative methods Visualization |
Who is responsible for data capture and management and who serves as the owner/steward/champion for such data assets; How do we adequately address multi-disciplinary interactions around said services/capabilities and ensuing stakeholder engagement; and Who pays for such services and capabilities and how are they rendered sustainable, particularly as they become an essential and necessary substrate for modern biomedical research, education, and practice. |
| Question 4: What critical socio-cultural and policy issues need to be addressed relative to data reuse and open science paradigms as they pertain to providing the “input” for research paradigms that leverage DS approaches? |
Case studies that show the explicit and implicit value of data reuse (reproducibility, cumulative open science, meta-analyses, etc.); Metadata standards that facilitate data reuse; Collaborative platforms for team-oriented open innovation; Mechanisms and incentive structures to empower data generators as part of derivative projects/teams; Inventories of best practices from other countries to inform US-based efforts in this regard; and Mechanisms to provide credit to data generators when their data is used in other contexts/projects. |
| Question 5: How should DS focused biomedical research be funded and sustained, particularly once initial federal investments (ie, the NIH BD2K program) reach the end of their currently allocated resources? |
BMI: biomedical informatics; DS: data science; NLM: National Library of Medicine; NIH: National Institutes of Health; NSF: National Science Foundation; PCORI: Patient Centered Outcomes Research Institute; AHRQ: Agency for Healthcare Research and Quality.