| Literature DB >> 26541391 |
Elliott M Antman, Emelia J Benjamin, Robert A Harrington, Steven R Houser, Eric D Peterson, Mary Ann Bauman, Nancy Brown, Vincent Bufalino, Robert M Califf, Mark A Creager, Alan Daugherty, David L Demets, Bernard P Dennis, Shahram Ebadollahi, Mariell Jessup, Michael S Lauer, Bernard Lo, Calum A MacRae, Michael V McConnell, Alexa T McCray, Michelle M Mello, Eric Mueller, Jane W Newburger, Sally Okun, Milton Packer, Anthony Philippakis, Peipei Ping, Prad Prasoon, Véronique L Roger, Steve Singer, Robert Temple, Melanie B Turner, Kevin Vigilante, John Warner, Patrick Wayte.
Abstract
BACKGROUND: A 1.5-day interactive forum was convened to discuss critical issues in the acquisition, analysis, and sharing of data in the field of cardiovascular and stroke science. The discussion will serve as the foundation for the American Heart Association's (AHA's) near-term and future strategies in the Big Data area. The concepts evolving from this forum may also inform other fields of medicine and science. METHODS ANDEntities:
Keywords: AHA Scientific Statements; clinical trials; data; epidemiology; ethics; mobile health; preclinical
Mesh:
Year: 2015 PMID: 26541391 PMCID: PMC4845234 DOI: 10.1161/JAHA.115.002810
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
Figure 1Patients were previously the passive source of data (ie, measurements were taken from patients by others), they are increasingly becoming active generators of data (eg, wireless sensors) and, in doing so, provide a vast new domain of data not previously available. In addition to the patient perspective, 6 other domains are shown. These lenses served as the organizing basis for the American Heart Association's Data Summit.
Stakeholder Domains Relevant to Acquisition, Analysis, and Sharing of Data in Cardiovascular and Stroke Science
| Stakeholder | Interface With Data | |
|---|---|---|
| Current | Future | |
| Patients |
Passive source of data in healthcare environment Beneficiary of acquisition and analysis of data |
Anticipated to assume progressively more active role in generating data (eg, quantified self) Participant in the acquisition and analysis of data (eg, PCORI) Beneficiary of acquisition and analysis of data |
| Basic Scientists |
Focused on discovery science, usually in isolated units (ie, laboratories) |
Sharing of data and experimental materials may facilitate replication of results beyond the original reporting laboratory Use of new methods for scientific discovery (eg, data mining, Big Data analytics) |
| Clinical Investigators |
Focus on hypothesis‐driven research Generally assume a common phenotype of trial participants for condition under study (eg, sample of larger universe of persons with the condition) but may analyze broad subgroup categories Case report form and trial database usually constructed for specific trial with limited or no plans for repeat use in future trials |
Increasing use of Big Data approaches are anticipated to result in a shift from prespecified testable hypotheses to iteratively generated, data‐driven hypotheses Increasing knowledge gained from deep genotyping and phenotyping will lead to more precise characterization of individual participant profiles and targeted treatments Trial documents and databases will be designed for repeated use Sharing of clinical trial data may facilitate planning of future trials (ie, avoid duplication of previous efforts) and permit validation of findings by external groups beyond original investigators |
| Population Scientists/Epidemiologists |
Each study collects data of interest, often without coordination across studies Participants interact at periodic intervals with the study Periodic surveillance of participants health encounters |
Increasing coordination of “thick phenotyping” so that data can be harmonized, shared, and meta‐analyzed across studies. Data collection protocols posted in real time at study Web sites Participants contribute data through mobile health technologies throughout course of year longitudinally Participants share healthcare encounter data in real time |
| Clinicians/Healthcare System Researchers/Administrators |
Navigating the role of the physician in the growing digital world Focus on balancing the validity of risk modeling from Big Data in making care decisions |
Educating providers about the tools available to handle and find information Making the use of the data a natural part of the doctor–patient conversation Using data in areas in which there are physician shortages |
| Industry |
Navigating the current fragmented environment that frequently results in cumbersome “linear” development pathways Limited integration of novel sources of data Increasing challenge of storing exponentially increasing amounts of data |
Creating frameworks for collecting, storing, managing, curating, and analyzing data Navigating data that span various types of analytics including medical images, genomics, and natural language processing Capturing exogenous data to help inform clinical care Cloud computing to address capacity vs utilization issues New devices for monitoring patients with specific conditions |
| Regulatory Authorities |
Current focus is on review of data from clinical trials usually conducted for a focused purpose Constrained by current system that limits ability to share and integrate data across multiple sources |
Collecting data in standard formats for comparability and integration Ethical use of patient data, potential use of sensor data Greater use of data from public health sources Analyzing Big Data to accelerate understanding biological processes and heterogeneous response to treatments Policies and technical infrastructure to encourage data sharing Potential for modernizing the informed consent process |
PCORI indicates Patient‐Centered Outcomes Research Institute.
Figure 2Evolving informatics for an EMR‐based clinical research network. The model illustrates data transfers from individual‐site EMR to storage in an internal data warehouse with data that can then be mapped to a research datamart (with standard data elements) and ultimately transferred to a CT database. CT indicates clinical trial; EMR, electronic medical records.
Figure 3PCORnet: clinical research and patient engagement on a large scale. The proposed organization of the PCORnet is shown. Supported by a coordinating center, CDRNs, and PPRNs, a sustainable network of healthcare centers will be created with interoperable electronic medical records and active patient participation, all overseen by the PCORI staff, board of governors, and advisors. CDRNs include 8 networks with 1 million patients per network. PPRNs include 18 networks. Reprinted from Selby et al.8 CDRNs indicates Clinical Data Research Networks; PCORI, Patient‐Centered Outcomes Research Institute; PCORnet, National Patient‐Centered Clinical Research Network; PPRNs, Patient Powered Research Networks.
Figure 4Major potential threats to the validity of research findings. Discovery scientists work in an environment in which they have a high degree of control over the experimental conditions and use a small sample size. Population scientists operate in an environment in which there is less control over experimental conditions but a large sample size. Clinical trials fall between these 2 extremes and need to be interpreted with respect to internal validity and external generalizability. Other major threats to validity are shown (bottom left and right); the size of the font graphically illustrates the relative importance of the threats.
Figure 5The future of health information. New sources of patient data (in blue) are beginning to be merged with traditional healthcare data sources (in yellow) to better inform clinicians’ diagnosis, treatment and care decisions. The health care of the future can build on this model by incorporating additional existing data sources (in gray) to create an electronic healthcare predictive analytics systems (E‐HPA) that could theoretically use data from any source to clean and analyze, run and/or update predictive models, and output risk estimates back into the health information system to trigger or monitor specific clinical and/or operational activity. Adapted from Amarasingham et al.32 Admin indicates administration; EHR, electronic health record; Lab, laboratory.
Figure 6Vision for clinical research in the future. Examples of wearable sensors (top left). These communicate via Bluetooth to smartphones, which transmit data wirelessly to a research‐grade database on the Internet. Mobile health enables enrollment of large numbers of diverse participants around the globe. Investigators can then more effectively study transitions from a state of ideal health to the development of risk factors and ultimately overt manifestations of disease. Reprinted from Antman.56
Goals of Data Sharing
| Facilitating discovery science: avoiding duplication, ensuring reproducibility |
| Increasing understanding of human disease |
| Improving the design, efficiency, and quality of clinical trials |
| Improving the quality of care in clinical settings |
| Increasing the effectiveness of prevention |
| Translation to public |
Figure 7Major stages of the clinical trial life cycle and recommendations for when to share specific data packages. Data are generated at nearly every stage of the clinical trial life cycle, including the initial protocol and statistical analysis plan prepared prior to registration, the collection of baseline participant data at participant enrollment, and the analysis of the analyzable data set. To help frame the discussion of what data should be shared at what times, the Institute of Medicine report on sharing clinical trial data described the clinical trial life cycle as consisting of 5 major stages with guiding principles and a practical framework for the responsible sharing of data, including the types of data available at different stages of a trial and the optimal times to share them. Reprinted from Institute of Medicine report on Sharing Clinical Trial Data.44
Big Data and Stakeholders
| Stakeholder | What Stakeholders Want From Big Data | The American Heart Association's Role |
|---|---|---|
| Patients |
Controlled access to portable secure medical information Access to best possible health outcomes at affordable cost Easy access to medical research/clinical trials |
High priority
Facilitate the use of Big Data by patients and professionals Patient‐centered advocacy on the use of Big Data Facilitate the evaluation of health technology devices |
| Basic Scientists |
Develop and identify novel targets that otherwise would not be identified by traditional methods and query for CVD‐related genes/proteins Use systems approach to identify multiple genes/proteins that collectively cause CVD Access and use of analytical partners to help advance research goals |
High priority
Create an AHA digital ecosystem to build an AHA knowledge base (AHA Commons) and enable AHA investment for sustainable and long‐lasting impact Grant funding to establish training/learning, novel tools/products for analytic techniques Broker partnerships to advance, collect and analyze data Establish AHA data science policies and guidelines (privacy, ethics, intellectual property) |
| Clinical Investigators |
High‐quality standardized clinical data for secondary use A standardized technology platform with interoperable, feasible, and federated access to a broad range of clinical data Better, novel, more rapid mechanisms of support for analysis of Big Data A mechanism for ongoing discussion of these topics that includes the clinical investigator community |
High priority
Training and funding a new generation of Big Data and users (clinicians to developers) Bridge the gap between the theoretical promise of Big Data to potential use (eg, map EMR to “Get With The Guidelines”; provide leadership in data standard, quality, and validity) Be the match.com for data owners and data researchers |
| Population Scientists/Epidemiologists |
Resource effectively and integrate data from large numbers of diverse people, participants, patients longitudinally gathering across systems and populations Integration of data including, social determinants of health, exposure data, including mobile health as a source of data, physiological data, healthcare encounters/procedures; should be easily accessed Design and deploy the new prevention science using mobile health to empower the individual participant |
High priority
Provide resources for multidisciplinary investigators at all stages to broadly access science across the AHA portfolio including population science and epidemiology Create a data science fellowship whereby top tier data scientists can be funded and mentored by AHA multidisciplinary scientists Commit resources to convene a forum focused on Big Data harmonization and validation inclusive of multiple stakeholders and career levels Help scientists along the clinical–translational continuum translate and educate the public, clinicians, and researchers about Big Data |
| Clinicians/Healthcare System Researchers/Administrators |
Engagement across stakeholder domains Empowering patients using Big Data Identifying at‐risk populations with Big Data decision support Assessing and equipping providers with tools for collection, distillation and visualization Leveraging Big Data to enrich the practice of medicine—more efficient and more enjoyable Teaching providers about Big Data |
High priority
Develop and disseminate accepted clinical standards and benchmarks Sponsor the development of multidisciplinary tools for data analysis Convene all benchmarking communities and stakeholders |
| Industry |
Understanding of the available data:
ability to access, appropriateness of use, context of collection, known quality issues, any inherent biases, clarity of business models Robust analytical methods and scientifically accepted ways to deal with common data issues Rigor and disclosure of analytic methodologies when findings may affect healthcare policy decisions and clear communication of findings Clarity in how observational data may be used in support (or replacement) of RCT data in regulatory decision making Ensuring appropriate patient privacy and confidentiality |
High priority
Provide a mechanism to aggregate different databases allowing researchers to overcome limitation of any single data set (eg, statistical power, sample characteristics) Stimulate use of existing data sets through targeted research funding Help researchers better understand the sampling methods and variables in existing data sets Facilitate appropriate availability of data to a larger community of researchers by serving as an “honest broker” Assist researchers in accessing data sets by clarifying data use requirements |
| Regulatory Authorities |
Promote public health and innovation Careful attention to development of Big Data methodology—is it hypothesis generation only? Investigate heterogeneity of US population Rethink regulatory framework for informed consent Rethink regulatory framework for Big Data (eg, informed consent, OHRP, cluster randomized clinical trials) |
High priority
Develop analytic standards for
statistical methods, stated plans and protocols, data confirmation Develop common terminology and other data standards Preserve evidence standards for new uses and comparative effectiveness so Big Data can lead to changes in patient solutions and new hypotheses Registry of cardiovascular Big Data analyses |
AHA indicates American Heart Association; CVD, cardiovascular disease; EMR, electronic medical record; OHRP, Office for Human Research Protections; RCT, randomized controlled trial.
Figure 8Vision for the Cardiovascular Genome Phenome Study (CVGPS). Deep genotyping will provide assessment of the genetic and epigenetic determinants of disease. When data from diet, the environment, the microbiome, and deep phenotyping are combined, investigators will establish a 360° look at cardiovascular health. Reprinted from Antman.56
Figure 9The AHA‘s effort in modeling ideal cardiovascular health. There are many ways that ideal cardiovascular health can be modeled in the marketplace at the local community level and internationally. Within the model, there are different levels of engaging the public through Web‐based and “app” technology; wearable devices; and corporate, clinic, and faith‐based modules to create a cloud‐based system of improving cardiovascular health. AHA indicates the American Heart Association; BP, blood pressure; EMR, electronic medical record; FQHC, Federally Qualified Health Center; Gov't, government; H&W, health and wellness; HH, heart healthy; MLC, My Life Check; Phys Act, physical activity.
| Writing Group Member | Employment | Research Grant | Other Research Support | Speakers’ Bureau/Honoraria | Expert Witness | Ownership Interest | Consultant/Advisory Board | Other |
|---|---|---|---|---|---|---|---|---|
| Elliott M. Antman | Brigham & Women's Hospital | None | None | None | None | None | None | None |
| Emelia J. Benjamin | Boston University School of Medicine | NIH/NHLBI | None | None | None | None |
| None |
| Mary Ann Bauman | INTEGRIS Health, Inc | None | None | None | None | None | None | None |
| Nancy Brown | American Heart Association | None | None | None | None | None | None | None |
| Vincent Bufalino | Advocate Healthcare Cardiology | None | None | None | None | None | None | None |
| Robert M. Califf | US FDA | None | None | None | None | None | None | None |
| Mark A. Creager | Brigham and Women's Hospital | None | None | None | None | None | None | None |
| Alan Daugherty | University of Kentucky | None | None | None | None | None | None | None |
| David L. Demets | University of Wisconsin | None | None | None | Marino Law Office | None | Actelion Pharmaceutical | None |
| Bernard P. Dennis | Dennis Associates, LLC | None | None | None | None | None | None | None |
| Shahram Ebadollahi | IBM Healthcare | None | None | None | None | None | None | None |
| Robert A. Harrington | Stanford University | None | None | None | None | None | None | None |
| Steven R. Houser | Temple University School of Medicine | None | None | None | None | None | None | None |
| Mariell Jessup | University of Pennsylvania | None | None | None | None | None | None | None |
| Michael S. Lauer | NHLBI | None | None | None | None | None | None | None |
| Bernard Lo | The Greenwall Foundation | None | None | None | None | None | None | None |
| Calum A. MacRae | Harvard Medical School | None | None | None | None | None | None | None |
| Michael V. McConnell | Stanford University Medical Center | GE Healthcare | None | None | None | None | Google Life Sciences | None |
| Alexa T. McCray | Harvard University | None | None | None | None | None | None | None |
| Michelle M. Mello | Stanford University Law School and School of Medicine | The Greenwall Foundation | None | None | None | None | None | None |
| Eric Mueller | Microsoft | None | None | None | None | None | None | None |
| Jane W. Newburger | Boston Children's Hospital/Harvard Medical School | Bristol Myer Squibb | None | None | None | None | Bristol Myer Squibb | None |
| Sally Okun | patientslikeme | None | None | None | None | None | None | None |
| Milton Packer | University of Texas Southwestern Medical Center | None | None | None | None | None | None | None |
| Eric D. Peterson | Duke Clinical Research Institute | None | None | None | None | None | None | None |
| Anthony Philippakis | Venture Partners | Intel | None | None | None | Google Ventures | None | Google Ventures |
| Peipei Ping | UCLA School of Medicine | None | None | None | None | None | None | None |
| Prad Prasoon | American Heart Association | None | None | None | None | None | None | None |
| Véronique L. Roger | Mayo Clinic | None | None | None | None | None | None | None |
| Steve Singer | Accreditation Council for Continuing Medical Education (ACCME) | None | None | None | None | None | None | None |
| Robert Temple | FDA, Center for Drug Evaluation & Research | None | None | None | None | None | None | None |
| Melanie B. Turner | American Heart Association | None | None | None | None | None | None | American Heart Association |
| Kevin Vigilante | Booz Allen Hamilton | None | None | None | None | None | None | None |
| John Warner | UT Southwestern Medical Center | None | None | None | None | None | None | None |
| Patrick Wayte | American Heart Association | None | None | None | None | None | None | None |
This table represents the relationships of writing group members that may be perceived as actual or reasonably perceived conflicts of interest as reported on the Disclosure Questionnaire, which all members of the writing group are required to complete and submit. A relationship is considered to be “significant” if (1) the person receives $10 000 or more during any 12‐month period, or 5% or more of the person's gross income; or (2) the person owns 5% or more of the voting stock or share of the entity, or owns $10 000 or more of the fair market value of the entity. A relationship is considered to be “modest” if it is less than “significant” under the preceding definition.
Modest.
Significant.