| Literature DB >> 34629446 |
Barbara J Kenner1, Natalie D Abrams2, Suresh T Chari3, Bruce F Field1, Ann E Goldberg1, William A Hoos4, David S Klimstra5, Laura J Rothschild1, Sudhir Srivastava2, Matthew R Young2, Vay Liang W Go6.
Abstract
ABSTRACT: The potential of artificial intelligence (AI) applied to clinical data from electronic health records (EHRs) to improve early detection for pancreatic and other cancers remains underexplored. The Kenner Family Research Fund, in collaboration with the Cancer Biomarker Research Group at the National Cancer Institute, organized the workshop entitled: "Early Detection of Pancreatic Cancer: Opportunities and Challenges in Utilizing Electronic Health Records (EHR)" in March 2021. The workshop included a select group of panelists with expertise in pancreatic cancer, EHR data mining, and AI-based modeling. This review article reflects the findings from the workshop and assesses the feasibility of AI-based data extraction and modeling applied to EHRs. It highlights the increasing role of data sharing networks and common data models in improving the secondary use of EHR data. Current efforts using EHR data for AI-based modeling to enhance early detection of pancreatic cancer show promise. Specific challenges (biology, limited data, standards, compatibility, legal, quality, AI chasm, incentives) are identified, with mitigation strategies summarized and next steps identified.Entities:
Mesh:
Year: 2021 PMID: 34629446 PMCID: PMC8542068 DOI: 10.1097/MPA.0000000000001882
Source DB: PubMed Journal: Pancreas ISSN: 0885-3177 Impact factor: 3.243
Health Data Networks and CDMs
| Network/Funders | CDM | Reference and/or URL |
|---|---|---|
| TriNetX | TriNetX | [ |
| OHDSI/Reagan-Udal Foundation, FDA, PhARMA, FNIH | OMOP |
|
| Cleveland Clinic | UMLS-based |
|
| i2b2/NIH NCBC | i2b2/ACT | [ |
| MSK Cancer Center | MSK-specific |
[ |
| PCORI | PCORNet |
|
| REP | REP-specific |
|
| Cancer Research Network Consortium/NCI | VDW | [ |
| VSD/CDC | CDC-specific |
|
| Sentinel Data Partners/ FDA | Sentinel |
|
ACT indicates the Accrual to Clinical Trials project; CDC, the Centers for Disease Control and Prevention; FNIH, the Foundation for the National Institutes of Health; NCBC, the National Centers for Biomedical Computing; NIH, the National Institutes of Health; OHDSI, the Occupational Health Data Sciences and Informatics group; PhARMA, the Pharmaceutical Research and Manufacturers of America; PCORNet, the National Patient-Centered Clinical Research Network; URL, Uniform Resource Locator; VDW, the Virtual Data Warehouse; VSD, the Vaccine Safety Datalink project.
FIGURE 1Early detection conceptual framework. Courtesy of Michael Rosenthal.
Challenges, Potential Mitigation Strategies, and Next Steps
| Challenge | Mitigation Strategy | Next Steps |
|---|---|---|
| Complex PDAC biology | Tracking multiple potential predictors over time | • Develop better understanding of interrelationship between PDAC and other diseases of the pancreas |
| Limited availability of longitudinal data for AI-based risk modeling | Secondary use of EHR and other relevant patient data collected and harmonized by clinical data sharing networks | • Facilitate retrospective cohort discovery by leveraging centralized and federated EHR data repositories compiled by data sharing networks |
| Gap between health data standards and clinical research data standards | Fast Healthcare Interoperability Resource and related efforts | • Incentivize common adoption of standards by clinical and research communities |
| Incompatible CDMs used by health care systems | Data sharing networks, which normalize EHRs using network specific CDMs | • Extend CDM mappings relevant to PDAC |
| Legal, intellectual property, privacy related, data sharing barriers across organizations and networks | Develop privacy-protecting data sharing approaches like federated learning and other private-sector innovations | • Validate and leverage federated learning and other private-sector innovations in data sharing for risk modeling of PDAC |
| Insufficient data quality and reliability | Approaches for validation of data reliability and quality developed by data sharing networks | • Develop shared criteria for evaluating data reliability and quality |
| AI chasm: poor clinical utility and generalizability of AI systems | Transparency of validation, benchmarking competitions and possibly other certification process (ie, FDA, NCCN, USPSTF) | • Incentivize interdisciplinary partnerships and AI validation and benchmarking efforts |
| Lack of incentives for efforts focused on precision prevention of PDAC and public health management | Multidisease partnerships and collaborations to build a virtual network of researchers and clinicians | • Leverage existing networks to promote academic-industry partnerships |
eCRFs indicates electronic case report forms; IT, information technology; NCCN, the National Comprehensive Cancer Network; ONC, the Office of the National Coordinator for Health Information Technology; USPSTF, the US Preventive Services Task Force.