| Literature DB >> 35047833 |
.
Abstract
The Electronic Medical Records and Genomics (eMERGE) Network, established in 2007, is a consortium of academic and integrated health systems conducting discovery and implementation research in translational genomics. Here, we outline the history of the network, highlight major impacts and lessons learned, and present the tools and resources developed for large-scale genomic analyses and translation into a clinical setting. The network developed methods to extract phenotypes from the electronic medical record to perform genome-wide and phenome-wide association studies. Recruited cohorts were clinically sequenced off a custom panel for targeted sequencing of variants and monogenic disease risks and returned to participants to investigate the impact of return of genomic results. After generating a 105,000 participant-imputed genome-wide association study (GWAS) dataset for discovery, the network enrolled and sequenced 24,998 participants. Integration of these results into the medical record and the effects of results on participants provided key lessons to the field. These learned lessons inform genetic research in diverse populations and provide insights into the clinical impact of return and implementation of genomic medicine using the electronic medical record. The lessons produced by the eMERGE Network can be utilized by other consortia as translational genomic medicine research evolves.Entities:
Keywords: consortium; discovery; electronic medical records; genomic; implementation; network
Year: 2020 PMID: 35047833 PMCID: PMC8756524 DOI: 10.1016/j.xhgg.2020.100018
Source DB: PubMed Journal: HGG Adv ISSN: 2666-2477
Figure 1Timeline and impact of the eMERGE Network
The network has produced 68 clinical phenotypes validated across multiple electronic medical record systems, launched tools focused on the reuse of genomic data, created multiple iterations of a large GWAS imputed dataset culminating with 105,108 participants, and sequenced and returned results off a PGRNseq and eMERGEseq custom sequencing platform.
Figure 2eMERGE impacts on clinical care and discovery
The eMERGE Network began by focusing on discovery-based research (white hexagons) before moving into clinical utility-based research (gray hexagons) in phase III. Discovery-based research remains a foundation of the network, contributing to the broad knowledgebase of clinical genomics, which, in turn, can be utilized to inform standards of clinical care and precision medicine in non-research settings. The image describes the main workgroup topics focused on during the third phase and how the network approached both clinical- and discovery-based science.
Breakdown of eMERGE dataset diversity
| eMERGEseqN = 24,956 | GWAS N = 105,108 | PGRNseq N = 9,010 | |
|---|---|---|---|
| African American | 3,914 (16%) | 15,836 (15%) | 1,209 (13%) |
| Hawaiian/Pacific Islander | 54 (0.2%) | 23 (0.02%) | 6 (0.1%) |
| Am Indian/Alaskan | 79 (0.3%) | 170 (0.2%) | 26 (0.3%) |
| Asian American | 1,578 (6%) | 1,246 (1%) | 135 (2%) |
| European | 17,691 (71%) | 79,764 (76%) | 6,065 (67%) |
| Missing/unknown | 1640 (7%) | 8069 (8%) | 1,569 (17%) |
| Hispanic/Latino | 1,506 (6%) | 5,217 (5%) | 413 (5%) |
| Not Hispanic/Latino | 22,551 (90%) | 93,425 (89%) | 7,313 (81%) |
| Missing/unknown | 899 (4%) | 6,466 (6%) | 1,284 (14%) |
Self-reported ancestry counts and percent of participants in the network-wide genomic datasets. Race and ethnicity are captured independently and represented separately.
Overarching lessons learned from the eMERGE Network
| Focus | Lessons Learned | Tools |
|---|---|---|
| Genomic discovery | • defined data freezes with specifications regarding diversity, phenotypic data, discovery, and implementation goals are critical to maximize resources | SPHINX; PheWAS Catalog |
| Electronic clinical phenotyping | • implementation and review of complex clinical algorithms using local experts maximizes phenotype accuracy | PheKB |
| Pharmaco-genomics | • shared variant knowledge base with access to structured data, and knowledge is necessary for implementation | SPHINX; CDS_KB |
| Clinical sequencing | • centralized sequencing allows for harmonization across large networks | |
| Return of results (RoR) | • flexibility in study design allows exploration of different approaches for RoR | MyResults |
| Integration into EMR | • a standard for data flow is essential for returning genomic test results across sites | CDS_KB; DocUBuild |
| Clinical outcomes | • implementation guides for EHR abstraction ensure consistency across personnel and sites |
Lessons broken down by phase III workgroup, with relevant resources available. SPHINX (Sequence & Phenotype Integration Exchange) links sequencing data to drug associations, GWAS variants, and ancestry. PheWAS Catalog functions as a platform for analysis of phenotypes against single gene variants. PheKB (Phenotype KnowledgeBase. Collaborative) is an environment to build and validate electronic phenotypic algorithms. CDS_KB (Clinical Decision Support KnowledgeBase) catalogs and shares clinical decision support implementation artifacts. MyResults provides education targeted to the public and information about genetic test results and disease risks. DocUBuild is a web application for creating and sharing information resources for electronic medical record systems.