| Literature DB >> 32313838 |
Abstract
INTRODUCTION: The European medical information framework (EMIF) was an Innovative Medicines Initiative project jointly supported by the European Union and the European Federation of Pharmaceutical Industries and Associations, that generated a common technology and governance framework to identify, assess and (re)use healthcare data, to facilitate real-world data research. The objectives of EMIF included providing a unified platform to support a wide range of studies within two verification programmes-Alzheimer's disease (EMIF-AD), and metabolic consequences of obesity (EMIF-MET).Entities:
Keywords: EMIF; EMIF‐AD; EMIF‐MET; catalogue; use case
Year: 2019 PMID: 32313838 PMCID: PMC7156868 DOI: 10.1002/lrh2.10214
Source DB: PubMed Journal: Learn Health Syst ISSN: 2379-6146
Figure 1Schematic overview of the infrastructure built during the EMIF project. EMIF Platform has built a federated network of data sources. While JERBOA was initially used, EMIF later switched to using the OMOP CDM and OHDSI tools which were not available at the project commencement. In doing so, a federated network of data sources harmonized to the OMOP CDM was built, on which studies could be run using TASKA as a workflow management tool and Octopus as a private remote research environment. EMIF AD, on the other hand, used two separate infrastructures. The first one relied on TranSMART as a central data repository in which data from cohorts were stored and enriched via multi‐omics analysis on samples of these databases. This allowed us to build the EMIF 1000 cohort which was used to run the EMIF studies. The second infrastructure, however, was the EMIF AD cohort explorer. In this setup, AD cohorts harmonized their data to the Switchbox (an AD specific common data model based on the OMOP CDM). The AD Cohort explorer could then send queries to these harmonized databases after which the aggregated results were shown in the AD cohort explorer (consisting of the patient selection tool [PST] and variable selection tool [VST]). If the cohort custodian agreed to run a research study, the requested data would then be made available in the private remote research environment (PRRE) of the AD cohort explorer, thus implying that both data and governance once again remain local
Figure 2Different levels of access to data sources
Figure 3A high level overview of cohort architecture. Components of cohort architecture: CST (cohort selection tool): This tool provides the researcher with an overview of the availability and applicability of potential cohort data. PST (participant selection tool): The PST allows a researcher to get an overview of patient profiles in a given cohort (currently only supports the AD cohort data sets), filtered on a set of predefined key characteristics. VST (variable selection tool): The VST allows a researcher to get an overview of available variables (aggregated counts, not the actual values). This is followed by a request to the selected cohort owners for data access. PRRE (private remote research environment): For EMIF‐AD, a secured data platform called TranSMART is used for storing, managing and analysing all cohort data. Data uploaded to tranSMART is anonymised and harmonised according to the EMIF‐AD common data model to enable pooling of different cohort data
List of use cases
| Use case | Title |
|---|---|
| 1 | Dementia prevalence and incidence in a federation of European Electronic Health Record (EHR) databases. |
| 2 | BMI and the risk of cardiovascular disease and all‐cause mortality in European electronic medical records databases. |
| 3 | Association of non‐alcoholic fatty liver disease with cardiovascular and liver morbidity in electronic health record databases |
| 4 | Dementia: vascular and metabolic risk factors. |
| 5 | Treatment pathway analysis: An evaluation of treatment patterns and drug utilisation among cases with incident dementia in EHR databases available in the European Medical Information Framework (EMIF). |
| 6 | A nested case–control study of prior history of non‐alcoholic fatty liver disease in demented and cognitively impaired individuals matched to healthy controls in European health records data. |
| 7 | Utilisation of healthcare data to identify sub‐types of heart failure patients based on clinical and/or molecular phenotypes |
| 8 | An exploratory phenome wide association study linking asthma and liver disease single nucleotide polymorphisms and electronic health records from the Estonian Genome Centre at the University of Tartu Database |
| 9 | Investigating the relationship in paediatric population between antibiotics dosing of antibiotics (prescribed, dispensed or administered) and patient's weight. |
| 10 | Trazodone and the risk of dementia: an electronic primary care records analysis. |
| 11 | Identifying cases of type 2 diabetes in heterogeneous data sources: strategy from the EMIF project |
Note: The EMIF‐AD program sought to generate a platform to enable efficient reutilisation of pre‐existing data. Table 1 lists the project use‐cases for reutilisation of this data proposed as the program was set up. Three of these were completed with papers generated as referenced (see below) and others are in various phases of development. However, in addition to these use‐cases, EMIF‐AD had one large “meta use‐case” to re‐use existing cohort data to identify participants to studies who had generated data and donated biofluid samples that would enable biomarker discovery and validation studies. Specifically, we sought to identify biomarkers to facilitate therapeutic trials. This use‐case was singularly successful, rapidly generating a virtual cohort assembled from pre‐existing cohort data and then accessing samples from these individuals. This process is described in Bos et al (Bos I, Vos S, Verhey F, et al. Cerebrospinal fluid biomarkers of neurodegeneration, synaptic integrity, and astroglial activation across the clinical Alzheimer's disease spectrum. Alzheimer's dement. 2019;15(5):644‐54.) and some of the published outcomes listed are here (van Maurik IS, Vos SJ, Bos I, et al. Biomarker‐based prognosis for people with mild cognitive impairment (ABIDE): a modelling study. Lancet Neurol. 2019. https://doi.org/10.1016/S1474-4422(19)30283‐2; Shi L, Westwood S, Baird AL, et al. Discovery and validation of plasma proteomic biomarkers relating to brain amyloid burden by SOMAscan assay. Alzheimer's Dement. 2019. https://doi.org/10.1016/j.jalz.2019.06.4951; Morgan AR, Touchard S, Leckey C, et al. Inflammatory biomarkers in Alzheimer's disease plasma. Alzheimer's dement. 2019;15(6):776‐787; Kim M, Snowden S, Suvitaival T, et al. Primary fatty amides in plasma associated with brain amyloid burden, hippocampal volume, and memory in the European Medical Information Framework for Alzheimer's Disease biomarker discovery cohort. Alzheimer's dement. 2019;15(6):817‐27; Westwood S, Baird AL, Hye A, et al. Plasma protein biomarkers for the prediction of CSF amyloid and Tau and [18F]‐Flutemetamol PET scan result. Front Aging Neurosci. 2018;10:409; Ten Kate M, Redolfi A, Peira E, et al. MRI predictors of amyloid pathology: results from the EMIF‐AD Multimodal Biomarker Discovery study. Alzheimers Res Ther. 2018;10(1):100; Bos I, Vos S, Vandenberghe R, et al. The EMIF‐AD Multimodal Biomarker Discovery study: design, methods and cohort characteristics. Alzheimers Res Ther. 2018;10(1):64; Hong S, Prokopenko D, Dobricic V et al. Genome‐wide association study of Alzheimer's disease CSF biomarkers in the EMIF‐AD Multimodal Biomarker Discovery dataset. bioRxiv. https://doi.org/10.1101/774554.) with others in generation (Hong S, Prokopenko D, Dobricic V et al. Genome‐wide association study of Alzheimer's disease CSF biomarkers in the EMIF‐AD Multimodal Biomarker Discovery dataset. bioRxiv. https://doi.org/10.1101/774554.).