| Literature DB >> 34641870 |
Alvina G Lai1,2, Wai Hoong Chang3,4, Constantinos A Parisinos3, Michail Katsoulis3, Ruth M Blackburn3,4, Anoop D Shah3,5,6, Vincent Nguyen3, Spiros Denaxas3,4,5,7, George Davey Smith8,9, Tom R Gaunt8,9, Krishnarajah Nirantharakumar4,10, Murray P Cox11, Donall Forde12, Folkert W Asselbergs3,4,5,13,14, Steve Harris6, Sylvia Richardson15, Reecha Sofat3,5, Richard J B Dobson3,4,16, Aroon Hingorani4,14, Riyaz Patel14, Jonathan Sterne9, Amitava Banerjee3,17, Alastair K Denniston4,18, Simon Ball4,18, Neil J Sebire19,20, Nigam H Shah21, Graham R Foster22, Bryan Williams5,14,6, Harry Hemingway3,4.
Abstract
BACKGROUND: An Informatics Consult has been proposed in which clinicians request novel evidence from large scale health data resources, tailored to the treatment of a specific patient. However, the availability of such consultations is lacking. We seek to provide an Informatics Consult for a situation where a treatment indication and contraindication coexist in the same patient, i.e., anti-coagulation use for stroke prevention in a patient with both atrial fibrillation (AF) and liver cirrhosis.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34641870 PMCID: PMC8506488 DOI: 10.1186/s12911-021-01638-z
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Informatics Consult Electronic health record request form prototype
Fig. 2Informatics Consult Electronic health record report prototype
Fig. 3Trial evidence and currently recruiting trials of anticoagulation in patients with atrial fibrillation and cirrhosis to reduce stroke risk. A Clinical question and summary of trial evidence. B Previously completed and currently recruiting randomised trials evaluating anticoagulants and stroke outcomes have exclusion criteria related to cirrhosis
Fig. 4New synthesis of prior observational evidence. Meta-analysis of the association between warfarin use and the risk of ischaemic stroke in observational studies including approaches for automation. A Characteristics of observational studies included in the meta-analysis. B Forest plot depicting the hazard ratios calculated with the DerSimonian and Laird random-effects models. HR = hazard ratio; CI = confidence interval; SE = standard error
Fig. 5New observational evidence through target trial emulation (intention-to-treat analysis) where eligibility and treatment assignment were aligned with time zero of follow-up, as is done in randomised controlled trials. A CONSORT diagram showing the selection of eligible individuals for the target trial emulation of anticoagulation therapy in patients with atrial fibrillation and cirrhosis. B Kaplan–Meier plots of the propensity-matched cohort for all-cause mortality and ischaemic stroke. Flow diagram depicts analysis design. P values from logrank tests were indicated. Hazard ratios from Cox proportional hazards regression analyses were indicated. Numbers in parentheses indicate the 95% confidence intervals
Fig. 6Genetic evidence. Two-sample Mendelian randomisation on circulating vitamin K1 levels and risk of stroke. *Indicate significant results
Traditional versus automated approaches for evidence synthesis
| Task | Traditional approach | Approaches to automate |
|---|---|---|
| Search for recruiting trials | Perform search on the clinicaltrials.gov website. Requires manual decisions on relevant search terms | Perform search on the clinicaltrials.gov website using search terms collated from free text input in the Informatics Consult platform. Potential to leverage developments on computable machine-readable trial protocols ( |
| Summarise data of recruiting trials | Download search results from the clinicaltrials.gov website. Manually format tables. Extract additional information not present in downloaded data from the website by inputting NCT numbers | Download search results from the clinicaltrials.gov website. Generate scripts for automated table formatting to retain relevant information. Create a Python web-scraping tool to extract free texts from specific clinical trials and return information on inclusion and exclusion criteria. Note that some websites do not allow web-scraping and exclusions may apply to the clinicaltrials.gov website |
| Search strategy | Requires manual decisions on relevant search terms | Potential for mapping SNOMED-CT terms to MeSH descriptors used in PubMed (PMID:17238584) |
| Identifying existing evidence from published sources and assessing eligibility | Perform searches on PubMed. Manual curation and review of publications. Does not scale | Semi-automated systematic reviews using machine learning and natural language processing for expedited evidence synthesis. For example, using 'bag of words' for classifying documents and using learned coefficients for predicting the probability of an unseen document. Examples of platforms for automating evidence synthesis include RobotReviewer and ExaCT, where the latter employs an information extraction engine that identifies and extracts text fragments that describe clinical trial characteristics on unseen articles ( |
| Extracting data and performing the meta-analysis | Manual extraction of relevant tables and information. Not practical for batch extraction of data | Semi-automated tool for converting PDF documents to XML using a rule-based system such as PDFX. Batch extraction of data from PDF documents can also be performed using the open-source CyberPDF, which improves the accuracy and efficiency in batch data processing. Extracted data is formatted into data frames for subsequent meta-analysis using the meta package in R or other existing packages. ( |
| Specifying the target trial protocol | This process requires a discussion between the clinician and informatician to determine the appropriate criteria, treatment strategies and outcomes | Previous insights on specifying the target trial protocol can be collated automatically and be used to inform future target trial designs |
| Cohort creation based on eligibility criteria in the target trial protocol | Manual cohort creation for each target trial. Does not scale | This process can be pipelined using several functions to create cohorts in a consistent format with the covariates of interest. The DExtER tool for automated cohort creation can be employed |
| Propensity score matching to match initiators and non-initiators | Once a cohort is created in the correct format containing all the covariates of interest, propensity score matching can be performed using the MatchIt package | Additional approaches for causal inference analyses, including causal machine learning using the targeted maximum likelihood estimation approach can be investigated and pipelined |
| Descriptive summary of the cohort before and after matching | The tableone package can be used to generate the baseline tables before and after propensity score matching | Previous descriptive summaries on other related studies can be collated and featured in future target trials that investigate related clinical queries |
| Cox regression on the matched cohort | Cox regression analyses is performed by fitting the coxph function using the survival package | Additional regression analyses can be automated into the pipeline |
| Kaplan Meier analysis on the matched cohort | Survival or cumulative incidence curves are plotted using the survminer package | This can be pipelined to look at multiple outcomes at a time |
| Scaling to other examples and datasets | Limited tractability | Pipeline scalable to other datasets for cohort generation. Free text input from the Informatics Consult request form and report will inform additional opportunities to scale to other clinical questions |
| Identifying genetic variants associated with the exposure (e.g., drug) or risk factor in genome-wide association studies (GWAS). Identifying GWAS summary data for genetic variants associated with the exposure and outcome for a two sample Mendelian randomisation analysis | Manual curation of GWAS summary data. Literature search for published genetic variants for the risk factor | The MR-base platform for Mendelian randomisation can be employed to rapidly identify instruments for the exposure and outcome using GWAS summary data from their catalog. Additional GWAS summary data can be obtained from PhenoScanner, EMBL-EBI GWAS catalog and Integrative Epidemiology Unit OpenGWAS database |
| Performing Mendelian randomisation | Extract and format data identified above. Run Mendelian randomisation in R | MR-base also includes an analytical platform for performing MR analysis. For exposures and outcomes not available in MR-base, this process can be pipelined to transform the GWAS summary data from other public sources into an analysis-ready format. Mendelian randomisation can be performed using the MendelianRandomisation package |