Literature DB >> 20306452

Issues in multiple imputation of missing data for large general practice clinical databases.

Louise Marston1, James R Carpenter, Kate R Walters, Richard W Morris, Irwin Nazareth, Irene Petersen.   

Abstract

PURPOSE: Missing data are a substantial problem in clinical databases. This paper aims to examine patterns of missing data in a primary care database, compare this to nationally representative datasets and explore the use of multiple imputation (MI) for these data.
METHODS: The patterns and extent of missing health indicators in a UK primary care database (THIN) were quantified using 488 384 patients aged 16 or over in their first year after registration with a GP from 354 General Practices. MI models were developed and the resulting data compared to that from nationally representative datasets (14 142 participants aged 16 or over from the Health Survey for England 2006 (HSE) and 4 252 men from the British Regional Heart Study (BRHS)).
RESULTS: Between 22% (smoking) and 38% (height) of health indicator data were missing in newly registered patients, 2004-2006. Distributions of height, weight and blood pressure were comparable to HSE and BRHS, but alcohol and smoking were not. After MI the percentage of smokers and non-drinkers was higher in THIN than the comparison datasets, while the percentage of ex-smokers and heavy drinkers was lower. Height, weight and blood pressure remained similar to the comparison datasets.
CONCLUSIONS: Given available data, the results are consistent with smoking and alcohol data missing not at random whereas height, weight and blood pressure missing at random. Further research is required on suitable imputation methods for smoking and alcohol in such databases.

Entities:  

Mesh:

Year:  2010        PMID: 20306452     DOI: 10.1002/pds.1934

Source DB:  PubMed          Journal:  Pharmacoepidemiol Drug Saf        ISSN: 1053-8569            Impact factor:   2.890


  53 in total

1.  The positivity assumption and marginal structural models: the example of warfarin use and risk of bleeding.

Authors:  Robert William Platt; Joseph Austin Christopher Delaney; Samy Suissa
Journal:  Eur J Epidemiol       Date:  2011-12-08       Impact factor: 8.082

2.  Temporal and within practice variability in the health improvement network.

Authors:  Kevin Haynes; Warren B Bilker; Tom R Tenhave; Brian L Strom; James D Lewis
Journal:  Pharmacoepidemiol Drug Saf       Date:  2011-07-13       Impact factor: 2.890

3. 

Authors:  Eric I Benchimol; Liam Smeeth; Astrid Guttmann; Katie Harron; David Moher; Irene Petersen; Henrik T Sørensen; Jean-Marie Januel; Erik von Elm; Sinéad M Langan
Journal:  CMAJ       Date:  2019-02-25       Impact factor: 8.262

4.  Non-benzodiazepine hypnotic use for sleep disturbance in people aged over 55 years living with dementia: a series of cohort studies.

Authors:  Kathryn Richardson; George M Savva; Penelope J Boyd; Clare Aldus; Ian Maidment; Eduwin Pakpahan; Yoon K Loke; Antony Arthur; Nicholas Steel; Clive Ballard; Robert Howard; Chris Fox
Journal:  Health Technol Assess       Date:  2021-01       Impact factor: 4.014

5.  Collaborative, pooled and harmonized study designs for epidemiologic research: challenges and opportunities.

Authors:  Catherine R Lesko; Lisa P Jacobson; Keri N Althoff; Alison G Abraham; Stephen J Gange; Richard D Moore; Sharada Modur; Bryan Lau
Journal:  Int J Epidemiol       Date:  2018-04-01       Impact factor: 7.196

6.  Insights into social disparities in smoking prevalence using Mosaic, a novel measure of socioeconomic status: an analysis using a large primary care dataset.

Authors:  Aarohi Sharma; Sarah Lewis; Lisa Szatkowski
Journal:  BMC Public Health       Date:  2010-12-07       Impact factor: 3.295

7.  Analyzing partially missing confounder information in comparative effectiveness and safety research of therapeutics.

Authors:  Sengwee Toh; Luis A García Rodríguez; Miguel A Hernán
Journal:  Pharmacoepidemiol Drug Saf       Date:  2012-05       Impact factor: 2.890

8.  Deep learning for clustering of multivariate clinical patient trajectories with missing values.

Authors:  Johann de Jong; Mohammad Asif Emon; Ping Wu; Reagon Karki; Meemansa Sood; Patrice Godard; Ashar Ahmad; Henri Vrooman; Martin Hofmann-Apitius; Holger Fröhlich
Journal:  Gigascience       Date:  2019-11-01       Impact factor: 6.524

9.  Mortality after Transplantation for Hepatocellular Carcinoma: A Study from the European Liver Transplant Registry.

Authors:  Hans-Christian Pommergaard; Andreas Arendtsen Rostved; René Adam; Allan Rasmussen; Mauro Salizzoni; Miguel Angel Gómez Bravo; Daniel Cherqui; Paolo De Simone; Pauline Houssel-Debry; Vincenzo Mazzaferro; Olivier Soubrane; Juan Carlos García-Valdecasas; Joan Fabregat Prous; Antonio D Pinna; John O'Grady; Vincent Karam; Christophe Duvoux; Lau Caspar Thygesen
Journal:  Liver Cancer       Date:  2020-05-12       Impact factor: 11.740

10.  Generalizing Randomized Clinical Trial Results: Implementation and Challenges Related to Missing Data in the Target Population.

Authors:  Jin-Liern Hong; Michele Jonsson Funk; Robert LoCasale; Sara E Dempster; Stephen R Cole; Michael Webster-Clark; Jessie K Edwards; Til Stürmer
Journal:  Am J Epidemiol       Date:  2018-04-01       Impact factor: 4.897

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.