| Literature DB >> 27110215 |
Ofer Harel1, Neil Perkins2, Enrique F Schisterman2.
Abstract
Missing data due to limit of detection and limit of quantification is a common obstacle in epidemiological and biomedical research. We are interested in methodologies that provide unbiased and efficient estimates of these missing data while using popular statistical software. We describe a multiple imputation (MI) procedure for cross-sectional and longitudinal data which examines the sources of variation of hormones levels throughout the menstrual cycle conditional on specific biomarkers. We describe the rational, procedure, advantages and disadvantages of the multiple imputation procedure. We also provide a comparison to commonly used missing data procedures (complete cases analysis and single imputation). We illustrate our approach using the BioCycle data where we are interested in the effects of Vitamin E and Beta-carotene on Progesterone levels. We also evaluate the longitudinal impact of changes in Vitamin E on Progesterone levels over time. Finaly, we demonstrate the advantages of using MI over complete case analysis or naive single replacement in both cross-sectional and longitudinal analysis where measurements below the limit of quantification (LOQ) are unreported. We also illustrate that if available, inclusion of potentially demined unreliable data below the limit of detection (LOD) improves simple estimation substantially.Entities:
Keywords: Complete case analysis; Cross sectional data; Longitudinal analysis; Multiple imputation procedure
Year: 2014 PMID: 27110215 PMCID: PMC4838401 DOI: 10.4038/sljastats.v5i4.7792
Source DB: PubMed Journal: Sri Lankan J Appl Stat ISSN: 1391-4987