Literature DB >> 32049320

A cautionary tale on using imputation methods for inference in matched pairs design.

Burim Ramosaj1, Lubna Amro1, Markus Pauly1.   

Abstract

MOTIVATION: Imputation procedures in biomedical fields have turned into statistical practice, since further analyses can be conducted ignoring the former presence of missing values. In particular, non-parametric imputation schemes like the Random Forest have shown favorable imputation performance compared to the more traditionally used MICE procedure. However, their effect on valid statistical inference has not been analyzed so far. This paper closes this gap by investigating their validity for inferring mean differences in incompletely observed pairs while opposing them to a recent approach that only works with the given observations at hand.
RESULTS: Our findings indicate that machine learning schemes for (multiply) imputing missing values may inflate type-I-error or result in comparably low power in small to moderate matched pairs, even after modifying the test statistics using Rubin's multiple imputation rule. In addition to an extensive simulation study, an illustrative data example from a breast cancer gene study has been considered. AVAILABILITY: The corresponding R-code can be accessed through the authors and the gene expression data can be downloaded at www.gdac.broadinstitute.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2020). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2020        PMID: 32049320     DOI: 10.1093/bioinformatics/btaa082

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Ranking procedures for repeated measures designs with missing data: Estimation, testing and asymptotic theory.

Authors:  Kerstin Rubarth; Markus Pauly; Frank Konietschke
Journal:  Stat Methods Med Res       Date:  2021-11-29       Impact factor: 3.021

2.  On the Relation between Prediction and Imputation Accuracy under Missing Covariates.

Authors:  Burim Ramosaj; Justus Tulowietzki; Markus Pauly
Journal:  Entropy (Basel)       Date:  2022-03-09       Impact factor: 2.524

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.