Literature DB >> 30069934

Making complex prediction rules applicable for readers: Current practice in random forest literature and recommendations.

Anne-Laure Boulesteix1, Silke Janitza1, Roman Hornung1, Philipp Probst1, Hannah Busen1, Alexander Hapfelmeier2.   

Abstract

Ideally, prediction rules should be published in such a way that readers may apply them, for example, to make predictions for their own data. While this is straightforward for simple prediction rules, such as those based on the logistic regression model, this is much more difficult for complex prediction rules derived by machine learning tools. We conducted a survey of articles reporting prediction rules that were constructed using the random forest algorithm and published in PLOS ONE in 2014-2015 in the field "medical and health sciences", with the aim of identifying issues related to their applicability. Making a prediction rule reproducible is a possible way to ensure that it is applicable; thus reproducibility is also examined in our survey. The presented prediction rules were applicable in only 2 of 30 identified papers, while for further eight prediction rules it was possible to obtain the necessary information by contacting the authors. Various problems, such as nonresponse of the authors, hampered the applicability of prediction rules in the other cases. Based on our experiences from this illustrative survey, we formulate a set of recommendations for authors who aim to make complex prediction rules applicable for readers. All data including the description of the considered studies and analysis codes are available as supplementary materials.
© 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  logistic regression; machine learning; prediction rule; reproducibility; reproducible research

Mesh:

Year:  2018        PMID: 30069934     DOI: 10.1002/bimj.201700243

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  4 in total

Review 1.  Statistical learning approaches in the genetic epidemiology of complex diseases.

Authors:  Anne-Laure Boulesteix; Marvin N Wright; Sabine Hoffmann; Inke R König
Journal:  Hum Genet       Date:  2019-05-02       Impact factor: 4.132

Review 2.  Essential guidelines for computational method benchmarking.

Authors:  Lukas M Weber; Wouter Saelens; Robrecht Cannoodt; Charlotte Soneson; Alexander Hapfelmeier; Paul P Gardner; Anne-Laure Boulesteix; Yvan Saeys; Mark D Robinson
Journal:  Genome Biol       Date:  2019-06-20       Impact factor: 13.583

3.  Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data.

Authors:  Simon Klau; Vindi Jurinovic; Roman Hornung; Tobias Herold; Anne-Laure Boulesteix
Journal:  BMC Bioinformatics       Date:  2018-09-12       Impact factor: 3.169

4.  Machine Learning Approaches to Define Candidates for Ambulatory Single Level Laminectomy Surgery.

Authors:  Qiyi Li; Haoyan Zhong; Federico P Girardi; Jashvant Poeran; Lauren A Wilson; Stavros G Memtsoudis; Jiabin Liu
Journal:  Global Spine J       Date:  2021-01-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.