Literature DB >> 29218882

Improving the explainability of Random Forest classifier - user centered approach.

Dragutin Petkovic1, Russ Altman, Mike Wong, Arthur Vigil.   

Abstract

Machine Learning (ML) methods are now influencing major decisions about patient care, new medical methods, drug development and their use and importance are rapidly increasing in all areas. However, these ML methods are inherently complex and often difficult to understand and explain resulting in barriers to their adoption and validation. Our work (RFEX) focuses on enhancing Random Forest (RF) classifier explainability by developing easy to interpret explainability summary reports from trained RF classifiers as a way to improve the explainability for (often non-expert) users. RFEX is implemented and extensively tested on Stanford FEATURE data where RF is tasked with predicting functional sites in 3D molecules based on their electrochemical signatures (features). In developing RFEX method we apply user-centered approach driven by explainability questions and requirements collected by discussions with interested practitioners. We performed formal usability testing with 13 expert and non-expert users to verify RFEX usefulness. Analysis of RFEX explainability report and user feedback indicates its usefulness in significantly increasing explainability and user confidence in RF classification on FEATURE data. Notably, RFEX summary reports easily reveal that one needs very few (from 2-6 depending on a model) top ranked features to achieve 90% or better of the accuracy when all 480 features are used.

Entities:  

Mesh:

Year:  2018        PMID: 29218882      PMCID: PMC5728671     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  6 in total

1.  Recognizing complex, asymmetric functional sites in protein structures using a Bayesian scoring function.

Authors:  Liping Wei; Russ B Altman
Journal:  J Bioinform Comput Biol       Date:  2003-04       Impact factor: 1.122

Review 2.  Use of data mining at the Food and Drug Administration.

Authors:  Hesha J Duggirala; Joseph M Tonning; Ella Smith; Roselie A Bright; John D Baker; Robert Ball; Carlos Bell; Susan J Bright-Ponte; Taxiarchis Botsis; Khaled Bouri; Marc Boyer; Keith Burkhart; G Steven Condrey; James J Chen; Stuart Chirtel; Ross W Filice; Henry Francis; Hongying Jiang; Jonathan Levine; David Martin; Taiye Oladipo; Rene O'Neill; Lee Anne M Palmer; Antonio Paredes; George Rochester; Deborah Sholtes; Ana Szarfman; Hui-Lee Wong; Zhiheng Xu; Taha Kass-Hout
Journal:  J Am Med Inform Assoc       Date:  2015-07-23       Impact factor: 4.497

3.  High precision prediction of functional sites in protein structures.

Authors:  Ljubomir Buturovic; Mike Wong; Grace W Tang; Russ B Altman; Dragutin Petkovic
Journal:  PLoS One       Date:  2014-03-14       Impact factor: 3.240

4.  New and continuing developments at PROSITE.

Authors:  Christian J A Sigrist; Edouard de Castro; Lorenzo Cerutti; Béatrice A Cuche; Nicolas Hulo; Alan Bridge; Lydie Bougueleret; Ioannis Xenarios
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

5.  Learning accurate and interpretable models based on regularized random forests regression.

Authors:  Sheng Liu; Shamitha Dissanayake; Sanjay Patel; Xin Dang; Todd Mlsna; Yixin Chen; Dawn Wilkins
Journal:  BMC Syst Biol       Date:  2014-10-22

6.  Machine learning methods for metabolic pathway prediction.

Authors:  Joseph M Dale; Liviu Popescu; Peter D Karp
Journal:  BMC Bioinformatics       Date:  2010-01-08       Impact factor: 3.169

  6 in total
  9 in total

1.  Prediction of slaughter age in pigs and assessment of the predictive value of phenotypic and genetic information using random forest.

Authors:  Ahmad Alsahaf; George Azzopardi; Bart Ducro; Egiel Hanenberg; Roel F Veerkamp; Nicolai Petkov
Journal:  J Anim Sci       Date:  2018-12-03       Impact factor: 3.159

Review 2.  Principles and Practice of Explainable Machine Learning.

Authors:  Vaishak Belle; Ioannis Papantonis
Journal:  Front Big Data       Date:  2021-07-01

3.  DNA methylation-based profiling of bone and soft tissue tumours: a validation study of the 'DKFZ Sarcoma Classifier'.

Authors:  Iben Lyskjaer; Solange De Noon; Roberto Tirabosco; Ana Maia Rocha; Daniel Lindsay; Fernanda Amary; Hongtao Ye; Daniel Schrimpf; Damian Stichel; Martin Sill; Christian Koelsche; Nischalan Pillay; Andreas Von Deimling; Stephan Beck; Adrienne M Flanagan
Journal:  J Pathol Clin Res       Date:  2021-05-05

4.  Clinician Preimplementation Perspectives of a Decision-Support Tool for the Prediction of Cardiac Arrhythmia Based on Machine Learning: Near-Live Feasibility and Qualitative Study.

Authors:  Stina Matthiesen; Søren Zöga Diederichsen; Mikkel Klitzing Hartmann Hansen; Christina Villumsen; Mats Christian Højbjerg Lassen; Peter Karl Jacobsen; Niels Risum; Bo Gregers Winkel; Berit T Philbert; Jesper Hastrup Svendsen; Tariq Osman Andersen
Journal:  JMIR Hum Factors       Date:  2021-11-26

5.  A Novel Integration of IF-DEMATEL and TOPSIS for the Classifier Selection Problem in Assistive Technology Adoption for People with Dementia.

Authors:  Miguel Angel Ortíz-Barrios; Matias Garcia-Constantino; Chris Nugent; Isaac Alfaro-Sarmiento
Journal:  Int J Environ Res Public Health       Date:  2022-01-20       Impact factor: 3.390

6.  Exploration of Biomarkers of Psoriasis through Combined Multiomics Analysis.

Authors:  Lu Xing; Tao Wu; Li Yu; Nian Zhou; Zhao Zhang; Yunjing Pu; Jinnan Wu; Hong Shu
Journal:  Mediators Inflamm       Date:  2022-09-23       Impact factor: 4.529

7.  Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques.

Authors:  Thi-Minh-Trang Huynh; Chuen-Fa Ni; Yu-Sheng Su; Vo-Chau-Ngan Nguyen; I-Hsien Lee; Chi-Ping Lin; Hoang-Hiep Nguyen
Journal:  Int J Environ Res Public Health       Date:  2022-09-26       Impact factor: 4.614

8.  Plasma Proteomics in Healthy Subjects with Differences in Tissue Glucocorticoid Sensitivity Identifies A Novel Proteomic Signature.

Authors:  Nicolas C Nicolaides; Manousos Makridakis; Rafael Stroggilos; Vasiliki Lygirou; Eleni Koniari; Ifigeneia Papageorgiou; Amalia Sertedaki; Jerome Zoidakis; Evangelia Charmandari
Journal:  Biomedicines       Date:  2022-01-16

9.  Anti-Ebola: an initiative to predict Ebola virus inhibitors through machine learning.

Authors:  Akanksha Rajput; Manoj Kumar
Journal:  Mol Divers       Date:  2021-08-06       Impact factor: 2.943

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.