Literature DB >> 32607472

From Local Explanations to Global Understanding with Explainable AI for Trees.

Scott M Lundberg1,2, Gabriel Erion2,3, Hugh Chen2, Alex DeGrave2,3, Jordan M Prutkin4, Bala Nair5,6, Ronit Katz7, Jonathan Himmelfarb7, Nisha Bansal7, Su-In Lee2.   

Abstract

Tree-based machine learning models such as random forests, decision trees, and gradient boosted trees are popular non-linear predictive models, yet comparatively little attention has been paid to explaining their predictions. Here, we improve the interpretability of tree-based models through three main contributions: 1) The first polynomial time algorithm to compute optimal explanations based on game theory. 2) A new type of explanation that directly measures local feature interaction effects. 3) A new set of tools for understanding global model structure based on combining many local explanations of each prediction. We apply these tools to three medical machine learning problems and show how combining many high-quality local explanations allows us to represent global structure while retaining local faithfulness to the original model. These tools enable us to i) identify high magnitude but low frequency non-linear mortality risk factors in the US population, ii) highlight distinct population sub-groups with shared risk characteristics, iii) identify non-linear interaction effects among risk factors for chronic kidney disease, and iv) monitor a machine learning model deployed in a hospital by identifying which features are degrading the model's performance over time. Given the popularity of tree-based machine learning models, these improvements to their interpretability have implications across a broad set of domains.

Entities:  

Year:  2020        PMID: 32607472      PMCID: PMC7326367          DOI: 10.1038/s42256-019-0138-9

Source DB:  PubMed          Journal:  Nat Mach Intell        ISSN: 2522-5839


  12 in total

1.  On the interpretation of weight vectors of linear models in multivariate neuroimaging.

Authors:  Stefan Haufe; Frank Meinecke; Kai Görgen; Sven Dähne; John-Dylan Haynes; Benjamin Blankertz; Felix Bießmann
Journal:  Neuroimage       Date:  2013-11-15       Impact factor: 6.556

2.  Plan and operation of the NHANES I Epidemiologic Followup Study, 1992.

Authors:  C S Cox; M E Mussolino; S T Rothwell; M A Lane; C D Golden; J H Madans; J J Feldman
Journal:  Vital Health Stat 1       Date:  1997-12

3.  Association between Monocyte Count and Risk of Incident CKD and Progression to ESRD.

Authors:  Benjamin Bowe; Yan Xie; Hong Xian; Tingting Li; Ziyad Al-Aly
Journal:  Clin J Am Soc Nephrol       Date:  2017-03-27       Impact factor: 8.237

4.  Explainable machine-learning predictions for the prevention of hypoxaemia during surgery.

Authors:  Scott M Lundberg; Bala Nair; Monica S Vavilala; Mayumi Horibe; Michael J Eisses; Trevor Adams; David E Liston; Daniel King-Wai Low; Shu-Fang Newman; Jerry Kim; Su-In Lee
Journal:  Nat Biomed Eng       Date:  2018-10-10       Impact factor: 25.671

5.  A Randomized Trial of Intensive versus Standard Blood-Pressure Control.

Authors:  Jackson T Wright; Jeff D Williamson; Paul K Whelton; Joni K Snyder; Kaycee M Sink; Michael V Rocco; David M Reboussin; Mahboob Rahman; Suzanne Oparil; Cora E Lewis; Paul L Kimmel; Karen C Johnson; David C Goff; Lawrence J Fine; Jeffrey A Cutler; William C Cushman; Alfred K Cheung; Walter T Ambrosius
Journal:  N Engl J Med       Date:  2015-11-09       Impact factor: 91.245

6.  Screening large-scale association study data: exploiting interactions using random forests.

Authors:  Kathryn L Lunetta; L Brooke Hayward; Jonathan Segal; Paul Van Eerdewegh
Journal:  BMC Genet       Date:  2004-12-10       Impact factor: 2.797

7.  White blood cell count predicts the odds of kidney function decline in a Chinese community-based population.

Authors:  Fangfang Fan; Jia Jia; Jianping Li; Yong Huo; Yan Zhang
Journal:  BMC Nephrol       Date:  2017-06-07       Impact factor: 2.388

8.  Unmasking Clever Hans predictors and assessing what machines really learn.

Authors:  Sebastian Lapuschkin; Stephan Wäldchen; Alexander Binder; Grégoire Montavon; Wojciech Samek; Klaus-Robert Müller
Journal:  Nat Commun       Date:  2019-03-11       Impact factor: 14.919

9.  Repeated observation of breast tumor subtypes in independent gene expression data sets.

Authors:  Therese Sorlie; Robert Tibshirani; Joel Parker; Trevor Hastie; J S Marron; Andrew Nobel; Shibing Deng; Hilde Johnsen; Robert Pesich; Stephanie Geisler; Janos Demeter; Charles M Perou; Per E Lønning; Patrick O Brown; Anne-Lise Børresen-Dale; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-26       Impact factor: 12.779

10.  A random forest approach to the detection of epistatic interactions in case-control studies.

Authors:  Rui Jiang; Wanwan Tang; Xuebing Wu; Wenhui Fu
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more
  333 in total

1.  Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions.

Authors:  Raquel Rodríguez-Pérez; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2020-05-02       Impact factor: 3.686

2.  Machine Learning in Oncology: Methods, Applications, and Challenges.

Authors:  Dimitris Bertsimas; Holly Wiberg
Journal:  JCO Clin Cancer Inform       Date:  2020-10

3.  Elucidating the constitutive relationship of calcium-silicate-hydrate gel using high throughput reactive molecular simulations and machine learning.

Authors:  Gideon A Lyngdoh; Hewenxuan Li; Mohd Zaki; N M Anoop Krishnan; Sumanta Das
Journal:  Sci Rep       Date:  2020-12-07       Impact factor: 4.379

4.  A governance model for the application of AI in health care.

Authors:  Sandeep Reddy; Sonia Allan; Simon Coghlan; Paul Cooper
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

5.  An Interpretable Hand-Crafted Feature-Based Model for Atrial Fibrillation Detection.

Authors:  Rahimeh Rouhi; Marianne Clausel; Julien Oster; Fabien Lauer
Journal:  Front Physiol       Date:  2021-05-13       Impact factor: 4.566

6.  A comparison of machine learning models versus clinical evaluation for mortality prediction in patients with sepsis.

Authors:  William P T M van Doorn; Patricia M Stassen; Hella F Borggreve; Maaike J Schalkwijk; Judith Stoffers; Otto Bekers; Steven J R Meex
Journal:  PLoS One       Date:  2021-01-19       Impact factor: 3.240

7.  Identification of Patients with Sarcopenia Using Gait Parameters Based on Inertial Sensors.

Authors:  Jeong-Kyun Kim; Myung-Nam Bae; Kang Bok Lee; Sang Gi Hong
Journal:  Sensors (Basel)       Date:  2021-03-04       Impact factor: 3.576

8.  A phenomapping-derived tool to personalize the selection of anatomical vs. functional testing in evaluating chest pain (ASSIST).

Authors:  Evangelos K Oikonomou; David Van Dijk; Helen Parise; Marc A Suchard; James de Lemos; Charalambos Antoniades; Eric J Velazquez; Edward J Miller; Rohan Khera
Journal:  Eur Heart J       Date:  2021-07-08       Impact factor: 29.983

9.  Lung mass density prediction using machine learning based on ultrasound surface wave elastography and pulmonary function testing.

Authors:  Boran Zhou; Brian J Bartholmai; Sanjay Kalra; Thomas Osborn; Xiaoming Zhang
Journal:  J Acoust Soc Am       Date:  2021-02       Impact factor: 1.840

10.  Clinical applications of artificial intelligence and machine learning-based methods in inflammatory bowel disease.

Authors:  Shirley Cohen-Mekelburg; Sameer Berry; Ryan W Stidham; Ji Zhu; Akbar K Waljee
Journal:  J Gastroenterol Hepatol       Date:  2021-02       Impact factor: 4.029

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.