Literature DB >> 33353094

Examining the Causal Structures of Deep Neural Networks Using Information Theory.

Scythia Marrow1, Eric J Michaud2, Erik Hoel1.   

Abstract

Deep Neural Networks (DNNs) are often examined at the level of their response to input, such as analyzing the mutual information between nodes and data sets. Yet DNNs can also be examined at the level of causation, exploring "what does what" within the layers of the network itself. Historically, analyzing the causal structure of DNNs has received less attention than understanding their responses to input. Yet definitionally, generalizability must be a function of a DNN's causal structure as it reflects how the DNN responds to unseen or even not-yet-defined future inputs. Here, we introduce a suite of metrics based on information theory to quantify and track changes in the causal structure of DNNs during training. Specifically, we introduce the effective information (EI) of a feedforward DNN, which is the mutual information between layer input and output following a maximum-entropy perturbation. The EI can be used to assess the degree of causal influence nodes and edges have over their downstream targets in each layer. We show that the EI can be further decomposed in order to examine the sensitivity of a layer (measured by how well edges transmit perturbations) and the degeneracy of a layer (measured by how edge overlap interferes with transmission), along with estimates of the amount of integrated information of a layer. Together, these properties define where each layer lies in the "causal plane", which can be used to visualize how layer connectivity becomes more sensitive or degenerate over time, and how integration changes during training, revealing how the layer-by-layer causal structure differentiates. These results may help in understanding the generalization capabilities of DNNs and provide foundational tools for making DNNs both more generalizable and more explainable.

Entities:  

Keywords:  artificial neural networks; causation; information theory

Year:  2020        PMID: 33353094      PMCID: PMC7766755          DOI: 10.3390/e22121429

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


  21 in total

1.  Slow feature analysis: unsupervised learning of invariances.

Authors:  Laurenz Wiskott; Terrence J Sejnowski
Journal:  Neural Comput       Date:  2002-04       Impact factor: 2.026

2.  What Caused What? A Quantitative Account of Actual Causation Using Dynamical Causal Networks.

Authors:  Larissa Albantakis; William Marshall; Erik Hoel; Giulio Tononi
Journal:  Entropy (Basel)       Date:  2019-05-02       Impact factor: 2.524

3.  Consciousness as integrated information: a provisional manifesto.

Authors:  Giulio Tononi
Journal:  Biol Bull       Date:  2008-12       Impact factor: 1.818

Review 4.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

Review 5.  Science, technology and the future of small autonomous drones.

Authors:  Dario Floreano; Robert J Wood
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

6.  Unified framework for information integration based on information geometry.

Authors:  Masafumi Oizumi; Naotsugu Tsuchiya; Shun-Ichi Amari
Journal:  Proc Natl Acad Sci U S A       Date:  2016-12-06       Impact factor: 11.205

7.  How causal analysis can reveal autonomy in models of biological systems.

Authors:  William Marshall; Hyunju Kim; Sara I Walker; Giulio Tononi; Larissa Albantakis
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2017-12-28       Impact factor: 4.226

8.  Integrated information in discrete dynamical systems: motivation and theoretical framework.

Authors:  David Balduzzi; Giulio Tononi
Journal:  PLoS Comput Biol       Date:  2008-06-13       Impact factor: 4.475

9.  Measuring information integration.

Authors:  Giulio Tononi; Olaf Sporns
Journal:  BMC Neurosci       Date:  2003-12-02       Impact factor: 3.288

10.  Improved Measures of Integrated Information.

Authors:  Max Tegmark
Journal:  PLoS Comput Biol       Date:  2016-11-21       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.