Literature DB >> 33806454

Solvable Model for the Linear Separability of Structured Data.

Marco Gherardi1,2.   

Abstract

Linear separability, a core concept in supervised machine learning, refers to whether the labels of a data set can be captured by the simplest possible machine: a linear classifier. In order to quantify linear separability beyond this single bit of information, one needs models of data structure parameterized by interpretable quantities, and tractable analytically. Here, I address one class of models with these properties, and show how a combinatorial method allows for the computation, in a mean field approximation, of two useful descriptors of linear separability, one of which is closely related to the popular concept of storage capacity. I motivate the need for multiple metrics by quantifying linear separability in a simple synthetic data set with controlled correlations between the points and their labels, as well as in the benchmark data set MNIST, where the capacity alone paints an incomplete picture. The analytical results indicate a high degree of "universality", or robustness with respect to the microscopic parameters controlling data structure.

Entities:  

Keywords:  data structure; linear separability; storage capacity

Year:  2021        PMID: 33806454      PMCID: PMC7999416          DOI: 10.3390/e23030305

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


  19 in total

1.  The linear separability problem: some testing methods.

Authors:  D Elizondo
Journal:  IEEE Trans Neural Netw       Date:  2006-03

2.  An overview of statistical learning theory.

Authors:  V N Vapnik
Journal:  IEEE Trans Neural Netw       Date:  1999

3.  Mean-field message-passing equations in the Hopfield model and its generalizations.

Authors:  Marc Mézard
Journal:  Phys Rev E       Date:  2017-02-14       Impact factor: 2.529

4.  Zipf and Heaps laws from dependency structures in component systems.

Authors:  Andrea Mazzolini; Jacopo Grilli; Eleonora De Lazzari; Matteo Osella; Marco Cosentino Lagomarsino; Marco Gherardi
Journal:  Phys Rev E       Date:  2018-07       Impact factor: 2.529

5.  Linear readout of object manifolds.

Authors:  SueYeon Chung; Daniel D Lee; Haim Sompolinsky
Journal:  Phys Rev E       Date:  2016-06-30       Impact factor: 2.529

6.  Regulation of chain length in two diatoms as a growth-fragmentation process.

Authors:  Marco Gherardi; Alberto Amato; Jean-Pierre Bouly; Soizic Cheminant; Maria Immacolata Ferrante; Maurizio Ribera d'Alcalá; Daniele Iudicone; Angela Falciatore; Marco Cosentino Lagomarsino
Journal:  Phys Rev E       Date:  2016-08-23       Impact factor: 2.529

7.  Nutrient consumption and chain tuning in diatoms exposed to storm-like turbulence.

Authors:  Gianluca Dell'Aquila; Maria I Ferrante; Marco Gherardi; Marco Cosentino Lagomarsino; Maurizio Ribera d'Alcalà; Daniele Iudicone; Alberto Amato
Journal:  Sci Rep       Date:  2017-05-12       Impact factor: 4.379

8.  Separability and geometry of object manifolds in deep neural networks.

Authors:  Uri Cohen; SueYeon Chung; Daniel D Lee; Haim Sompolinsky
Journal:  Nat Commun       Date:  2020-02-06       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.