Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Solvable Model for the Linear Separability of Structured Data.

Literature DB >> 33806454

Solvable Model for the Linear Separability of Structured Data.

Abstract

Linear separability, a core concept in supervised machine learning, refers to whether the labels of a data set can be captured by the simplest possible machine: a linear classifier. In order to quantify linear separability beyond this single bit of information, one needs models of data structure parameterized by interpretable quantities, and tractable analytically. Here, I address one class of models with these properties, and show how a combinatorial method allows for the computation, in a mean field approximation, of two useful descriptors of linear separability, one of which is closely related to the popular concept of storage capacity. I motivate the need for multiple metrics by quantifying linear separability in a simple synthetic data set with controlled correlations between the points and their labels, as well as in the benchmark data set MNIST, where the capacity alone paints an incomplete picture. The analytical results indicate a high degree of "universality", or robustness with respect to the microscopic parameters controlling data structure.

Entities: Disease Gene

Keywords: data structure; linear separability; storage capacity

Year: 2021 PMID： 33806454 PMCID： PMC7999416 DOI： 10.3390/e23030305

Source DB: PubMed Journal: Entropy (Basel) ISSN： 1099-4300 Impact factor: 2.524

Keyword Cloud
References

19 in total

1. The linear separability problem: some testing methods.

Authors: D Elizondo
Journal: IEEE Trans Neural Netw Date: 2006-03

2. An overview of statistical learning theory.

Authors: V N Vapnik
Journal: IEEE Trans Neural Netw Date: 1999

3. Mean-field message-passing equations in the Hopfield model and its generalizations.

Authors: Marc Mézard
Journal: Phys Rev E Date: 2017-02-14 Impact factor: 2.529

4. Zipf and Heaps laws from dependency structures in component systems.

Authors: Andrea Mazzolini; Jacopo Grilli; Eleonora De Lazzari; Matteo Osella; Marco Cosentino Lagomarsino; Marco Gherardi
Journal: Phys Rev E Date: 2018-07 Impact factor: 2.529

5. Linear readout of object manifolds.

Authors: SueYeon Chung; Daniel D Lee; Haim Sompolinsky
Journal: Phys Rev E Date: 2016-06-30 Impact factor: 2.529

6. Regulation of chain length in two diatoms as a growth-fragmentation process.

Authors: Marco Gherardi; Alberto Amato; Jean-Pierre Bouly; Soizic Cheminant; Maria Immacolata Ferrante; Maurizio Ribera d'Alcalá; Daniele Iudicone; Angela Falciatore; Marco Cosentino Lagomarsino
Journal: Phys Rev E Date: 2016-08-23 Impact factor: 2.529

7. Nutrient consumption and chain tuning in diatoms exposed to storm-like turbulence.

Authors: Gianluca Dell'Aquila; Maria I Ferrante; Marco Gherardi; Marco Cosentino Lagomarsino; Maurizio Ribera d'Alcalà; Daniele Iudicone; Alberto Amato
Journal: Sci Rep Date: 2017-05-12 Impact factor: 4.379

8. Separability and geometry of object manifolds in deep neural networks.

Authors: Uri Cohen; SueYeon Chung; Daniel D Lee; Haim Sompolinsky
Journal: Nat Commun Date: 2020-02-06 Impact factor: 14.919