Literature DB >> 31235593

Visualizing probabilistic models and data with Intensive Principal Component Analysis.

Katherine N Quinn1, Colin B Clement2, Francesco De Bernardis2, Michael D Niemack2, James P Sethna2.   

Abstract

Unsupervised learning makes manifest the underlying structure of data without curated training and specific problem definitions. However, the inference of relationships between data points is frustrated by the "curse of dimensionality" in high dimensions. Inspired by replica theory from statistical mechanics, we consider replicas of the system to tune the dimensionality and take the limit as the number of replicas goes to zero. The result is intensive embedding, which not only is isometric (preserving local distances) but also allows global structure to be more transparently visualized. We develop the Intensive Principal Component Analysis (InPCA) and demonstrate clear improvements in visualizations of the Ising model of magnetic spins, a neural network, and the dark energy cold dark matter ([Formula: see text]) model as applied to the cosmic microwave background.

Keywords:  information theory; manifold learning; probabilistic data; probabilistic models; visualization

Year:  2019        PMID: 31235593      PMCID: PMC6628833          DOI: 10.1073/pnas.1817218116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  5 in total

1.  Parameter space compression underlies emergent theories and predictive models.

Authors:  Benjamin B Machta; Ricky Chachra; Mark K Transtrum; James P Sethna
Journal:  Science       Date:  2013-11-01       Impact factor: 47.728

Review 2.  Perspective: Sloppiness and emergent theories in physics, biology, and beyond.

Authors:  Mark K Transtrum; Benjamin B Machta; Kevin S Brown; Bryan C Daniels; Christopher R Myers; James P Sethna
Journal:  J Chem Phys       Date:  2015-07-07       Impact factor: 3.488

3.  Visualizing High-Dimensional Data: Advances in the Past Decade.

Authors:  Shusen Liu; Dan Maljovec; Bei Wang; Peer-Timo Bremer; Valerio Pascucci
Journal:  IEEE Trans Vis Comput Graph       Date:  2016-12-15       Impact factor: 4.579

4.  Model reduction by manifold boundaries.

Authors:  Mark K Transtrum; Peng Qiu
Journal:  Phys Rev Lett       Date:  2014-08-29       Impact factor: 9.161

5.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps.

Authors:  R R Coifman; S Lafon; A B Lee; M Maggioni; B Nadler; F Warner; S W Zucker
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-17       Impact factor: 12.779

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.