| Literature DB >> 31235593 |
Katherine N Quinn, Colin B Clement, Francesco De Bernardis, Michael D Niemack, James P Sethna.
Abstract
Unsupervised learning makes manifest the underlying structure of data without curated training and specific problem definitions. However, the inference of relationships between data points is frustrated by the "curse of dimensionality" in high dimensions. Inspired by replica theory from statistical mechanics, we consider replicas of the system to tune the dimensionality and take the limit as the number of replicas goes to zero. The result is intensive embedding, which not only is isometric (preserving local distances) but also allows global structure to be more transparently visualized. We develop the Intensive Principal Component Analysis (InPCA) and demonstrate clear improvements in visualizations of the Ising model of magnetic spins, a neural network, and the dark energy cold dark matter (ΛCDM) model as applied to the cosmic microwave background.
Keywords: information theory; manifold learning; probabilistic data; probabilistic models; visualization
Year: 2019 PMID: 31235593 PMCID: PMC6628833 DOI: 10.1073/pnas.1817218116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
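The abstract does not spell out the InPCA construction, only that an intensive, isometric embedding of probabilistic models is obtained in the zero-replica limit. As a hedged sketch of one common reading, the code below assumes the squared distance between two models is the negative log of their Bhattacharyya overlap, followed by a classical MDS-style double centering and eigendecomposition that retains negative eigenvalues; the function name `inpca_embedding` and the Bernoulli toy data are illustrative assumptions, not taken from the paper.

```python
# Hedged sketch of an InPCA-style embedding. Assumptions (not stated in the
# abstract): squared pairwise distance = -log(Bhattacharyya coefficient),
# then classical MDS double centering and eigendecomposition.
import numpy as np

def inpca_embedding(P, n_components=2):
    """Embed rows of P (discrete probability distributions over a shared
    sample space) into n_components coordinates.

    P : (n_models, n_outcomes) array whose rows sum to 1.
    Returns (coordinates, eigenvalues) with coordinates of shape
    (n_models, n_components).
    """
    # Bhattacharyya overlap matrix: entries sum_x sqrt(p_i(x) p_j(x)) in (0, 1].
    bc = np.sqrt(P) @ np.sqrt(P).T
    # Assumed "intensive" squared distance: zero on the diagonal, positive elsewhere.
    D2 = -np.log(np.clip(bc, 1e-300, None))

    # Classical MDS double centering: W = -1/2 * J D2 J with J = I - 11^T / n.
    n = D2.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    W = -0.5 * J @ D2 @ J

    # Eigendecompose and keep the largest-magnitude directions; negative
    # eigenvalues are allowed (the embedding space is Minkowski-like).
    vals, vecs = np.linalg.eigh(W)
    order = np.argsort(-np.abs(vals))[:n_components]
    coords = vecs[:, order] * np.sqrt(np.abs(vals[order]))
    return coords, vals[order]

if __name__ == "__main__":
    # Toy usage: a one-parameter family of Bernoulli models, p(heads) from 0.05 to 0.95.
    p = np.linspace(0.05, 0.95, 19)
    P = np.column_stack([p, 1 - p])
    coords, vals = inpca_embedding(P)
    print(coords.shape, vals)
```

In this sketch the toy Bernoulli family traces a one-dimensional curve in the embedding, which is the kind of low-dimensional structure the abstract says InPCA is meant to make visible; the exact normalization of the distance in the zero-replica limit is left as an assumption here.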