Literature DB >> 26761732

Fast Direct Methods for Gaussian Processes.

Sivaram Ambikasaran, Daniel Foreman-Mackey, Leslie Greengard, David W Hogg, Michael O'Neil.   

Abstract

A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the n-dimensional setting, however, it requires the inversion of an n ×n covariance matrix, C, as well as the evaluation of its determinant, det(C). In many cases, such as regression using Gaussian processes, the covariance matrix is of the form C = σ(2) I + K, where K is computed using a specified covariance kernel which depends on the data and additional parameters (hyperparameters). The matrix C is typically dense, causing standard direct methods for inversion and determinant evaluation to require O(n(3)) work. This cost is prohibitive for large-scale modeling. Here, we show that for the most commonly used covariance functions, the matrix C can be hierarchically factored into a product of block low-rank updates of the identity matrix, yielding an O (n log(2) n) algorithm for inversion. More importantly, we show that this factorization enables the evaluation of the determinant det(C), permitting the direct calculation of probabilities in high dimensions under fairly broad assumptions on the kernel defining K. Our fast algorithm brings many problems in marginalization and the adaptation of hyperparameters within practical reach using a single CPU core. The combination of nearly optimal scaling in terms of problem size with high-performance computing resources will permit the modeling of previously intractable problems. We illustrate the performance of the scheme on standard covariance kernels.

Year:  2016        PMID: 26761732     DOI: 10.1109/TPAMI.2015.2448083

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  Spatial Multivariate Trees for Big Data Bayesian Regression.

Authors:  Michele Peruzzi; David B Dunson
Journal:  J Mach Learn Res       Date:  2022       Impact factor: 5.177

2.  Nonparametric spectral methdods for multivariate spatial and spatial-temporal data.

Authors:  Joseph Guinness
Journal:  J Multivar Anal       Date:  2021-09-06       Impact factor: 1.473

3.  TGFβ-induced cytoskeletal remodeling mediates elevation of cell stiffness and invasiveness in NSCLC.

Authors:  E Gladilin; S Ohse; M Boerries; H Busch; C Xu; M Schneider; M Meister; R Eils
Journal:  Sci Rep       Date:  2019-05-21       Impact factor: 4.379

4.  A low-mass planet candidate orbiting Proxima Centauri at a distance of 1.5 AU.

Authors:  Mario Damasso; Fabio Del Sordo; Guillem Anglada-Escudé; Paolo Giacobbe; Alessandro Sozzetti; Alessandro Morbidelli; Grzegorz Pojmanski; Domenico Barbato; R Paul Butler; Hugh R A Jones; Franz-Josef Hambsch; James S Jenkins; María José López-González; Nicolás Morales; Pablo A Peña Rojas; Cristina Rodríguez-López; Eloy Rodríguez; Pedro J Amado; Guillem Anglada; Fabo Feng; Jose F Gómez
Journal:  Sci Adv       Date:  2020-01-15       Impact factor: 14.136

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.