Hypergraph-based anomaly detection of high-dimensional co-occurrences.

Jorge Silva, Rebecca Willett.

Abstract

This paper addresses the problem of detecting anomalous multivariate co-occurrences using a limited number of unlabeled training observations. A novel method based on a hypergraph representation of the data is proposed to deal with this very high-dimensional problem. Hypergraphs are an important extension of graphs in which an edge can connect more than two vertices simultaneously. A variational Expectation-Maximization algorithm is presented that detects anomalies directly in the hypergraph domain, without any feature selection or dimensionality reduction. The resulting estimate can be used to calculate a measure of anomalousness based on the False Discovery Rate. The algorithm has O(np) computational complexity, where n is the number of training observations and p is the number of potential participants in each co-occurrence event. This efficiency makes the method well suited to very high-dimensional settings, and the method requires no tuning, bandwidth, or regularization parameters. The proposed approach is validated on both high-dimensional synthetic data and the Enron email database, where p > 75,000, and is shown to be capable of outperforming other state-of-the-art methods.
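
As an illustration of the hypergraph representation described in the abstract, the following minimal sketch encodes each co-occurrence event as one hyperedge over p potential participants. It is not the authors' variational Expectation-Maximization algorithm; the toy data and the helper names (`build_index`, `to_incidence`, `hyperedge_counts`) are hypothetical and chosen only for this example.

```python
from collections import Counter

# Hypothetical toy data: each observation is one co-occurrence event,
# i.e. the set of participants (vertices) joined by a single hyperedge.
observations = [
    {"alice", "bob", "carol"},
    {"alice", "bob"},
    {"alice", "bob", "carol"},
    {"dave", "erin"},
]


def build_index(events):
    """Map each participant to a column index 0..p-1 (sorted for determinism)."""
    vertices = sorted(set().union(*events))
    return {v: j for j, v in enumerate(vertices)}


def to_incidence(events, index):
    """Encode each hyperedge as a binary vector of length p."""
    rows = []
    for event in events:
        row = [0] * len(index)
        for v in event:
            row[index[v]] = 1
        rows.append(row)
    return rows


def hyperedge_counts(events):
    """Count how often each distinct hyperedge occurs in the training data."""
    return Counter(frozenset(e) for e in events)


index = build_index(observations)            # p = 5 participants
incidence = to_incidence(observations, index)  # n = 4 rows, p = 5 columns
counts = hyperedge_counts(observations)
```

Each row of `incidence` is a point in {0, 1}^p, which is why the problem becomes very high-dimensional when p is large, as in the Enron data with p > 75,000. The raw counts in `counts` only show how rarely a given hyperedge was observed; they are not the False Discovery Rate measure described in the abstract.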

Year:  2009        PMID: 19147882     DOI: 10.1109/TPAMI.2008.232

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  Synergistic Coding by Cortical Neural Ensembles.

Authors:  Mehdi Aghagolzadeh; Seif Eldawlatly; Karim Oweiss
Journal:  IEEE Trans Inf Theory       Date:  2010-02-01       Impact factor: 2.501

2.  Embedding Learning with Events in Heterogeneous Information Networks.

Authors:  Huan Gui; Jialu Liu; Fangbo Tao; Meng Jiang; Brandon Norick; Lance Kaplan; Jiawei Han
Journal:  IEEE Trans Knowl Data Eng       Date:  2017-11       Impact factor: 6.977

3.  Learning High-dimensional Generalized Linear Autoregressive Models.

Authors:  Eric C Hall; Garvesh Raskutti; Rebecca M Willett
Journal:  IEEE Trans Inf Theory       Date:  2018-12-04       Impact factor: 2.501

4.  Individual-specific edge-network analysis for disease prediction.

Authors:  Xiangtian Yu; Jingsong Zhang; Shaoyan Sun; Xin Zhou; Tao Zeng; Luonan Chen
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

