| Literature DB >> 35295683 |
Pratik Jawahar1, Thea Aarrestad1, Nadezda Chernyavskaya1, Maurizio Pierini1, Kinga A Wozniak1,2, Jennifer Ngadiuba3,4, Javier Duarte5, Steven Tsan5.
Abstract
We investigate how to improve new physics detection strategies exploiting variational autoencoders and normalizing flows for anomaly detection at the Large Hadron Collider. As a working example, we consider the DarkMachines challenge dataset. We show how different design choices (e.g., event representations, anomaly score definitions, network architectures) affect the result on specific benchmark new physics models. Once a baseline is established, we discuss how to improve the anomaly detection accuracy by exploiting normalizing flow layers in the latent space of the variational autoencoder.Entities:
Keywords: Large Hadron Collider (LHC); anomaly detection (AD); convolutional neural net; graph convolutional network (GCN); new physics beyond standard model; normalizing flow (NF); variational auto encoder (VAE)
Year: 2022 PMID: 35295683 PMCID: PMC8919050 DOI: 10.3389/fdata.2022.803685
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
Summary of the available dataset size.
|
|
|
|
|
|
|---|---|---|---|---|
| Training | 193, 800 | 13, 425 | 238, 450 | 7, 100, 934 |
| Validation | 10, 200 | 707 | 12, 550 | 373, 733 |
| Bkg. test | 10, 000 | 5, 868 | 89, 000 | 1, 025, 333 |
| Sig. test | 38, 666 | 5, 868 | 89, 676 | 1, 023, 320 |
BSM processes contributing to the signal dataset in each channel.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Z′ + jet | × | × | × | ||
| Z′+ W/Z | × | ||||
| Z′ + t | × | × | |||
| Z′ in LFV U(1)Lμ−Lτ | × | × | |||
| × | × | ||||
| / | × | × | × | ||
| / | × | × | |||
| SUSY | × | × | × | × | |
| × | × | × | × | ||
| SUSY | × | × | |||
| SUSY | × | × | |||
| SUSY | × | × | |||
| × | × | ||||
| SUSY | × | ||||
| × | |||||
| × |
The process code, adopted in this study, is taken from Aarrestad et al. (.
Figure 1Anomaly detection performance for the Conv-VAE with different inputs given (see text for more details): all physics objects in the event (AllObj); truncated input object list (TrdObj); all objects and array of object multiplicity (AllObj+Mult); truncated input object list and array of object multiplicity (TrdObj+Mult).
Figure 2Comparison of the GCN-VAE and Conv-VAE performances, in terms of the benchmark figures of merit adopted in the article.
Figure 3Comparison of the GCN-VAE performance with and without high-level features added as a separate input.
Figure 4Comparison of anomaly detection performance from different anomaly score definitions, applied to the GCN-VAE.
Figure 5ROC curves for the baseline GCN-VAE model in channel 1 (top left), channel 2a (top right), channel 2b (bottom left), and channel 3 (bottom right), computed from the ϵS and ϵB values obtained on the background sample and the benchmark signal samples. Most of the ROC curves are not smooth, due to the small dataset size for some of the channels.
Figure 6Comparison of anomaly detection performance for GCN-VAE models with different normalizing flow architectures in the latent space.
Figure 7ROC curves of GCN-VAE_HouseholderSNF for all signals in each of channel 1 (top left), channel 2a (top right), channel 2b (bottom left), and channel 3 (bottom right).