| Literature DB >> 36212113 |
Marco Letizia1,2, Gianvito Losapio1, Marco Rando1, Gaia Grosso3,4,5, Andrea Wulzer3, Maurizio Pierini5, Marco Zanetti3,4, Lorenzo Rosasco1,6,7.
Abstract
We present a machine learning approach for model-independent new physics searches. The corresponding algorithm is powered by recent large-scale implementations of kernel methods, nonparametric learning algorithms that can approximate any continuous function given enough data. Based on the original proposal by D'Agnolo and Wulzer (Phys Rev D 99(1):015014, 2019, arXiv:1806.02350 [hep-ph]), the model evaluates the compatibility between experimental data and a reference model, by implementing a hypothesis testing procedure based on the likelihood ratio. Model-independence is enforced by avoiding any prior assumption about the presence or shape of new physics components in the measurements. We show that our approach has dramatic advantages compared to neural network implementations in terms of training times and computational resources, while maintaining comparable performances. In particular, we conduct our tests on higher dimensional datasets, a step forward with respect to previous studies.Entities:
Year: 2022 PMID: 36212113 PMCID: PMC9534824 DOI: 10.1140/epjc/s10052-022-10830-y
Source DB: PubMed Journal: Eur Phys J C Part Fields ISSN: 1434-6044 Impact factor: 4.991
Average training times per single run with standard deviations (low level features and reference toys). Note that time measured in hours (for NN) and seconds (for Falkon)
| Model | DIMUON | SUSY | HIGGS |
|---|---|---|---|
| FLK | ( | ( | ( |
| NN | (4.23 ± 0.73) h | (73.1 ± 10) h | (112 ± 9) h |
Bold values indicate the lowest for each column (lower is better)
Fig. 10a Average test statistics as a function of the number of Nyström centers. b Average training time as a function of the number of Nyström centers
Fig. 14Observed significance at varying kernel bandwidth
Fig. 1Distribution of the test statistics under the null and alternative hypotheses for the DIMUON (left) and HIGGS (right) datasets
Fig. 2Observed significance against estimated ideal significance with low-level input features
Fig. 3Probability of finding a evidence for new physics as a function of the ideal significance
Fig. 4Comparison of the observed significance obtained with Falkon using low level features only and all the features
Fig. 5Observed significance with the Falkon implementation against neural networks
Fig. 6Examples of reconstructed density ratios as a functions of high-level features (not given as inputs) for the DIMUON (left) and SUSY (right) datasets with new physics components in the data. Note that the SUSY dataset is normalized
Fig. 7Observed significance as a function of the size of the reference sample (left). Example of distribution of the test statistics given a small reference sample (right)
Specifications of the machine used to perform the experiments with Falkon
| OS | Ubuntu 18.04.1 |
| CPU(s) | |
| RAM | 256GB |
| GPU(s) | |
| CUDA version | 10.2 |