
Direct optimization of the discovery significance in machine learning for new physics searches in particle colliders
Author(s) -
A. Elwood,
D. Krücker,
M. Shchedrolosiev
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1525/1/012110
Subject(s) - algorithm , gradient boosting , machine learning , large hadron collider , artificial intelligence , computer science , decision tree , physics , particle physics , random forest
We introduce two new loss functions designed to directly optimize the statistical significance of the expected number of signal events when training neural networks and decision trees to classify events as signal or background. The loss functions are designed to directly maximize commonly used estimates of the statistical significance, s / s + b , and the so-called Asimov estimate, Z a . We consider their use in a toy search for Supersymmetric particles with 30 fb −1 of 14 TeV data collected at the LHC. In the case that the search for this model is dominated by systematic uncertainties, it is found that the loss function based on Z a can outperform the binary cross entropy in defining an optimal search region. The same approach is applied to a boosted decision tree by modifying the objective function used in gradient tree boosting.