Higgs Boson Discovery using Machine Learning Methods with Pyspark
Author(s) -
Mourad Azhari,
Abdallah Abarda,
Badia Ettaki,
Jamal Zerouaoui,
Mohamed Dakkon
Publication year - 2020
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2020.03.053
Subject(s) - higgs boson , particle physics , computer science , decision tree , artificial intelligence , boson , random forest , machine learning , alternating decision tree , tree (set theory) , logistic regression , support vector machine , physics , mathematics , decision tree learning , combinatorics , incremental decision tree
Higgs Boson is an elementary particle that gives the mass to everything in the natural world. The discovery of the Higgs Boson is a major challenge for particle physics. This paper proposes to solve the Higgs Boson Classification Problem with four Machine Learning (ML) Methods, using the Pyspark environment: Logistic Regression (LR), Decision Tree (DT), Random Forest (RF) and Gradient Boosted Tree (GBT). We compare the accuracy and AUC metrics of those ML Methods. We use a large dataset as Higgs Boson, collected from public site UCI and Higgs dataset downloaded from Kaggle site, in the experimentation stage.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom