Penanganan Data Tidak Seimbang pada Pemodelan Rotation Forest Keberhasilan Studi Mahasiswa Program Magister IPB
Author(s) -
Junjun Wijaya,
Agus Mohamad Soleh,
Akbar Rizki
Publication year - 2018
Publication title -
xplore journal of statistics
Language(s) - English
Resource type - Journals
eISSN - 2655-2744
pISSN - 2302-5751
DOI - 10.29244/xplore.v2i2.99
Subject(s) - oversampling , rotation (mathematics) , value (mathematics) , computer science , sensitivity (control systems) , artificial intelligence , receiver operating characteristic , random forest , pattern recognition (psychology) , algorithm , machine learning , engineering , electronic engineering , telecommunications , bandwidth (computing)
Graduate school of Bogor Agricultural University (SPs-IPB) stated that not all students of IPB master program successfully complete their studies. This becomes an evaluation for IPB to be more selective in choosing students in the future. This study aims to model the success classification of IPB master students in 2011 to 2015. The classification method used is rotation forest. The percentage of students who graduated is very large compared to those who did not pass, this can cause the evaluation value different. SMOTE (Synthetic Minority Oversampling Technique) is one of method to handle such unbalanced data by generating artificial data. The ROC (Receiver Operating Characteristic) curve is built to see the optimum cut off value. There are two classification models, they are rotation forest models before and after handled by SMOTE. The comparison results show that the rotation forest model after SMOTE with cut off value 0.6 is the best model. This model can increase the sensitivity value more than 50% although the accuracy and specificity value decreased compared to the modeling before SMOTE.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom