
The Classification Status of River Water Quality in Riau Province Using Modified K-Nearest Neighbor Algorithm with STORET Modeling and Water Pollution Index
Author(s) -
Mustakim Mustakim,
Rosdina,
Dian Ramadhani,
M. Afdal,
Medyantiwi Rahmawita
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1783/1/012020
Subject(s) - pollution , water quality , k nearest neighbors algorithm , river pollution , pollutant , division (mathematics) , environmental science , quality (philosophy) , water pollution , water resources , computer science , hydrology (agriculture) , data mining , water resource management , mathematics , machine learning , engineering , ecology , philosophy , chemistry , arithmetic , organic chemistry , epistemology , geotechnical engineering , environmental chemistry , biology
The Department of Environment and Forestry, Pollution and Environmental Damage Control Division, has an active role in monitoring water quality in Riau Province. The rivers that are still monitored and managed are Kampar River, Siak River and Indragiri River. Division of Environment Pollution calculates river quality status manually using Microsoft Excel, this is not maximally done since this important information should be processed quickly. Division of water pollution must determine the right calculation to get the results of the water quality status. Because of many calculation formulas set by the government, the commonly used method is the STORET method and the Pollution Index. So, in overcoming the problem of classification, the researcher proposes the use of learning methods that can predict or determine the status of water quality with classification techniques on data mining that is Modified K-Nearest Neighbor (MKNN) which is a modification of K-NN. The calculation of the MKNN algorithm produced the highest accuracy of 85.10% at K = 5 using STORET result data as training data. While, using the Pollution Index data results, the highest accuracy is 76.92% at K = 1. Based on the analysis with attribute analysis, the attributes that influence the determination of river water quality are BOD, COD, NH3, Fecal Coli and Total Coli. This result can be taken into consideration by the Division of Environmental Pollution in the process of overcoming and reducing pollutant overload that exceeds quality standards.