
Applying Deep Learning and Machine Learning Algorithms to Estimate PM $_{2.5}$ Concentration Using Satellite Data and Meteorological Data
Author(s) -
Ishwor Thapa,
Bidur Devkota,
Badri Raj Lamichhane,
Bhawana Poudel Devkota,
Raju Dhakal,
Teerayut Horat
Publication year - 2025
Publication title -
ieee journal of selected topics in applied earth observations and remote sensing
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 1.246
H-Index - 88
eISSN - 2151-1535
pISSN - 1939-1404
DOI - 10.1109/jstars.2025.3620408
Subject(s) - geoscience , signal processing and analysis , power, energy and industry applications
Air pollution, particularly fine particulate matter (PM $_{2.5}$ ), poses significant health risks and environmental challenges worldwide. Therefore, it is essential to monitor air pollution to effectively act against it. In this study, PM $_{2.5}$ levels were estimated using meteorological data and Sentinel-5P air pollution data through machine learning algorithms. The meteorological data utilized included air temperature, relative humidity (RH), wind speed (WS), and Sentinel-5P data. Three Air Quality Monitoring (AQM) stations in Kathmandu, Nepal, were selected as the study area. The effectiveness of several machine learning methods, such as K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), Random Forest (RF), ensemble methods, and hybrid methods, were evaluated. Both RF and XGBoost consistently outperformed SVM and KNN regarding PM $_{2.5}$ estimation accuracy. Among all the methods studied, XGBoost achieved the highest R 2 ; value of 0.8284 and the lowest RMSE value of 11.0024 using only the Sentinel-5P dataset. The addition of meteorological data further improved the model's performance. After including meteorological data with the Sentinel-5P data, the stacking ensemble demonstrated a maximum R 2 ; score of 0.8324 and a minimum RMSE score of 10.8747. Hence, this study demonstrated that utilizing advanced technologies such as machine learning (ML), deep learning (DL), and novel datasets obtained from satellites can accurately estimate PM $_{2.5}$ levels. This approach can significantly aid in monitoring and controlling air pollution by providing precise and timely information on air quality. These findings have significant implications for stakeholders such as policymakers and urban planners, as integrating noble technologies and datasets like machine learning with satellite and meteorological data can lead to more effective air quality management strategies. By availing low-cost solutions for accurate and timely air pollution estimates, this approach can support informed decision-making, reduce public exposure to pollutants, and improve general public health.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom