Random Forest Algorithm Implementation for Air Quality Classification in DKI Jakarta Based on ISPU

Authors

  • Khairanisa Salsabila Universitas Negeri Padang
  • Tessy Octavia Mukhti Universitas Negeri Padang

DOI:

https://doi.org/10.24036/ujsds/vol4-iss2/474

Keywords:

Air Quality, ISPU, Random Forest, DKI Jakarta

Abstract

Air quality is an essential factor that has a direct impact on human health. High concentrations of air pollutants have the potential to cause various health impacts, across short-term and long-term horizons. This study aims to classify air quality in DKI Jakarta using the Air Pollution Standard Index (ISPU) data via the random forest algorithm. The dataset covers a timeframe from 2021 to 2025 and includes air pollutant parameters, namely PM10 and PM2.5 particulate matter, carbon monoxide (CO), nitrogen dioxide (NO2), sulfur dioxide (SO2), dan ozone (O3). The research method employs a supervised learning approach, in which the data are stratified and evakuated through the implementation of K-Fold Cross Validation (k = 10) to ensure objective and stable model performance. Model performance was measured using Accuracy, Precision, Recall, and F1-Score metrics, along with Confusion Matrix and Feature Importance analyses. It can be seen from the results that the Random Forest model can classify air quality categories with excellent performance, reaching 100% Accuracy on training data and 98.44% on testing data. The Confusion Matrix analysis indicates that most data in each air quality are correctly classified. Furthermore, the Feature Importance analysis reveals PM2.5 that is most influential parameter in determining air quality categories. Therefore, this study indicates that the Random Forest algorithm proves effective for air quality classificati and can function as a decision-support tool for air pollution control and management in DKI Jakarta.

Published

2026-05-31

How to Cite

Salsabila, K., & Tessy Octavia Mukhti. (2026). Random Forest Algorithm Implementation for Air Quality Classification in DKI Jakarta Based on ISPU . UNP Journal of Statistics and Data Science, 4(2), 141–149. https://doi.org/10.24036/ujsds/vol4-iss2/474

Most read articles by the same author(s)

1 2 3 4 > >>