Classification of Poor Households in Padang City Using the Naïve Bayes Algorithm with Synthetic Minority Oversampling Technique

Authors

  • anice kartika unp
  • Dina Fitria Universitas Negeri Padang
  • Syafriandi Syafriandi Universitas Negeri Padang
  • Tessy Octavia Mukhti Universitas Negeri Padang

DOI:

https://doi.org/10.24036/ujsds/vol2-iss4/241

Keywords:

Imbalance data, Naïve Bayes, Poor Hauseholds, Synthetic Minority Oversampling Technique

Abstract

Poverty is a condition where a person is unable to meet minimum basic needs or a condition caused by the influence of development policies that have not been able to reach all levels of society. In Indonesia, the government has designed various programs to overcome poverty, but these programs are often not on target. One method to improve the effectiveness of the program is through proper classification of poor and non-poor households. This study uses the Naïve Bayes classification method which is popular in data mining to predict data categories based on the probability distribution of its features. However, challenges arise when the data is unbalanced between different classes. To overcome this, the Synthetic Minority Oversampling Technique (SMOTE) method is used to balance the data. Based on the analysis that has been carried out To determine the performance of Naïve Bayes using SMOTE and without SMOTE in classifying poor households in Padang City in 2023, classification using the Naïve Bayes method without SMOTE produced an accuracy value of 98%, precision of 0%, and recall of 0%. Meanwhile, the classification using the Naïve Bayes method with SMOTE produces an accuracy value of 90%, precision of 87%, and recall of 92% and the results of the criteria for poor households in Padang City in 2023 using Naïve Bayes can be seen from the results that the probability of poor households is much greater than that of non-poor households, therefore the data is classified as  group of households that are classified as poor.

Published

2024-11-28

How to Cite

kartika, anice, Dina Fitria, Syafriandi Syafriandi, & Tessy Octavia Mukhti. (2024). Classification of Poor Households in Padang City Using the Naïve Bayes Algorithm with Synthetic Minority Oversampling Technique. UNP Journal of Statistics and Data Science, 2(4), 446–452. https://doi.org/10.24036/ujsds/vol2-iss4/241

Most read articles by the same author(s)

1 2 3 4 5 6 > >>