Classification of Poor Households in Padang City Using the Naïve Bayes Algorithm with Synthetic Minority Oversampling Technique
DOI:
https://doi.org/10.24036/ujsds/vol2-iss4/241Keywords:
Imbalance data, Naïve Bayes, Poor Hauseholds, Synthetic Minority Oversampling TechniqueAbstract
Poverty is a condition where a person is unable to meet minimum basic needs or a condition caused by the influence of development policies that have not been able to reach all levels of society. In Indonesia, the government has designed various programs to overcome poverty, but these programs are often not on target. One method to improve the effectiveness of the program is through proper classification of poor and non-poor households. This study uses the Naïve Bayes classification method which is popular in data mining to predict data categories based on the probability distribution of its features. However, challenges arise when the data is unbalanced between different classes. To overcome this, the Synthetic Minority Oversampling Technique (SMOTE) method is used to balance the data. Based on the analysis that has been carried out To determine the performance of Naïve Bayes using SMOTE and without SMOTE in classifying poor households in Padang City in 2023, classification using the Naïve Bayes method without SMOTE produced an accuracy value of 98%, precision of 0%, and recall of 0%. Meanwhile, the classification using the Naïve Bayes method with SMOTE produces an accuracy value of 90%, precision of 87%, and recall of 92% and the results of the criteria for poor households in Padang City in 2023 using Naïve Bayes can be seen from the results that the probability of poor households is much greater than that of non-poor households, therefore the data is classified as group of households that are classified as poor.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 anice kartika, Dina Fitria, Syafriandi Syafriandi, Tessy Octavia Mukhti
This work is licensed under a Creative Commons Attribution 4.0 International License.