ANALISIS KOMPARASI ALGORITMA RANDOM FOREST, XGBOOST, DAN MULTILAYER PERCEPTRON (MLP) DALAM PREDIKSI RISIKO GAGAL BAYAR KREDIT

Authors

  • D. Febry Wulangsih Universitas Pembangunan Nasional “Veteran” Jawa Timur Author
  • Keysya Alifia Zabina Universitas Pembangunan Nasional “Veteran” Jawa Timur Author

DOI:

https://doi.org/10.5281/zenodo.20742401

Keywords:

Credit Risk, Random Forest, XGBoost, Multilayer Perceptron, Class Imbalance

Abstract

Credit risk prediction is a critical task for financial institutions in identifying customers at risk of default. This study compares the performance of three machine learning and deep learning algorithms as in Multilayer Perceptron (MLP), Random Forest, and XGBoost in predicting credit card defaults using the “Default of Credit Card Clients” dataset from the UCI Machine Learning Repository. The dataset consists of 30,000 records with 23 features covering demographic information, payment history, bills, and payment amounts over a six-month period. Class imbalance was addressed using the Synthetic Minority Oversampling Technique (SMOTE), which was applied only to the training data. Model performance was evaluated using accuracy, precision, recall, F1-score, and AUC-ROC metrics. The experimental results show that Random Forest achieved the best overall performance with an F1-score of 0.5275 and an AUC-ROC of 0.768, outperforming MLP (F1-score 0.5182, AUC-ROC 0.7610) and XGBoost (F1-score 0.5080, AUC-ROC 0.7616%). These findings indicate that ensemble-based methods remain competitive compared to deep learning approaches for tabular credit data, and provide valuable insights for financial institutions in implementing data-driven risk management.

Downloads

Download data is not yet available.

References

Brown, K., Smith, J., & Davis, L. (2023). Integrated ensemble-based machine learning with SMOTE for financial default analysis. Expert Systems with Applications, 215, Article 119842.

Hassan, M. A., Shukur, Z., & Al-Amrani, A. (2023). Optimizer and architectural impact analysis on multilayer perceptron performance for structural risk tasks. IEEE Access, 11, 74210-74225.

Sun, T., Zhao, X., & Liu, Y. (2025). Hybrid clustering-multilayer perceptron architecture for credit evaluation over high-dimensional datasets. Expert Systems with Applications, 260, Article 125110.

Tetteh, J. A., Mensah, E. O., & Asante, K. (2024). Multi-layer perceptron networks for loan default classification in imbalanced environments. Applied Soft Computing, 152, Article 111234.

Yeh, I. C., & Lien, C. H. (2009). Default of credit card clients [Dataset]. UCI Machine Learning Repository. https://archive.ics.uci.edu/dataset/186/default+of+credit+card+clients

Downloads

Published

2026-06-18