Possibility of improving credit scoring systems: Using machine learning models

Authors

DOI:

https://doi.org/10.53468/ijsshr-miyr.v5i3.10

Keywords:

Machine learning, Credit risk, Credit scoring, Financial inclusion, XGBoost, Non-bank financial institution, Explainable AI

Abstract

This study investigates the potential for improving credit scoring systems in non-bank financial institutions (NBFIs) by applying advanced machine learning techniques. The primary objective is to develop predictive models that more accurately assess credit risk compared to traditional statistical methods. Using real-world, anonymized loan data from a Mongolian NBFI, the study applies and compares three ensemble learning algorithms: Random Forest, XGBoost, and LightGBM. Key variables such as loan amount, repayment history, age, interest rate, and loan term were used to train and validate the models. Among the tested models, XGBoost demonstrated the highest predictive accuracy, with 93% classification performance and strong model robustness as indicated by ROC-AUC and error metrics. The findings highlight that machine learning models, particularly XGBoost, outperform conventional approaches in terms of both accuracy and practical applicability. Moreover, the integration of explainable AI techniques enhances the transparency of the credit scoring process. This research contributes to the localization of modern credit scoring tools in emerging markets and provides a scalable solution to improve financial inclusion and risk-based decision-making in NBFIs

References

Xu, T. (2024). Comparative analysis of machine learning algorithms for consumer credit risk assessment. Transactions on Computer Science and Intelligent Systems Research, 4. doi.org/10.62051/r1m3pg16

Golbayani, P., Florescu, I., & Chatterjee, R. (2020). A comparative study of forecasting corporate credit ratings using neural networks, support vector machines, and decision trees. arXiv. doi.org/10.48550/arXiv.2007.06617

Lundberg, S., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30. doi.org/10.48550/arXiv.1705.07874

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135–1144. doi.org/10.1145/2939672.2939778

Jagtiani, J. A., & Lemieux, C. (2019). The roles of alternative data and machine learning in fintech lending: Evidence from the LendingClub consumer platform. Financial Management, 48(4), 1009–1029. doi.org/10.1111/fima.12295

Mestiri, S. (2024). Credit scoring using machine learning and deep learning based models. Data Science in Finance and Economics, 4(2), 236–248. doi.org/10.3934/DSFE.2024009

Gambacorta, L., Huang, Y., Qiu, H., & Wang, J. (2024). How do machine learning and non-traditional data affect credit scoring? New evidence from a Chinese fintech firm. Journal of Financial Stability, 73, 101284. doi.org/10.1016/j.jfs.2024.101284

Pathipati, P., & Kumar, S. M. (2024). Credit card approval prediction using XGBoost compared with Random Forest. Risks, 12(11), 174. doi.org/10.3390/risks12110174

Capuano, F., & Vento, M. (2024). A novel profit-driven framework for model evaluation in credit scoring. Engineering Applications of Artificial Intelligence, 112, 109137. doi.org/10.1016/j.engappai.2024.109137

Valdrighi, G., Ribeiro, A. M., Pereira, J. S. B., et al. (2024). Best practices for responsible machine learning in credit scoring. arXiv. doi.org/10.48550/arXiv.2409.20536

Li, Y., Zhao, R., & Sha, M. (2024). A hybrid credit risk evaluation model based on three-way decisions and stacking ensemble approach. Computational Economics. doi.org/10.1007/s10614-024-10747-6

Zhou, Z.-H. (2025). Ensemble methods: Foundations and algorithms (2nd ed.). CRC Press. doi.org/10.1007/978-981-15-1967-3

Rotari, M., & Kulahci, M. (2023). Variable selection wrapper in presence of correlated input variables: Boruta algorithm. Quality and Reliability Engineering International. doi.org/10.1002/qre.3398

Gür, Y., Toğaçar, M., & Solak, B. (2025). Integration of CNN models and machine learning methods in credit score classification. Computational Economics. doi.org/10.1007/s10614-025-10893-5

Sodnomdavaa, T., Sodnomdavaa, T., & Amgalanbat, N. (2025). Possibility of predicting inflation: Using machine learning model. International Journal of Social Science and Humanities Research-MIYR, 5(2), 49–64. doi.org/10.53468/mifyr.2025.05.02.49

Downloads

Published

2025-10-31

How to Cite

Sodnomdavaa, T., Sodnomdavaa, T., & Tsogzol, Z. (2025). Possibility of improving credit scoring systems: Using machine learning models. International Journal of Social Science and Humanities Research-MIYR, 5(3), 43–53. https://doi.org/10.53468/ijsshr-miyr.v5i3.10