Submitted:
17 March 2025
Posted:
17 March 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
3. Data Introduction
4. Model Introduction
5. Model Results Analysis
| Model | Mean F1 Score | Mean Precision | Mean Recall |
|---|---|---|---|
| Catboost | 0.9161 | 0.9338 | 0.8991 |
| XGBoost | 0.8926 | 0.8925 | 0.8928 |
| LGBM | 0.8812 | 0.8603 | 0.9032 |
| Category | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| 0 | 0.9995 | 0.9996 | 0.9995 | 257748 |
| 1 | 0.9319 | 0.9114 | 0.9215 | 1501 |
| Accuracy | 0.9991 | 0.9991 | 0.9991 | 0.9991 |
| Macro Avg | 0.9657 | 0.9555 | 0.9605 | 259249 |
| Weighted Avg | 0.9991 | 0.9991 | 0.9991 | 259249 |


6. Conclusions
References
- Ali, A.; Abd Razak, S.; Othman, S.H.; et al. Financial fraud detection based on machine learning: a systematic literature review. Applied Sciences 2022, 12, 9637. [Google Scholar]
- Hernandez Aros, L.; Bustamante Molano, L.X.; Gutierrez-Portela, F.; et al. Financial fraud detection through the application of machine learning techniques: a literature review. Humanities and Social Sciences Communications 2024, 11, 1–22. [Google Scholar]
- Alsuwailem AA, S.; Salem, E.; Saudagar, A.K.J. Performance of different machine learning algorithms in detecting financial fraud. Computational Economics 2023, 62, 1631–1667. [Google Scholar]
- Obeng, S.; Iyelolu, T.V.; Akinsulire, A.A.; et al. Utilizing machine learning algorithms to prevent financial fraud and ensure transaction security. World Journal of Advanced Research and Reviews 2024, 23, 1972–1980. [Google Scholar]
- Ashtiani, M.N.; Raahemi, B. Intelligent fraud detection in financial statements using machine learning and data mining: a systematic literature review. Ieee Access, 7250. [Google Scholar]
- Whiting, D.G.; Hansen, J.V.; McDonald, J.B.; et al. Machine learning methods for detecting patterns of management fraud. Computational Intelligence 2012, 28, 505–527. [Google Scholar]
- Song, X.P.; Hu, Z.H.; Du, J.G.; et al. Application of machine learning methods to risk assessment of financial statement fraud: evidence from China. Journal of Forecasting 2014, 33, 611–626. [Google Scholar]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; et al. CatBoost: unbiased boosting with categorical features. Advances in neural information processing systems 2018, 31. [Google Scholar]
- Hancock, J.T.; Khoshgoftaar, T.M. CatBoost for big data: an interdisciplinary review. Journal of big data 2020, 7, 94. [Google Scholar] [PubMed]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system[C]//Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016: 785-794.
- Ke, G.; Meng, Q.; Finley, T.; et al. Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems 2017, 30. [Google Scholar]
- Wang, D.; Li, L.; Zhao, D. Corporate finance risk prediction based on LightGBM. Information Sciences 2022, 602, 259–268. [Google Scholar] [CrossRef]



| variable | description |
| trans_date_trans_time | Timestamp of the transaction. |
| cc_num | Credit card number (hashed or anonymized). |
| merchant | Merchant or store where the transaction occurred. |
| category | Type of transaction (e.g., grocery, entertainment). |
| amt | Amount of the transaction. |
| gender | Gender of the cardholder. |
| City/state/zip | Address details of the cardholder. |
| Lat/long | Geographical coordinates of the transaction. |
| city_pop | Population of the city where the transaction occurred. |
| job | Occupation of the cardholder. |
| dob | Date of birth of the cardholder. |
| trans_num | Unique transaction number. |
| unix_time | Unix timestamp of the transaction. |
| merch_lat/merch_long | Geographical coordinates of the merchant. |
| is_fraud | Indicator of whether the transaction is fraudulent. |
| merch_zipcode | Geographical coordinates of the merchant. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).