Version 1: Received: 20 April 2024 / Approved: 22 April 2024 / Online: 23 April 2024 (11:50:51 CEST)
How to cite:
PRASAD, M.; SRIKANTH, T. Multi-Entity Real-Time Fraud Detection System using Machine Learning: Improving Fraud Detection Efficiency using FROST-Enhanced Oversampling. Preprints 2024, 2024041461. https://doi.org/10.20944/preprints202404.1461.v1
APA Style
PRASAD, M., & SRIKANTH, T. (2024). Multi-Entity Real-Time Fraud Detection System using Machine Learning: Improving Fraud Detection Efficiency using FROST-Enhanced Oversampling. Preprints. https://doi.org/10.20944/preprints202404.1461.v1
Chicago/Turabian Style
PRASAD, M., and T. SRIKANTH. 2024. "Multi-Entity Real-Time Fraud Detection System using Machine Learning: Improving Fraud Detection Efficiency using FROST-Enhanced Oversampling." Preprints. https://doi.org/10.20944/preprints202404.1461.v1
Abstract
Fraudulent transactions pose a significant threat to financial institutions and e-commerce platforms. Machine learning models, trained on historical labeled data (fraudulent vs. legitimate transactions), are often employed to identify and prevent fraud. However, real-world datasets frequently exhibit class imbalance, where fraudulent transactions (minority class) are significantly outnumbered by legitimate transactions (majority class). This imbalance can degrade model performance: a classifier tends to favour the majority class and under-detect fraud. This paper proposes a novel approach to address class imbalance and improve fraud detection accuracy. We explore the implementation of FROST (Feature space RObust Synthetic saTuration) oversampling, a technique specifically designed to generate synthetic samples for the minority class. The FROST function leverages the k-nearest neighbors (KNN) algorithm and a user-defined amplification factor (m) to create synthetic data points that closely resemble existing minority class instances. We integrate the FROST-enhanced oversampling technique into the machine learning pipeline for fraud detection. The paper evaluates the effectiveness of this approach compared to traditional oversampling methods and analyzes its impact on classification accuracy metrics.
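The abstract does not spell out the FROST procedure, so the following is only a minimal sketch of generic k-nearest-neighbour oversampling in the SMOTE style, not the authors' FROST implementation. The function name `knn_oversample`, the linear interpolation step, the toy data, and the reading of the amplification factor `m` as "synthetic points generated per minority sample" are all assumptions made for illustration.

```python
import math
import random


def knn_oversample(minority, k=3, m=2, seed=0):
    """Return m synthetic points per minority sample (SMOTE-style interpolation).

    Hypothetical helper, not the paper's FROST function.
    minority: list of equal-length feature vectors (lists of floats).
    k: number of nearest minority neighbours considered for each sample.
    m: assumed amplification factor (synthetic points per sample).
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for i, x in enumerate(minority):
        # Rank the other minority samples by Euclidean distance to x.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(x, p))[:k]
        for _ in range(m):
            nb = rng.choice(neighbours)
            t = rng.random()  # interpolation weight in [0, 1)
            # New point lies on the segment between x and the chosen neighbour.
            synthetic.append([a + t * (b - a) for a, b in zip(x, nb)])
    return synthetic


# Toy minority class: four "fraud" samples with two features each (made-up values).
fraud = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
new_points = knn_oversample(fraud, k=2, m=3)
print(len(new_points))  # 4 samples * m=3 -> 12 synthetic points
```

Because each synthetic point is a convex combination of two existing minority samples, it stays inside the region already occupied by the minority class, which is one plausible reading of "closely resemble existing minority class instances".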
Computer Science and Mathematics, Computer Science
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.