Handling Class Imbalance in Credit Card Fraud Using Various Sampling Techniques

Authors

  • Rabiul Alam Bhuiyan Dept. of Computer Science and Engineering Jashore University of Science and Technology Jashore, Bangladesh
  • Mst. Shimu Khatun Dept. of Computer Science and Engineering Jashore University of Science and Technology Jashore, Bangladesh
  • Md. Taslim Dept. of Computer Science and Engineering Jashore University of Science and Technology Jashore, Bangladesh
  • Md.Alam Hossain Dept. of Computer Science and Engineering Jashore University of Science and Technology Jashore, Bangladesh

DOI:

https://doi.org/10.54536/ajmri.v1i4.633

Keywords:

Credit Card Fraud, Class Imbalance, Under- Sampling, Over-Sampling, SMOTE, Adaptive Synthetic

Abstract

Over the last few decades, credit card fraud (CCF) has been a severe problem for both cardholders and card providers. Credit card transactions are fast expanding as internet technology advances, significantly relying on the internet. With advanced technology and increased credit card usage, fraud rates are becoming a problem for the economy. However, the credit card dataset is highly imbalanced and skewed. Many classification techniques are used to classify fraud and non-fraud but in a certain condition, they may not generate the best results. Different types of sampling techniques such as under-over sampling, Synthetic Minority Oversampling, and Adaptive synthetic techniques have been used to overcome the class imbalance problem in the credit card dataset. Then, the sampled datasets are classified using different machine learning techniques like Decision Tree, Random Forest, K-Nearest Neighbors, Logistic Regression, and Naive Bayes. Recall, F1- score, accuracy, precision, and error rate used to evaluate the model performance. The Logistic Regression model achieved the highest result with 99.94% after under sampling techniques and Random Forest model achieved the highest result with 99.964% after over sampling techniques.

Downloads

Download data is not yet available.

References

Asha, R. B., & KR, S. K. (2021). Credit card fraud detection using artificial neural network. Global Transitions Proceedings, 2(1), 35-41. https://doi.org/10.1016/j.gltp.2021.01.006

Awoyemi, J. O., Adetunmbi, A. O., & Oluwadare, S. A. (2017, October). Credit card fraud detection using machine learning techniques: A comparative analysis. In 2017 international conference on computing networking and informatics (ICCNI), 1-9.

Ali, A., Shamsuddin, S. M., & Ralescu, A. L. (2013). Classification with class imbalance problem. Int. J. Advance Soft Compu. Appl, 5(3).

Azhan, M., & Meraj, S. (2020, December). Credit card fraud detection using machine learning and deep learning techniques. In 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS), 514-518.

Alam, T. M., Shaukat, K., Hameed, I. A., Luo, S., Sarwar, M. U., Shabbir, S., ... & Khushi, M. (2020). An investigation of credit card default prediction in the imbalanced datasets. IEEE Access, 8, 201173-201198

Bansal, A., & Garg, H. (2021, April). An Efficient Techniques for Fraudulent detection in Credit Card Dataset: A Comprehensive study. In IOP Conference Series: Materials Science and Engineering, IOP Publishin, 1116(1), 012181).

Dighe, D., Patil, S., & Kokate, S. (2018, August). Detection of credit card fraud transactions using machine learning algorithms and neural networks: A comparative study. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), 1-6.

Hordri, N. F., Yuhaniz, S. S., Azmi, N. F. M., & Shamsuddin, S. M. (2018). Handling class imbalance in credit card fraud using resampling methods. Int. J. Adv. Comput. Sci. Appl, 9(11), 390-396.

Harshit Lamba,(2022). Credit Card Fraud Detection In Real Time.

Hordri, N. F., Yuhaniz, S. S., Azmi, N. F. M., & Shamsuddin, S. M. (2018). Handling class imbalance in credit card fraud using resampling methods. Int. J. Adv. Comput. Sci. Appl, 9(11), 390-396.

Itoo, F., & Singh, S. (2021). Comparison and analysis of logistic regression, Naïve Bayes and KNN machine learning algorithms for credit card fraud detection. International Journal of Information Technology, 13(4), 1503-1511. https://doi.org/10.1007/s41870-020-00430-y

Khatri, S., Arora, A., & Agrawal, A. P. (2020, January). Supervised machine learning algorithms for credit card fraud detection: a comparison. In 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence), 680-683.

Kaggle.com. (2019). Credit Card Fraud Detection. Retrieved 10 Jan. 2019. https://www.kaggle.com/mlg-ulb/creditcardfraud

Luthra, R., Nath, G., & Chellani, R. (2019). A Review on Class Imbalanced Correction Techniques: A Case of Credit Card Default Prediction on A Highly Imbalanced Dataset. Praxis Business School.

Li, Z., Huang, M., Liu, G., & Jiang, C. (2021). A hybrid method with dynamic weighted entropy for handling the problem of class imbalance with overlap in credit card fraud detection. Expert Systems with Applications, 175, 114750. https://doi.org/10.1016/j.eswa.2021.114750

Mˆına˘stireanu, E.A., and Mes¸ni¸ta˘, G. (2020). Methods of Handling Unbalanced Datasets in Credit Card Fraud Detection. BRAIN. Broad Research in Artificial Intelligence and Neuroscience, 11(1), 131-143. https://doi.org/10.18662/brain/11.1/19

Mohammed, R. A., Wong, K. W., Shiratuddin, M. F., & Wang, X. (2018, August). Scalable machine learning techniques for highly imbalanced credit card fraud detection: a comparative study. In Pacific Rim International Conference on Artificial Intelligence (pp. 237-246). Springer, Cham.

Mallidi, M. K. R., & Zagabathuni, Y. (2021). Analysis of Credit Card Fraud Detection using Machine Learning models on balanced and imbalanced datasets. International Journal of Emerging Trends in Engineering Research, 9(7). https://doi.org/10.30534/ijeter/2021/02972021

Najadat, H., Altiti, O., Aqouleh, A. A., & Younes, M. (2020, April). Credit card fraud detection based on machine and deep learning. In 2020 11th International Conference on Information and Communication Systems (ICICS), 204-208.

Ramentol, E., Caballero, Y., Bello, R., & Herrera, F. (2012). SMOTE-RSB*: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory. Knowledge and information systems, 33(2), 245-265.

Sisodia, D. S., Reddy, N. K., & Bhandari, S. (2017, September). Performance evaluation of class balancing techniques for credit card fraud detection. In 2017 IEEE International Conference on Power, Control, Signals and Instrumentation Engineering (ICPCSI), 2747-2752.

Sohony, I., Pratap, R., & Nambiar, U. (2018, January). Ensemble learning for credit card fraud detection. In Proceedings of the ACM India Joint International Conference on Data Science and Management of Data (pp. 289-294). https://doi.org/10.1145/3152494.3156815

Saheed, Y. K., Hambali, M. A., Arowolo, M. O., & Olasupo, Y. A. (2020, November). Application of GA feature selection on Naive Bayes, random forest and SVM for credit card fraud detection. In 2020 international conference on decision aid sciences and application (DASA), 1091-1097.

Shen, A., Tong, R., & Deng, Y. (2007, June). Application of classification models on credit card fraud detection. In 2007 International conference on service systems and service management, 1-4.

Varmedja, D., Karanovic, M., Sladojevic, S., Arsenovic, M., & Anderla, A. (2019, March). Credit card fraud detection-machine learning methods. In 2019 18th International Symposium. 1-5.

Zareapoor, M., & Yang, J. (2017). A novel strategy for mining highly imbalanced data in credit card transactions. Intelligent Automation & Soft Computing, 1-7. https://doi.org/10.1080/10798587.2017.1321228

Downloads

Published

2022-10-03

How to Cite

Rabiul Alam, B., Shimu Khatun, M., Taslim, M., & Alam Hossain, M. (2022). Handling Class Imbalance in Credit Card Fraud Using Various Sampling Techniques. American Journal of Multidisciplinary Research and Innovation, 1(4), 160–168. https://doi.org/10.54536/ajmri.v1i4.633