Interdisciplinary Journal of Environment and Computer Innovations

Comparative Analysis of Machine Learning Models for Customer Churn Prediction Using Engineered Behavioral Features

Pink V. Acas, Jahilyn S. Lacuarin, Janna B. Nayve, Charlene O. Curan, John Frank H. Galagar

Volume: 1, Issue: 1, Pages: 42-49, Published: 2026-06-01
Online ISSN: 3116-5850
Publisher: Bohol Island State University Candijay Campus College of Sciences

Abstract

This study presents a machine learning approach for bank customer churn prediction using a leakage-controlled preprocessing pipeline and engineered behavioral features. The dataset consisted of 10,000 customer records, where identifier and leakage-prone variables were removed to improve methodological validity. Five classification models—Logistic Regression, Naive Bayes, Support Vector Machine (SVM), Random Forest, and Gradient Boosting—were evaluated. Feature engineering introduced composite variables, including BalanceSalaryRatio, ProductsPerTenure, CreditScoreAgeIndex, ActivityAdjustedBalance, and PointsPerProduct, to capture customer behavior patterns. The dataset was split into 80% training and 20% testing sets using stratified sampling with a fixed random state. Model performance was assessed using Accuracy, Precision, Recall, F1 Score, and ROC AUC. Results show that Gradient Boosting achieved the best overall balance, with Accuracy = 0.8690, F1 Score = 0.6124, and ROC AUC = 0.8753. The findings indicate that leakage-controlled preprocessing and behavior-based feature engineering provide a practical and interpretable approach for customer churn prediction and retention analytics.

Keywords

Customer Churn Prediction, Machine Learning, Feature Engineering, Predictive Analytics, Customer Retention

Full Paper: The full paper will be available soon.