You are building a churn prediction model for a subscription business. Churn is defined as whether a customer cancels or becomes inactive in the next 30 days. The data is tabular with a mix of numeric and categorical features. The positive class (churners) is typically imbalanced.
Describe, step-by-step, how you would construct, train, validate, and evaluate a churn-prediction model in scikit-learn, including:
Include and explain the use of Pipeline, ColumnTransformer, GridSearchCV, cross-validation, and joblib.
Login required