This question evaluates a data scientist's competency in large-scale feature engineering, model selection and tuning, model explainability, and production debugging for a weekly churn prediction pipeline operating on millions of users.
You own a weekly churn-prediction pipeline that trains on 10 million users. The goal is to predict who will churn so the business can target retention interventions.
Hints: Discuss imbalance handling, SHAP, feature drift, and offline/online parity checks.
Login required