This question evaluates a candidate's competency in diagnosing and debugging production machine learning pipelines, covering data validation, temporal leakage detection, label and feature drift analysis, calibration, and training regularization considerations.
You are evaluating a binary churn prediction system with:
Assume the goal is to predict whether a customer will churn in the next period using only information available up to an "as-of" cutoff time.
Describe, step-by-step, how you would debug this system. Cover:
Propose concrete experiments to isolate root causes, the metrics you would inspect, and recommend fixes plus a plan to verify improvements and prevent regressions (tests, data versioning, monitoring).
Login required