Predicting User Churn for a Subscription App
Context
You are building a model to predict which active subscribers are likely to churn soon so the team can target retention offers. Assume:
-
Label: churn in the next 30 days (cancel subscription or no activity for 30 days).
-
Features: recent engagement (recency, frequency, session duration), tenure, plan type, payment history, support interactions, device/geo, marketing touches.
-
Data is time-ordered; avoid leakage by using only information available before the prediction date.
Tasks
(a) Which modeling techniques would you consider and why?
(b) How would you address class imbalance?
(c) What evaluation metrics would you use, and how would you choose thresholds for action?