Context
A product offers a free trial. You are asked: “What is the probability of conversion for a group of people who started a free trial?”
Data (assume available)
A table of trial starts:
-
user_id
-
trial_start_at
(timestamp)
-
trial_length_days
(int)
-
country/channel/plan/device
(optional segments)
A table of paid conversions:
-
user_id
-
paid_start_at
(timestamp, nullable if never converted)
-
revenue
(optional)
Task
-
Define precisely what “conversion probability” means, including the
time window
and cohort definition.
-
Show how you would estimate the conversion probability for a cohort of trial starters, including how you would handle:
-
users whose trial has not ended yet (incomplete observation)
-
delayed conversions (convert after trial)
-
segmentation and potential confounding (e.g., channel mix changes)
-
Provide a way to quantify uncertainty (confidence/credible intervals) and communicate results.
-
Suggest follow-up analyses/metrics (diagnostics and guardrails) to ensure the estimate is decision-useful.