How do I approach Statistics & Math interview questions?

Statistics & Math questions reward a firm grasp of core concepts and regular practice. PracHub provides worked solutions with explanations to help you prepare for Statistics & Math interviews.

What difficulty level is this interview question?

This is a hard-difficulty Statistics & Math question, commonly asked during onsite rounds at DoorDash.

What role is this question designed for?

This question is commonly asked for Data Scientist candidates at DoorDash during technical interviews.

Define and compute retention and churn precisely

Quick Overview

This question evaluates a Data Scientist's competency in statistical measurement of user retention and churn, covering cohort definition, activity rules, risk-set-aware retention and churn formulas, censoring and delayed conversion handling, and survival-analysis concepts like time-to-churn, hazard, and cumulative incidence.

Retention and Churn for a Transactional Consumer App

Context: You are analyzing retention and churn for a transactional consumer app (e.g., food delivery, ride-hailing). Users place discrete orders over time. Your goal is to define, compute, and interpret retention and churn correctly for product, marketing, and experimentation use-cases.

Tasks

Definitions and Justification

Choose precise definitions for:
- Cohorts: signup vs. first-purchase (activation) cohorts.
- Activity: a user is active if they place at least one order in a period.
- Retention types: N-day, week-N, rolling, and bracket retention.
- Churn: no activity for K consecutive periods.
Justify choices based on decision use-cases.
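
As a minimal sketch of how these definitions could be made concrete, the snippet below encodes each retention type and the churn rule as a small predicate over a user's order history. It assumes an activation (first-purchase) cohort, measures time in days since the first order, and treats week-N retention as a bracket covering days 7N to 7N+6. The order log, function names, and the choice of days as the period are illustrative assumptions, not part of the original question.

```python
from datetime import date

# Hypothetical order log (made-up data): user_id -> order dates.
ORDERS = {
    "u1": [date(2024, 1, 1), date(2024, 1, 8), date(2024, 1, 30)],
    "u2": [date(2024, 1, 1), date(2024, 1, 3)],
    "u3": [date(2024, 1, 1)],
}

def day_offsets(order_dates):
    """Days since the user's first order (the activation cohort anchor)."""
    start = min(order_dates)
    return sorted((d - start).days for d in order_dates)

def n_day_retained(offsets, n):
    """Strict N-day retention: an order exactly on day n."""
    return n in offsets

def bracket_retained(offsets, lo, hi):
    """Bracket retention: at least one order with lo <= day <= hi."""
    return any(lo <= d <= hi for d in offsets)

def week_n_retained(offsets, n):
    """Week-N retention, expressed as the bracket of days 7n..7n+6."""
    return bracket_retained(offsets, 7 * n, 7 * n + 6)

def rolling_retained(offsets, n):
    """Rolling (unbounded) retention: any order on day n or later."""
    return any(d >= n for d in offsets)

def churned(offsets, k, observed_through):
    """Churned as of day observed_through: at least k days since the last order."""
    return observed_through - max(offsets) >= k

OFFSETS = {user: day_offsets(dates) for user, dates in ORDERS.items()}
```

With this data, u1 has offsets [0, 7, 29], so u1 counts as day-7 retained and rolling day-14 retained, while u3 (a single order) is churned under K = 14 once 30 days have been observed. Which rule to report depends on the decision: strict or bracket rules suit habit and experiment metrics, while rolling rules answer "did the user ever come back".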

Formulas and Correct Computation

Provide formulas for cohort retention and churn using proper risk sets.
Handle right-censoring and delayed conversion.
Explain pitfalls such as survivorship bias, Simpson’s paradox, and seasonality.
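
One way to illustrate the risk-set idea is the sketch below. It computes bracket retention as (users active in days lo..hi) / (users observed through day hi), so users whose follow-up ends before the bracket closes are treated as right-censored and left out of the denominator rather than counted as not retained. Offsets are anchored at the first order, so the lag between signup and first purchase (delayed conversion) is not counted as inactivity. The data and function names are made up, and the approach assumes censoring is purely administrative (driven by activation date, not by user behavior).

```python
# Each user: (follow_up_days, order_offsets), both in days since first order.
# Made-up data for illustration only.
USERS = [
    (60, [0, 10, 35]),
    (60, [0]),
    (20, [0, 5]),      # right-censored: follow-up ends before day 28
    (45, [0, 31]),
]

def bracket_retention(users, lo, hi):
    """Retention in days lo..hi among users observed through day hi."""
    at_risk = [offs for follow_up, offs in users if follow_up >= hi]
    if not at_risk:
        return None  # nobody has been observed long enough yet
    retained = sum(any(lo <= d <= hi for d in offs) for offs in at_risk)
    return retained / len(at_risk)

def naive_retention(users, lo, hi):
    """Same numerator, but every user is kept in the denominator."""
    retained = sum(any(lo <= d <= hi for d in offs) for _, offs in users)
    return retained / len(users)

risk_set_rate = bracket_retention(USERS, 28, 34)  # 1 of 3 observable users
naive_rate = naive_retention(USERS, 28, 34)       # 1 of 4 users
print(risk_set_rate, naive_rate)
```

In this example the naive rate is lower than the risk-set rate because the censored user is counted as a failure despite never having had the chance to be observed in week 4.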

Measuring Long-term Retention Impact of a Treatment via Survival Analysis

Define time-to-churn, hazard, and cumulative incidence.
Specify how to compare treatment/control curves (log-rank or stratified tests).
Explain covariate adjustment.
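
The survival quantities above can be sketched without any external library. Below, time-to-churn is the duration until churn or until observation ends (censoring), the discrete hazard at time t is churn events at t divided by users still at risk at t, and the Kaplan-Meier survival curve is the running product of (1 - hazard). Cumulative incidence is taken as 1 - S(t), which assumes there are no competing risks. The log-rank function returns the standard chi-square statistic with one degree of freedom. All data is made up, and the function names are illustrative.

```python
def kaplan_meier(durations, events):
    """Return [(t, S(t))] at each distinct churn time.

    durations: time to churn or to censoring.
    events: 1 if churn was observed, 0 if the user was censored.
    """
    times = sorted({t for t, e in zip(durations, events) if e})
    surv, curve = 1.0, []
    for t in times:
        at_risk = sum(d >= t for d in durations)
        churns = sum(d == t and e == 1 for d, e in zip(durations, events))
        surv *= 1 - churns / at_risk  # 1 - discrete hazard at t
        curve.append((t, surv))
    return curve

def logrank_chi2(d_a, e_a, d_b, e_b):
    """Log-rank chi-square statistic (1 d.f.) comparing groups a and b."""
    times = sorted({t for t, e in zip(d_a + d_b, e_a + e_b) if e})
    obs_minus_exp, var = 0.0, 0.0
    for t in times:
        n_a = sum(d >= t for d in d_a)
        n_b = sum(d >= t for d in d_b)
        o_a = sum(d == t and e == 1 for d, e in zip(d_a, e_a))
        o_b = sum(d == t and e == 1 for d, e in zip(d_b, e_b))
        n, o = n_a + n_b, o_a + o_b
        obs_minus_exp += o_a - o * n_a / n  # observed minus expected in a
        if n > 1:
            var += o * (n_a / n) * (n_b / n) * (n - o) / (n - 1)
    return obs_minus_exp ** 2 / var if var > 0 else 0.0

# Made-up weeks-to-churn; event 0 means still active when observation ended.
control_t, control_e = [2, 3, 3, 5, 8], [1, 1, 1, 1, 0]
treat_t, treat_e = [4, 6, 8, 8, 8], [1, 1, 0, 0, 0]

control_curve = kaplan_meier(control_t, control_e)
treat_curve = kaplan_meier(treat_t, treat_e)
chi2 = logrank_chi2(control_t, control_e, treat_t, treat_e)
print(control_curve, treat_curve, chi2)
```

A stratified test would compute the observed-minus-expected and variance terms within each stratum and sum them before forming the statistic. Covariate adjustment is typically done with a regression model such as Cox proportional hazards, which is not sketched here.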

Rolling vs. Strict Cohort Retention

Show how rolling retention can disagree with strict cohort retention.
Include a made-up numerical example and compute both correctly.
Explain how to reconcile for executives.
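
A made-up example of the disagreement might look like the sketch below. It compares two hypothetical cohorts of 10 users each, using strict week-1 retention (an order on days 7 to 13) and rolling day-7 retention (an order on day 7 or any later day). The cohorts, offsets, and function names are invented for illustration, and every user is assumed to be fully observed.

```python
# Order offsets in days since first order; one list per user. Made-up data.
COHORT_A = [[0, 8]] * 4 + [[0, 20]] * 1 + [[0]] * 5
COHORT_B = [[0, 9]] * 2 + [[0, 25]] * 5 + [[0]] * 3

def strict_week1(offsets):
    """Strict cohort retention: an order within days 7..13."""
    return any(7 <= d <= 13 for d in offsets)

def rolling_day7(offsets):
    """Rolling retention: an order on day 7 or later."""
    return any(d >= 7 for d in offsets)

def rate(cohort, rule):
    """Share of the cohort satisfying the given retention rule."""
    return sum(rule(offsets) for offsets in cohort) / len(cohort)

strict_a, strict_b = rate(COHORT_A, strict_week1), rate(COHORT_B, strict_week1)
rolling_a, rolling_b = rate(COHORT_A, rolling_day7), rate(COHORT_B, rolling_day7)
print(strict_a, strict_b, rolling_a, rolling_b)
```

Here cohort A leads on strict week-1 retention (40% vs. 20%) while cohort B leads on rolling day-7 retention (70% vs. 50%), because many B users return only after week 1. One way to reconcile the two for executives is to present strict retention as "came back on schedule" and rolling retention as "came back at all", and to note that rolling retention keeps rising as follow-up lengthens, so it should only be compared across cohorts with equal observation time.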

Windows and Experimental Design

Explain how you would set washout, observation, and attribution windows.
Discuss how these choices affect experiment power and bias.
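
To make the power trade-off concrete, the sketch below estimates the per-arm sample size needed to detect a difference between two retention rates. It assumes a two-sided z-test for two proportions with equal arm sizes and uses the usual normal-approximation formula; the retention rates are made up. The link to window choice is that the observation window determines the baseline retention rate, which in turn changes the variance and the required sample size.

```python
import math
from statistics import NormalDist

def sample_size_per_arm(p_control, p_treat, alpha=0.05, power=0.80):
    """Approximate users per arm to detect p_control vs. p_treat.

    Normal approximation for a two-sided, two-proportion z-test with
    equal allocation. A planning sketch, not an exact calculation.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance = p_control * (1 - p_control) + p_treat * (1 - p_treat)
    effect = p_treat - p_control
    return math.ceil((z_alpha + z_beta) ** 2 * variance / effect ** 2)

# Hypothetical baselines: a short window with 40% retention and a long
# window with 20% retention, each with a 2-point absolute lift.
n_short_window = sample_size_per_arm(0.40, 0.42)
n_long_window = sample_size_per_arm(0.20, 0.22)
print(n_short_window, n_long_window)
```

With the same absolute lift, the lower baseline needs fewer users per arm because its variance is smaller, but a longer window also delays the readout and exposes more users to censoring. A washout period that drops early post-exposure activity reduces novelty bias at the cost of discarding data, and a short attribution window can understate effects that take time to appear.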
