How would you evaluate a free-trial A/B test?
Company: OpenAI
Role: Data Scientist
Category: Analytics & Experimentation
Difficulty: medium
Interview Round: Technical Screen
You run an online marketing experiment to evaluate whether offering **a free 1‑month trial** increases growth.
## Experiment context
- Eligible visitors are randomly assigned at first exposure to one of two variants:
- **Control**: no free-trial offer
- **Treatment**: shown a free 1‑month trial offer
- The business cares about:
1) **Signup rate** (did the user start a trial?)
2) **Retention** (did the user come back after signing up?)
- Concern: Treatment could increase signups but attract lower-intent users, potentially hurting downstream retention and/or revenue.
## Your tasks
1) **Define metrics precisely**
- Propose a primary metric and key secondary metrics.
- Include at least one **guardrail** (e.g., revenue/cost/abuse).
- Give concrete definitions for “signup rate” and “retention” (e.g., D7/D30), including the denominator (a metric-definition sketch follows the task list).
2) **Choose the analysis approach**
- Specify the analysis population(s): **intention-to-treat (ITT) vs per-protocol**, and how you would handle users who never saw the offer after assignment.
- Explain how you would estimate the treatment effect for:
- a binary conversion metric (signup)
- a retention metric that is only defined for users who signed up (post-treatment selection); an estimation sketch follows the task list
3) **Identify common pitfalls / logic errors in an experiment analysis codebase**
Without writing code, list the most likely bugs or setup problems you would look for when reviewing Python analysis code for this experiment (e.g., bad time windows, wrong joins, leakage, incorrect denominators, repeated-measures issues, post-treatment filtering, peeking).
4) **Make a business recommendation**
- Describe how you would decide whether to ship the free-trial offer, iterate, or stop.
- Discuss what additional analyses you would do to connect the experiment to business value (e.g., LTV, payback period, heterogeneity, novelty effects).
Assume standard frequentist inference unless you justify alternatives.
Quick Answer: This question evaluates A/B test design and causal-inference skills within the Analytics & Experimentation domain. It emphasizes precise metric definition (including guardrails), the choice of analysis population (ITT vs per-protocol), handling of post-treatment selection, and interpretation of binary conversion and retention metrics, at a technical level aimed at intermediate-to-senior data scientists who must combine statistical rigor with product-impact thinking. It is commonly asked because it tests whether a candidate can balance acquisition against downstream retention and revenue, spot typical analysis pitfalls (time-window errors, incorrect joins or denominators, leakage, peeking, and selection bias), and translate experiment results into a business recommendation tied to value metrics such as LTV and payback period.