This question evaluates a candidate's competency in experimental design and statistical inference, covering sample-size and power calculations, the choice between Z and T tests, sequential testing/alpha-spending strategies, and causal-inference planning for non-randomized rollouts.
You are planning a two-variant A/B test on a signup funnel with the following parameters:
(a) Compute the required sample size per variant and the expected calendar duration (in days) to reach it given the traffic constraints above. State your formula and any continuity or pooled-variance assumptions.
(b) Explain whether you would use a Z-test or a T-test for the primary proportion metric and why. Under what conditions do their results materially differ? Include how estimating variance from data, small-sample corrections, and unequal sample sizes/variances affect your choice.
(c) If product wants the test to stop “as soon as it looks good,” propose a sequential testing or alpha-spending approach (e.g., group-sequential boundaries) that controls Type I error. Specify stopping rules and how they change the nominal sample size and expected duration.
(d) Suppose randomization is not possible for a related rollout. Sketch a causal inference plan: draw a DAG to identify confounders, propose an identification strategy (e.g., difference-in-differences with parallel trends checks or an instrumental variable) and list key assumptions you would need to defend. How would you validate those assumptions in practice?
Login required