This question evaluates statistical inference and experimental-design competencies for A/B testing—specifically the selection and validation of hypothesis tests, estimation and interpretation of p-values and confidence intervals, characterization of Type I and Type II errors, power and minimal detectable effect calculations, and approaches for multiple comparisons control, and is commonly asked to assess a candidate's ability to draw reliable conclusions from randomized experiments. It is in the Statistics & Math category and emphasizes practical application of statistical concepts (conceptual understanding of assumptions plus applied interpretation of results) rather than purely theoretical derivations.
You ran a two-week A/B experiment on a new algorithm. The primary metric is user active minutes. Assume the unit of randomization is the user, and each user is assigned to control or treatment for the full duration. You will analyze the per-user total (or average) active minutes over the two weeks.
Login required