This question evaluates statistical inference for proportions, sequential monitoring, and multiple-comparison error control—assessing competence with binomial-based CTR comparison, pooled standard errors, z-tests and confidence intervals, Type I error inflation from peeking, and FWER/FDR considerations.

You run a two-arm A/B test of click-through rate (CTR).
Let p_c and p_t be the CTRs (as proportions).
(a) Compute:
(b) Suppose you peeked at significance daily over 14 days using the same test threshold each day (no alpha spending). Quantify the approximate inflation of the overall Type I error, and then propose a corrected sequential monitoring plan using either:
(c) You also track 8 guardrail metrics. Explain how you would control familywise error rate (FWER) or false discovery rate (FDR) across these guardrails (and how this interacts with sequential monitoring).
Login required