This question evaluates a data scientist's competency in A/B test diagnostics, causal inference, and experiment validity checks within the Analytics & Experimentation domain, covering concepts such as sample ratio mismatch (SRM), covariate and exposure balance, sequential testing bias, and segment-level heterogeneity in retention.

An A/B test changed a button's color from green (control) to red (treatment). The primary metric (e.g., Day-7 user retention) decreased in the treatment. Stakeholders suspect the retention drop could be due to traffic allocation issues rather than the color itself.
Assume:
Outline a step-by-step plan to investigate whether traffic distribution problems caused the retention decrease. What diagnostics, balance checks, and statistical tests would you run before concluding the new color harms retention? Discuss:
Provide concrete tests, decision thresholds, and how you would interpret outcomes.
Login required