This question evaluates a data scientist's understanding of causal inference, confounding effects such as Simpson's paradox, heterogeneous treatment effects, and the specification of estimands in A/B testing.
You ran an A/B test in two geographies (City X and City Y). Within each city, variant A outperforms variant B. However, when you pool the data across both cities, the combined result shows variant B performing better than A.
Login required