This question evaluates understanding of Simpson's paradox, causal inference, experimental design, metric selection, and statistical inference within A/B testing for email campaigns, testing Analytics & Experimentation competencies for a Data Scientist role.
A marketing team tests a new email variant B vs control A.
The experiment ran for two weeks in two cities (e.g., SF and NY). When you look within each city-week segment, variant B appears to outperform A. But when you aggregate all data together, variant A appears to outperform B.