This question evaluates a data scientist's competency in causal inference and observational impact estimation within the Analytics & Experimentation domain, emphasizing handling time-series seasonality, cross-sectional heterogeneity, panel and exposure data, and identification challenges.
A product feature was launched to 100% of traffic simultaneously, so there is no explicit control or holdout group. You need to estimate the causal impact of the launch on a key business metric (e.g., conversion rate, bookings, revenue), in a marketplace with strong seasonality and heterogeneity across geographies, devices, and user segments.
Assume you have historical data (pre- and post-launch), unit-level and geo-level panels, exposure logs (who actually saw/used the feature), and standard product/marketing covariates.
Outline:
Hints: Discuss pre–post analysis, synthetic controls, difference-in-differences, and propensity-based approaches; emphasize validation and sensitivity checks.
Login required