This question evaluates a data scientist's competency in causal inference and experimental design, marketplace health metric construction, ROI estimation, and the handling of selection bias, spillover effects, multiple testing, and heterogeneity in two-sided platforms.
Context: You plan a rider‑side incentive (e.g., “20% off up to $10”) targeted by a propensity model. You must estimate causal incrementality and ROI in a two‑sided marketplace with selection and spillovers.
Do the following:
Define and provide formulas for:
Propose identification strategies for:
Write the ROI formula including cannibalization, subsidy burn, surge/ETA externalities, and habit formation. Specify how to estimate LTV lift and amortize CAC. Include confidence intervals via the delta method or bootstrap.
Design a two‑stage randomization (city×week, then rider) to identify direct vs spillover effects, and define estimands for total, direct, and indirect effects.
Choose and justify a correction strategy (Bonferroni vs BH vs hierarchical testing) for: 1 primary, 3 key secondaries, and ~20 diagnostics.
Outline a pre‑registered plan to detect effect moderation by city tier and weather using causal forests or group‑wise models while controlling Type‑S/M errors.
Enumerate pre‑trend checks, placebo windows, and negative‑control outcomes. Define criteria for declaring success and for rollback.
Login required