Prove new allocation outperforms manual baseline
Company: Amazon
Role: Data Scientist
Category: Analytics & Experimentation
Difficulty: hard
Interview Round: Onsite
Propose a rigorous plan to prove an automated package-allocation system outperforms the manual baseline. Address interference and spillovers by choosing a randomization unit (e.g., station-hour switchback or courier-level cluster randomization) and justify it. Define primary/guardrail metrics (on-time rate, SLA breaches, courier overtime, customer contacts, fairness Gini), pre-specify analysis (intention-to-treat, CUPED/covariate adjustment, heterogeneity by zip density), and a power/MDE calculation (baseline on-time 92%, 100k orders/day, intra-cluster correlation assumptions). Include ramp/rollback criteria, duration, and a diff-in-diff fallback if perfect randomization isn’t feasible. Explain how you’ll prevent gaming and contamination (inventory locking, shadow assignments), and how you’ll conclude superiority with statistical and practical significance (e.g., uplift ≥0.6 pp with non-inferior guardrails).
Quick Answer: This question evaluates skills in experimental design, causal inference, statistical power/MDE calculations, metric definition, and operational governance for large‑scale randomized trials in logistics.