PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/DoorDash

Design and analyze batching algorithm experiment

Last updated: Mar 29, 2026

Quick Overview

This question evaluates experiment design and causal inference competencies—covering geo-randomization and spillover control, precise metric specification with guardrails, cluster-based power and sample-size calculations, intention-to-treat analysis with cluster-robust inference, heterogeneity analysis, and operational rollout and decision-rule planning in the Analytics & Experimentation domain. It is commonly asked because interviewers need to assess both conceptual understanding and practical application: designing robust cluster-randomized geo-experiments that limit interference, define and pre-register metrics and analysis, compute cluster-adjusted power, and specify operational safeguards and clear shipping criteria.

  • hard
  • DoorDash
  • Analytics & Experimentation
  • Data Scientist

Design and analyze batching algorithm experiment

Company: DoorDash

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Onsite

DoorDash plans to test a new order-batching/dispatch algorithm in 10 cities during August–September 2025. Spillovers between nearby areas are likely. Design and analyze the experiment: 1) Randomization unit: Propose a geo-cell clustering and stratified randomization plan that limits interference (e.g., 2–5 km hex cells), ensuring balance on baseline order volume, cuisine mix, and dasher supply. How will you detect and mitigate cross-cell spillovers? 2) Primary/secondary metrics: Choose one primary success metric (e.g., orders delivered per dasher hour, or 90th percentile delivery time) and at least three guardrails (e.g., customer cancellations, courier wait-at-store, restaurant prep SLA breaches, fairness across neighborhoods). Define each precisely, including inclusion/exclusion rules and winsorization. 3) Power: Baseline P90 delivery time is 40.0 minutes with SD 7.0; you expect a −1.2 minute improvement. There are 50 geo-cells per arm with average 8,000 orders per cell over the test. Intracluster correlation (ICC) is 0.20. Compute required sample size or achieved power using cluster-robust approximations; state any design effects and assumptions. 4) Analysis: Specify the intention-to-treat model with cluster-robust SEs, include pre-period CUPED covariates, and a plan for SRM checks. Provide the exact regression you would run (formula and covariates), how you will handle right-skew (e.g., log transform or quantile regression), and how you will aggregate cell-level quantiles. 5) Heterogeneity: Pre-register subgroup analyses (e.g., by time-of-day, cuisine, weather severity index). Show how you will control false discovery. What minimum subgroup sample size do you require? 6) Operational rollout: Describe a safe ramp plan and “kill switches.” If a mid-test storm hits 3 cities, explain how you’ll use difference-in-differences or synthetic control on impacted cells without biasing the ITT. 7) Decision rule: Write the exact thresholding rule (effect size, confidence, and guardrail constraints) for shipping, with an example calculation using hypothetical results.

Quick Answer: This question evaluates experiment design and causal inference competencies—covering geo-randomization and spillover control, precise metric specification with guardrails, cluster-based power and sample-size calculations, intention-to-treat analysis with cluster-robust inference, heterogeneity analysis, and operational rollout and decision-rule planning in the Analytics & Experimentation domain. It is commonly asked because interviewers need to assess both conceptual understanding and practical application: designing robust cluster-randomized geo-experiments that limit interference, define and pre-register metrics and analysis, compute cluster-adjusted power, and specify operational safeguards and clear shipping criteria.

Related Interview Questions

  • Evaluate Biker Feature Success - DoorDash (hard)
  • How would you test product changes? - DoorDash (hard)
  • How to test bike delivery? - DoorDash (medium)
  • Investigate LA successful orders drop - DoorDash (easy)
  • How would you diagnose a completed orders drop? - DoorDash (easy)
DoorDash logo
DoorDash
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Analytics & Experimentation
4
0
Loading...

Experiment Design: New Order-Batching/Dispatch Algorithm (Aug–Sep 2025)

You are tasked with designing and analyzing a geo-experiment for a new order-batching/dispatch algorithm across 10 cities, where nearby areas may interfere with each other (e.g., dashers traveling across borders, demand shifting). The test runs for August–September 2025.

Provide a detailed plan for each item below.

  1. Randomization unit and spillover control
  • Propose a geo-cell construction (e.g., 2–5 km hex cells) and a clustering/assignment scheme that limits interference across treatment and control.
  • Describe stratified randomization to balance baseline order volume, cuisine mix, and dasher supply.
  • Explain how you will detect spillovers across cells and how you’ll mitigate them in design and analysis.
  1. Primary and secondary metrics
  • Choose exactly one primary success metric (e.g., Orders Delivered per Dasher Hour (ODDH) or P90 delivery time) and at least three guardrails (e.g., customer cancellations, courier wait-at-store, restaurant prep SLA breaches, fairness across neighborhoods).
  • Define each metric precisely, including:
    • Inclusion/exclusion rules (what orders are in-scope, how to treat outliers, outages, bad telemetry).
    • Any winsorization or trimming and at what cutoffs (e.g., 1st/99th percentiles).
  1. Power and sample size
  • Baseline P90 delivery time = 40.0 minutes with SD = 7.0 minutes; expected improvement = −1.2 minutes.
  • There are 50 geo-cells per arm with an average of 8,000 orders per cell over the test; intracluster correlation (ICC) = 0.20.
  • Compute required sample size or achieved power using cluster-robust approximations. Show formulas, design effects, and assumptions.
  1. Analysis plan (ITT)
  • Specify the intention-to-treat model with cluster-robust standard errors.
  • Include how you will use pre-period CUPED covariates.
  • Describe SRM checks.
  • Provide the exact regression you would run (formula and covariates), how you will handle right-skew (e.g., log transform or quantile regression), and how you will aggregate cell-level quantiles if needed.
  1. Heterogeneity
  • Pre-register subgroup analyses (e.g., by time-of-day, cuisine, weather severity index).
  • Explain how you will control the false discovery rate and the minimum subgroup sample size required for valid inference.
  1. Operational rollout and shocks
  • Describe a safe ramp plan and concrete kill switches.
  • If a mid-test storm hits 3 cities, explain how you’ll use difference-in-differences or synthetic control on impacted cells without biasing the ITT.
  1. Decision rule
  • Write the exact shipping rule (effect size, statistical confidence, and guardrail constraints).
  • Include a worked example with hypothetical results.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More DoorDash•More Data Scientist•DoorDash Data Scientist•DoorDash Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.