This question evaluates experimental-design and causal-inference competencies, specifically handling interference and spillovers, defining experimental units and treatment assignment, drawing causal DAGs, identifying bias sources, selecting primary and guardrail metrics, and pre-specifying decision rules.
A new driver-queue algorithm is being tested at a single airport with multiple terminals. The algorithm can reassign drivers across terminals, which may influence nearby terminals and the overall airport driver pool.
Decide whether to use a classic user-level A/B test or a switchback/geo-time experiment. Then:
Assume you can toggle the algorithm on/off at the airport level and collect standard operational metrics (wait times, cancellations, earnings, throughput).
Login required