PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Analytics & Experimentation/Uber

Design an ETA experiment under interference

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in experimental design and causal inference for two-sided marketplaces, focusing on interference, partial compliance, metric definition, randomization strategy, instrumentation, estimation, and power calculations within the Analytics & Experimentation domain.

  • hard
  • Uber
  • Analytics & Experimentation
  • Data Scientist

Design an ETA experiment under interference

Company: Uber

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

A new rider ETA model may change perceived wait times and cancellations in a two‑sided marketplace where drivers roam across zones (interference likely). Design an experiment to estimate the causal impact on rider experience and marketplace health. Requirements: - Define up to 3 primary success metrics (demand) and up to 5 guardrails (supply/quality). Give exact formulas and units; e.g., request→trip conversion = completed_requests / total_requests; driver earnings/hour; post‑dispatch cancellation within 5 minutes; pickup ETA p50. - Choose the randomization unit among rider, trip, driver, or zone (geohash‑6) × hour. Justify the choice with respect to SUTVA/interference, and specify blocking/stratification (city tier × weekday/weekend). - If unit randomization is infeasible, propose a stepped‑wedge rollout with a 10% geo holdout. Describe contamination controls (driver shadow pools, sticky assignment, exclusion buffers) and how you’ll measure and cap cross‑arm exposure. - Define instrumentation: an exposure variable that confirms the UI actually showed the new ETA. Specify an as‑treated estimator robust to partial compliance and how you’ll bound bias from post‑randomization selection. - Compare classic A/B vs synthetic control vs diff‑in‑diff: when each is valid here; what pre‑trend diagnostics you’ll run; how to build city‑level synthetic controls (donor pool, regularization, covariates). - Powering: Compute required sample size to detect a 1.5% relative drop in cancellations with baseline 12%, power 80%, alpha 0.05, ICC=0.02, average 1,500 trips/day/zone over 28 days. Show the clustered variance formula, the design effect, and the minimum number of zones per arm. - List at least four biases (e.g., demand shocks, surge, time‑of‑day heterogeneity, learning/fatigue) and a mitigation for each (blocking, CUPED with pre‑period cancels, calendar alignment, heterogeneity pre‑spec). - Monitoring: propose sequential harm boundaries (e.g., alpha‑spending) and a rollout rule using a cost‑benefit model that includes driver earnings impacts and SLA guardrails. - Deliverables: a pre‑registration outline (estimands, metrics, exclusions, missing‑data rules, interim looks) and a plan to report heterogeneity by city tier and weather without p‑hacking.

Quick Answer: This question evaluates a data scientist's competency in experimental design and causal inference for two-sided marketplaces, focusing on interference, partial compliance, metric definition, randomization strategy, instrumentation, estimation, and power calculations within the Analytics & Experimentation domain.

Related Interview Questions

  • Design a Maps Address Search Bar - Uber
  • Evaluate a cold-start rating launch - Uber (medium)
  • Design Pricing Model Experiment - Uber (medium)
  • Evaluate marketplace interventions - Uber (medium)
  • Evaluate UberEATS priority delivery and membership - Uber (medium)
Uber logo
Uber
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
12
0

Experiment Design: Estimating Causal Impact of a New Rider ETA Model in a Two-Sided Marketplace

Context

You are testing a new rider ETA model that changes the ETA shown in the app. Because drivers roam across zones, interference between units is likely (violating SUTVA if not handled). Design an experiment to estimate the causal effect on rider experience (demand) and overall marketplace health (supply/quality), accounting for interference and partial compliance.

Tasks

  1. Metrics
  • Define up to 3 primary success metrics (demand) and up to 5 guardrails (supply/quality).
  • Provide exact formulas and units; for example: request→trip conversion = completed_requests / total_requests; driver earnings/hour; post-dispatch cancellation within 5 minutes; pickup ETA p50.
  1. Randomization Unit
  • Choose the randomization unit among rider, trip, driver, or zone (geohash-6) × hour.
  • Justify the choice with respect to SUTVA/interference.
  • Specify blocking/stratification: city tier × weekday/weekend.
  1. If Unit Randomization Is Infeasible
  • Propose a stepped-wedge rollout with a 10% geo holdout.
  • Describe contamination controls (e.g., driver shadow pools, sticky assignment, exclusion buffers).
  • Explain how you will measure and cap cross-arm exposure.
  1. Instrumentation and Estimation
  • Define an exposure variable that confirms the rider UI actually showed the new ETA.
  • Specify an as-treated estimator robust to partial compliance and how you will bound bias from post-randomization selection.
  1. Estimation Design Comparison
  • Compare classic A/B vs synthetic control vs difference-in-differences: when each is valid here.
  • Describe pre-trend diagnostics you will run.
  • Explain how to build city-level synthetic controls (donor pool, regularization, covariates).
  1. Powering
  • Compute the required sample size to detect a 1.5% relative drop in cancellations with baseline 12%, power 80%, alpha 0.05, ICC = 0.02, average 1,500 trips/day/zone over 28 days.
  • Show the clustered variance formula, the design effect, and the minimum number of zones per arm.
  1. Biases and Mitigations
  • List at least four biases (e.g., demand shocks, surge, time-of-day heterogeneity, learning/fatigue) and a mitigation for each (blocking, CUPED with pre-period cancels, calendar alignment, heterogeneity pre-spec).
  1. Monitoring and Rollout
  • Propose sequential harm boundaries (e.g., alpha-spending).
  • Specify a rollout rule using a cost-benefit model that includes driver earnings impacts and SLA guardrails.
  1. Deliverables
  • Provide a pre-registration outline (estimands, metrics, exclusions, missing-data rules, interim looks).
  • Provide a plan to report heterogeneity by city tier and weather without p-hacking.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Uber•More Data Scientist•Uber Data Scientist•Uber Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.