PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Analytics & Experimentation/Uber

Design an Uber A/B experiment end-to-end

Last updated: Mar 29, 2026

Quick Overview

This question evaluates experimental design and causal inference skills, covering A/B testing, randomization and interference considerations, metric selection and guardrails, instrumentation and logging, sample size calculation, ramping and bias/variance controls in the Analytics & Experimentation domain.

  • hard
  • Uber
  • Analytics & Experimentation
  • Data Scientist

Design an Uber A/B experiment end-to-end

Company: Uber

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

Uber is considering a redesign of the pickup ETA card shown to riders after they request a trip. The hypothesis is that clearer ETA presentation reduces cancellations and increases completed trips. Design the experiment end-to-end. Provide: - Experimental unit and randomization: Choose rider-level, request-level, or geo/time cluster; justify and discuss interference risks (e.g., supply-side coupling, surge, driver behavior) and how you would mitigate them. - Target population and eligibility: Precisely define who is included/excluded (e.g., first-time vs repeat riders, specific cities, iOS/Android versions), and how to handle riders with multiple requests in the window. - Primary metric and guardrails: Pick a single primary metric; propose 2–3 guardrails (e.g., driver acceptance rate, average wait time, surge incidence). Define each precisely, including numerator/denominator and window. - Instrumentation plan: List exact events/fields to log (e.g., request_id, rider_id, device_id, city_id, treatment_arm, event_name, timestamp_ms, cancel_reason, trip_started, trip_completed, driver_supply_at_request, ETA_shown). Include sampling rate, idempotency, and assignment audit (SRM checks). - Sample size and duration: Assume baseline booking-to-trip-completion rate p0 = 0.120, minimum detectable relative lift = +5%, two-sided α = 0.05, power = 0.80, 1:1 split, and 200,000 eligible requests per day (stable). Compute the per-arm sample size for a difference-in-proportions test and the expected experiment duration (days) to reach it, accounting for data loss of 3%. - Ramp strategy: Propose a safe ramp (e.g., 1%→10%→50%→100%), pre-specified stopping rules, and monitoring (including SRM and guardrails) at each ramp. - Bias/variance controls: Describe how you would handle seasonality, city heterogeneity (e.g., stratification or CUPED with pre-period completion), and bot/abuse filtering. - Success criteria and rollout: Pre-register decision thresholds and what to do if primary improves but a guardrail degrades. State any additional assumptions you need and show the sample size math clearly.

Quick Answer: This question evaluates experimental design and causal inference skills, covering A/B testing, randomization and interference considerations, metric selection and guardrails, instrumentation and logging, sample size calculation, ramping and bias/variance controls in the Analytics & Experimentation domain.

Related Interview Questions

  • Design a Maps Address Search Bar - Uber
  • Evaluate a cold-start rating launch - Uber (medium)
  • Design Pricing Model Experiment - Uber (medium)
  • Evaluate marketplace interventions - Uber (medium)
  • Evaluate UberEATS priority delivery and membership - Uber (medium)
Uber logo
Uber
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
23
0

Experiment Design: Pickup ETA Card Redesign

Context: After a rider requests a trip, the app shows a pickup ETA card. The hypothesis is that clearer ETA presentation reduces cancellations and increases completed trips. Design an end-to-end A/B test to evaluate this.

Provide the following:

  1. Experimental unit and randomization
  • Choose one: rider-level, request-level, or geo/time cluster.
  • Justify your choice and discuss interference risks (e.g., supply-side coupling, surge dynamics, driver behavior) and how you would mitigate them.
  1. Target population and eligibility
  • Precisely define inclusion/exclusion (e.g., first-time vs repeat riders, specific cities, iOS/Android versions).
  • State how you will handle riders with multiple requests during the experiment window.
  1. Primary metric and guardrails
  • Select a single primary metric.
  • Propose 2–3 guardrails (e.g., driver acceptance rate, average wait time, surge incidence).
  • Define each metric precisely: numerator, denominator, and measurement window.
  1. Instrumentation plan
  • List exact events/fields to log, e.g., request_id, rider_id, device_id, city_id, treatment_arm, event_name, timestamp_ms, cancel_reason, trip_started, trip_completed, driver_supply_at_request, ETA_shown.
  • Include sampling rate, idempotency strategy, and assignment audit plan (including SRM checks).
  1. Sample size and duration
  • Assume: baseline booking-to-trip-completion rate p0 = 0.120; minimum detectable relative lift = +5%; two-sided α = 0.05; power = 0.80; 1:1 split; 200,000 eligible requests per day (stable).
  • Compute the per-arm sample size for a two-sample difference-in-proportions test and the expected experiment duration (days) to reach it, accounting for 3% data loss.
  • Show the sample size math clearly.
  1. Ramp strategy
  • Propose a safe ramp (e.g., 1% → 10% → 50% → 100%), pre-specified stopping rules, and monitoring (including SRM and guardrails) at each ramp.
  1. Bias/variance controls
  • Describe controls for seasonality, city heterogeneity (e.g., stratification or CUPED with pre-period completion), and bot/abuse filtering.
  1. Success criteria and rollout
  • Pre-register decision thresholds and actions if the primary metric improves but a guardrail degrades.

State any additional assumptions you need.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Uber•More Data Scientist•Uber Data Scientist•Uber Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.