PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Uber

Apply instrumental variables under interference

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of causal inference with instrumental variables in the presence of interference, testing skills in defining units and aggregation levels for market‑level spillovers, articulating IV assumptions (relevance, exclusion restriction, independence, monotonicity), and formulating estimation frameworks such as two‑stage least squares. Commonly asked in Statistics & Math interviews for data scientist roles because networked marketplaces invalidate simple A/B tests, it sits in the econometrics/causal inference domain and primarily assesses practical application of IV methods while requiring conceptual understanding of identification, robustness diagnostics, and sensitivity analysis.

  • hard
  • Uber
  • Statistics & Math
  • Data Scientist

Apply instrumental variables under interference

Company: Uber

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Technical Screen

Suppose a clean A/B test isn’t feasible for a new ride‑sharing feature due to interference. Propose an instrumental‑variables approach to estimate its causal effect on trip volume. State and justify all IV assumptions precisely—relevance, exclusion, independence (as‑if random), and monotonicity (if claiming LATE). Give at least two concrete, plausibly exogenous instruments (e.g., staggered driver app version eligibility, exogenous weather shocks affecting demand but not the feature directly) and write the first‑stage and second‑stage (2SLS) equations. Describe how you’ll diagnose weak instruments (first‑stage F‑stat), run over‑identification tests (Sargan/Hansen), handle clustering/heteroskedasticity, and assess violations of exclusion under marketplace spillovers. Would an effectively unlimited supply environment make the exclusion restriction more or less credible, and why? If assumptions partially fail, outline sensitivity analyses or bounds (e.g., Conley‑type).

Quick Answer: This question evaluates understanding of causal inference with instrumental variables in the presence of interference, testing skills in defining units and aggregation levels for market‑level spillovers, articulating IV assumptions (relevance, exclusion restriction, independence, monotonicity), and formulating estimation frameworks such as two‑stage least squares. Commonly asked in Statistics & Math interviews for data scientist roles because networked marketplaces invalidate simple A/B tests, it sits in the econometrics/causal inference domain and primarily assesses practical application of IV methods while requiring conceptual understanding of identification, robustness diagnostics, and sensitivity analysis.

Related Interview Questions

  • Should Uber double member discounts? - Uber (medium)
  • Compare Two Coin Proportions - Uber (medium)
  • Analyze the Accident-Rate Spike - Uber (easy)
  • How do you derive CDF from a PDF? - Uber (easy)
  • Derive a CDF from a PDF - Uber (medium)
Uber logo
Uber
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Statistics & Math
7
0

IV estimation for a ride‑sharing feature when A/B testing is infeasible due to interference

Context

You need to estimate the causal effect of a new ride‑sharing feature on trip volume. A clean A/B test is not feasible because users (drivers/riders) interact within a marketplace, creating interference/spillovers across units (e.g., one driver's treatment can affect other drivers' and riders' outcomes in the same market/time).

Task

  • Propose an instrumental‑variables (IV) strategy to identify the causal effect of feature exposure/adoption on trip volume in the presence of interference.
  • Clearly define the unit of analysis, treatment, outcome, and the level at which interference is addressed (e.g., market × time clustering/aggregation).
  • State and justify all IV assumptions precisely:
    1. Relevance
    2. Exclusion restriction
    3. Independence (as‑if random)
    4. Monotonicity (if you claim a LATE interpretation)
  • Provide at least two concrete, plausibly exogenous instruments and justify them. Examples to consider include:
    • Staggered driver app version eligibility (e.g., forced update schedule, app‑store review lags)
    • An encouragement design (e.g., hash‑bucket canary eligibility) or a weather‑based interaction that shifts usage only among eligibles
  • Write the first‑stage and second‑stage (2SLS) equations, including controls and fixed effects.
  • Describe how you will:
    • Diagnose weak instruments (first‑stage F‑statistics; Kleibergen‑Paap for robust/clustered settings)
    • Run over‑identification tests (Sargan/Hansen J)
    • Handle heteroskedasticity and clustering (e.g., two‑way clustering by market and time; wild cluster bootstrap if few clusters)
    • Assess and mitigate violations of exclusion in the presence of marketplace spillovers
  • Discuss whether an effectively unlimited supply environment makes the exclusion restriction more or less credible, and why.
  • If assumptions partially fail, outline sensitivity analyses or bounds (e.g., Conley‑type plausibly exogenous bounds, Anderson‑Rubin tests).

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Uber•More Data Scientist•Uber Data Scientist•Uber Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.