PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Uber

Analyze results and large p-values correctly

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's mastery of experimental analysis and applied causal inference, touching on intent-to-treat estimation, cluster-robust inference, variance reduction, handling of ratio metrics and skewed outcomes, non-compliance and instrumental approaches, decision frameworks for large p-values, and heterogeneity with multiple-testing control. Commonly asked in the Statistics & Math domain to assess practical application with conceptual understanding, it measures the ability to reason about appropriate analysis level, validity of inference, and interpretation of ambiguous results rather than just computational skill.

  • hard
  • Uber
  • Statistics & Math
  • Data Scientist

Analyze results and large p-values correctly

Company: Uber

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Technical Screen

After the experiment ends, show exactly how you will analyze it: compute the intent‑to‑treat lift at the user assignment level with cluster‑robust standard errors; use CUPED or pre‑period covariates to reduce variance; correctly handle ratio metrics (delta method or Fieller) and skewed outcomes. Explain why session‑level analysis is problematic here (repeated measures per user, session counts correlated with treatment, non‑independence) and how to fix it (aggregate to user, mixed models, cluster‑robust SE). Handle non‑compliance/partial exposure (users who never opened) and estimate TOT via 2SLS using assignment as the instrument. If the p‑value is large, decide whether to: fail to reject vs claim no effect, run a post‑hoc power/MDE check, and/or run equivalence/non‑inferiority tests (TOST); optionally compare to a Bayesian posterior with a ROPE. Outline heterogeneity analysis and multiple‑testing control.

Quick Answer: This question evaluates a candidate's mastery of experimental analysis and applied causal inference, touching on intent-to-treat estimation, cluster-robust inference, variance reduction, handling of ratio metrics and skewed outcomes, non-compliance and instrumental approaches, decision frameworks for large p-values, and heterogeneity with multiple-testing control. Commonly asked in the Statistics & Math domain to assess practical application with conceptual understanding, it measures the ability to reason about appropriate analysis level, validity of inference, and interpretation of ambiguous results rather than just computational skill.

Related Interview Questions

  • Should Uber double member discounts? - Uber (medium)
  • Compare Two Coin Proportions - Uber (medium)
  • Analyze the Accident-Rate Spike - Uber (easy)
  • How do you derive CDF from a PDF? - Uber (easy)
  • Derive a CDF from a PDF - Uber (medium)
Uber logo
Uber
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Statistics & Math
5
0

Experiment Analysis Plan: User-Level ITT with Robust Inference, Variance Reduction, Ratios, Skew, Non-Compliance, and Decision Framework

Context

You ran a randomized experiment with randomization at the user level. Post-period outcomes are recorded at both session and user levels. Some users never opened/engaged (non-compliance/partial exposure). There are ratio metrics (e.g., revenue per session, conversion rate) and some outcomes are skewed. You have reliable pre-period data for variance reduction.

Task

Show exactly how you will analyze the experiment:

  1. Primary estimand and inference
    • Compute the intent-to-treat (ITT) lift at the user assignment level.
    • Use cluster-robust standard errors at the user level.
    • Use CUPED or pre-period covariates to reduce variance.
  2. Ratio metrics and skewed outcomes
    • Correctly handle ratio metrics via the delta method or Fieller's theorem.
    • Address skewed outcomes with appropriate techniques.
  3. Level-of-analysis rationale
    • Explain why session-level analysis is problematic here.
    • Provide fixes: aggregate to user, mixed models/GEE, cluster-robust SE.
  4. Non-compliance / partial exposure
    • Handle users who never opened.
    • Estimate treatment-on-the-treated (TOT) via 2SLS using assignment as the instrument.
  5. Decision framework
    • If the p-value is large, decide whether to fail to reject vs claim no effect.
    • Run a post-hoc power/MDE check.
    • Optionally run equivalence/non-inferiority tests (TOST) and/or compare to a Bayesian posterior with a ROPE.
  6. Explore heterogeneity and control multiplicity
    • Outline heterogeneity analysis and multiple-testing control.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Uber•More Data Scientist•Uber Data Scientist•Uber Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.