PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Analytics & Experimentation/Amazon

Estimate live sports impact on subscriptions

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competence in causal inference and observational study design within analytics and experimentation, covering skills such as defining treatment and control, selecting outcome metrics and covariates, and choosing identification strategies like matching, IV, and difference-in-differences; it is commonly asked to assess the ability to produce credible causal estimates when randomized experiments are unavailable. It tests both conceptual understanding of identification assumptions (e.g., unconfoundedness, overlap, SUTVA, IV validity) and practical application of matching algorithms, propensity modeling, regression specifications, diagnostics, and staggered-rollout/event-study designs for real-world program evaluation.

  • hard
  • Amazon
  • Analytics & Experimentation
  • Data Scientist

Estimate live sports impact on subscriptions

Company: Amazon

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Onsite

Amazon is considering adding live broadcasts of selected sports events to Prime Video. Using observational data, estimate the causal impact on Prime membership subscriptions and engagement. Precisely specify: (1) treatment and control at the user level (e.g., users exposed to/actually watching live sports vs not), (2) primary outcomes (e.g., subscription starts, retention, upgrades, watch‑time), and (3) key covariates for selection on observables. Start with matching: choose a method (k‑NN, caliper, Mahalanobis, or propensity‑score matching), define distance or the propensity model class, explain overfitting controls (regularization, cross‑fitting), and provide balance diagnostics you will require (SMD thresholds, variance ratios, overlap plots). State assumptions (unconfoundedness, overlap, SUTVA) and how you will test/justify them. Then propose an instrument to address unobservables via randomized variation in promotion prominence for the live stream (e.g., hero banner vs standard tile). Write the 2SLS explicitly: First stage Z→LiveWatch with controls and fixed effects; second stage LiveWatch_hat→Outcome with the same controls. Discuss IV validity (relevance, exclusion, independence, monotonicity), weak‑IV checks (first‑stage F), and over‑identification tests if multiple instruments. Finally, outline a DID/event‑study alternative using staggered rollout, detail the regression, fixed effects, and heterogeneity, and list key threats (interference, spillovers, time‑varying confounding) with mitigation.

Quick Answer: This question evaluates a data scientist's competence in causal inference and observational study design within analytics and experimentation, covering skills such as defining treatment and control, selecting outcome metrics and covariates, and choosing identification strategies like matching, IV, and difference-in-differences; it is commonly asked to assess the ability to produce credible causal estimates when randomized experiments are unavailable. It tests both conceptual understanding of identification assumptions (e.g., unconfoundedness, overlap, SUTVA, IV validity) and practical application of matching algorithms, propensity modeling, regression specifications, diagnostics, and staggered-rollout/event-study designs for real-world program evaluation.

Related Interview Questions

  • Explain why CTR rises but CVR unchanged - Amazon (medium)
  • How would you test a price increase? - Amazon (medium)
  • How to evaluate adding video ads in a game - Amazon (easy)
  • How would you analyze and test a price increase? - Amazon (easy)
  • How would you evaluate adding video ads? - Amazon (medium)
Amazon logo
Amazon
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Analytics & Experimentation
1
0

Estimate the Causal Impact of Live Sports on Prime Subscriptions and Engagement

Context

Amazon is considering adding live broadcasts of selected sports events to Prime Video. Using observational data, design an analysis to estimate the causal impact on Prime membership subscriptions and user engagement.

Tasks

  1. Define treatment and control at the user level
    • Specify both an exposure-based definition and an uptake-based definition (e.g., users exposed to or actually watching live sports vs not).
  2. Define primary outcome metrics
    • Examples: subscription starts, retention, upgrades, watch-time.
  3. List key covariates to justify selection on observables.
  4. Matching design
    • Choose a method (k-NN, caliper, Mahalanobis, or propensity-score matching).
    • Define the distance metric or the propensity model class.
    • Explain overfitting controls (regularization, cross-fitting).
    • Specify balance diagnostics you require (SMD thresholds, variance ratios, overlap plots).
  5. Assumptions
    • State unconfoundedness, overlap, SUTVA, and how you will test or justify them.
  6. Instrumental variables
    • Propose an instrument based on randomized variation in promotion prominence for the live stream (e.g., hero banner vs standard tile).
    • Write the 2SLS explicitly: First stage Z → LiveWatch with controls and fixed effects; second stage LiveWatch_hat → Outcome with the same controls.
    • Discuss IV validity: relevance, exclusion, independence, monotonicity; weak-IV checks (first-stage F); and over-identification tests if multiple instruments.
  7. DID and event study alternative
    • Outline a staggered rollout design.
    • Provide the regression specification, fixed effects, heterogeneity, and key threats (interference, spillovers, time-varying confounding) with mitigations.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.