PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Amazon

Estimate Treatment Effects Using PSM, DiD, and DML Methods

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in causal inference methods—covering propensity score matching, difference-in-differences, synthetic control, and double machine learning—and the ability to discuss identification assumptions, overlap/balance diagnostics, robustness checks, and handling of high-dimensional confounders.

  • hard
  • Amazon
  • Statistics & Math
  • Data Scientist

Estimate Treatment Effects Using PSM, DiD, and DML Methods

Company: Amazon

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Onsite

##### Scenario Marketing analytics team wants to measure the causal impact of campaigns. ##### Question Explain how you would use Propensity Score Matching (PSM) and Difference-in-Differences (DiD) to estimate treatment effect. When is a synthetic control method preferable to DiD? Describe the Double Machine Learning (DML) framework for causal inference and why it helps with high-dimensional covariates. ##### Hints Cover identification assumptions, overlap, parallel trends, cross-fitting, and robustness checks.

Quick Answer: This question evaluates a data scientist's competency in causal inference methods—covering propensity score matching, difference-in-differences, synthetic control, and double machine learning—and the ability to discuss identification assumptions, overlap/balance diagnostics, robustness checks, and handling of high-dimensional confounders.

Related Interview Questions

  • Compute an A/B test p-value by hand - Amazon (medium)
  • Compute and interpret quantile loss vs RMSE - Amazon (medium)
  • Compute CIs, power, and multiple testing - Amazon (medium)
  • Plan and analyze an A/B test - Amazon (hard)
  • Compute p-values, CIs, and adjust multiples - Amazon (Medium)
Amazon logo
Amazon
Aug 4, 2025, 10:55 AM
Data Scientist
Onsite
Statistics & Math
17
0

Causal Impact of Marketing Campaigns: PSM, DiD, Synthetic Control, and DML

Scenario

You have observational data from a marketing campaign where some users/regions were exposed to a campaign (treatment) and others were not (control). You also have outcomes measured before and after the campaign for each unit.

Tasks

  1. Explain how you would use Propensity Score Matching (PSM) to estimate the treatment effect. Specify assumptions, how you would check overlap and balance, and what robustness checks you would run.
  2. Explain how you would use Difference-in-Differences (DiD) to estimate the treatment effect. State identification assumptions (e.g., parallel trends), how you would implement it in practice, and robustness checks.
  3. When is a Synthetic Control Method preferable to DiD? Provide the intuition and the key conditions that make it stronger than standard DiD.
  4. Describe the Double Machine Learning (DML) framework for causal inference, focusing on why it is useful with high-dimensional covariates. Include the role of cross-fitting and orthogonalization, and the required assumptions.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.