
You are given observational, user-level product data where users self-select into receiving a new feature. The goal is to estimate the Average Treatment Effect on the Treated (ATT) for 7-day retention.
State clearly how you will handle each of the following:
(a) Propensity model specification (logistic vs gradient boosting) and rationale.
(b) Matching strategy (1:1 nearest neighbor with/without replacement), caliper definition and computation, and how you would tune these choices.
(c) Formal balance diagnostics (standardized mean differences, variance ratios, KS tests), thresholds (e.g., SMD < 0.1), and what you do when balance fails (re-weighting/rematching).
(d) Detection of lack of overlap and remedies (e.g., trimming/restriction to common support).
(e) Variance estimation approach (Abadie–Imbens vs bootstrap) and when each is valid.
(f) Rosenbaum sensitivity analysis: report the Γ (Gamma) at which conclusions would flip and how to communicate that interpretation to a PM.
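
As a starting point for parts (b) and (c), the caliper and balance checks can be sketched in a few lines. This is a minimal illustration, not a reference implementation. It assumes propensity scores have already been estimated, uses the common convention of a caliper equal to 0.2 standard deviations of the logit propensity score, and computes the SMD with a pooled standard deviation. All function names (`caliper_width`, `greedy_match`, `smd`, `variance_ratio`) are hypothetical helpers introduced for this example.

```python
import math
from statistics import mean, stdev, variance


def logit(p):
    """Log-odds of a propensity score strictly between 0 and 1."""
    return math.log(p / (1 - p))


def caliper_width(propensity_scores, multiplier=0.2):
    """Caliper on the logit scale: multiplier * SD of logit(propensity).

    The 0.2 default is an assumed convention, to be tuned against balance.
    """
    return multiplier * stdev([logit(p) for p in propensity_scores])


def greedy_match(treated_ps, control_ps, caliper):
    """Greedy 1:1 nearest-neighbor matching without replacement.

    Returns (treated_index, control_index) pairs; treated units with no
    control inside the caliper are left unmatched.
    """
    available = {j: logit(p) for j, p in enumerate(control_ps)}
    pairs = []
    for i, p in enumerate(treated_ps):
        if not available:
            break
        target = logit(p)
        # Closest still-unused control on the logit scale.
        j = min(available, key=lambda k: abs(available[k] - target))
        if abs(available[j] - target) <= caliper:
            pairs.append((i, j))
            del available[j]
    return pairs


def smd(treated, control):
    """Standardized mean difference using the pooled sample SD."""
    pooled_sd = math.sqrt((variance(treated) + variance(control)) / 2)
    return (mean(treated) - mean(control)) / pooled_sd


def variance_ratio(treated, control):
    """Ratio of sample variances, treated over control."""
    return variance(treated) / variance(control)
```

In use, one would compute `smd` and `variance_ratio` for every covariate on the matched sample and compare them with the chosen thresholds (for example |SMD| < 0.1). Dropping treated units that fall outside the caliper changes the population the ATT refers to, so the number of unmatched treated units should be reported alongside the estimate.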