PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Statistics & Math/TikTok

Control confounding in observational ad lift

Last updated: Mar 29, 2026

Quick Overview

This question evaluates mastery of observational causal inference for ATE estimation, covering causal DAGs and identification, pre-treatment adjustment, propensity-score and inverse-probability/doubly-robust estimation, diagnostic checks, sensitivity analysis for unobserved confounding, and variance/uncertainty quantification in the context of ad exposure data. It is commonly asked because real-world advertising analyses cannot rely on randomization and interviewers need assurance of both conceptual understanding of identification assumptions and practical application of estimation and diagnostic techniques; the category is Statistics & Math and the level of abstraction spans conceptual reasoning and applied implementation.

  • hard
  • TikTok
  • Statistics & Math
  • Data Scientist

Control confounding in observational ad lift

Company: TikTok

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Onsite

You cannot randomize ad exposure. Users differ in age, education, and income. Propose a causal inference plan to estimate the ATE of ad exposure on conversions. Include: (1) a DAG to justify a valid adjustment set; (2) a propensity score model and either matching or inverse-probability weighting with stabilized weights; (3) formulas for ATE via IPW and doubly robust (AIPW) estimators; (4) diagnostics (overlap checks, standardized mean differences before/after, effective sample size, weight trimming); (5) sensitivity analysis to unobserved confounding (e.g., Rosenbaum bounds); (6) avoiding post-treatment bias (exclude engagement mediators); (7) variance estimation and uncertainty reporting. Discuss when you’d prefer diff-in-diff or CUPED and required assumptions.

Quick Answer: This question evaluates mastery of observational causal inference for ATE estimation, covering causal DAGs and identification, pre-treatment adjustment, propensity-score and inverse-probability/doubly-robust estimation, diagnostic checks, sensitivity analysis for unobserved confounding, and variance/uncertainty quantification in the context of ad exposure data. It is commonly asked because real-world advertising analyses cannot rely on randomization and interviewers need assurance of both conceptual understanding of identification assumptions and practical application of estimation and diagnostic techniques; the category is Statistics & Math and the level of abstraction spans conceptual reasoning and applied implementation.

Related Interview Questions

  • Explain Type I/II errors vs precision/recall - TikTok (easy)
  • Compute cluster-aware significance and sequential corrections - TikTok (medium)
  • Model overdispersed counts; estimate treatment lift - TikTok (Medium)
  • Decide if subgroup increases imply overall increase - TikTok (medium)
  • Act when A/B result is not significant - TikTok (hard)
TikTok logo
TikTok
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
5
0

Estimating the ATE of Ad Exposure on Conversions (Observational Setup)

You cannot randomize ad exposure. Users differ in age, education, income, and other characteristics. Propose a causal inference plan to estimate the Average Treatment Effect (ATE) of ad exposure on conversions.

Assume we observe users i = 1, …, n over a fixed window. Let A_i ∈ {0,1} indicate whether user i saw the focal ad (at least one impression), and Y_i ∈ {0,1} indicate whether the user converted in the window. Let X_i be pre-exposure covariates.

Include the following:

  1. A DAG that justifies a valid pre-treatment adjustment set and a brief identification argument.
  2. A propensity score model using pre-exposure covariates, and either:
    • Matching on the propensity score, or
    • Inverse-probability weighting with stabilized weights.
  3. Formulas for ATE via IPW and doubly robust (AIPW) estimators.
  4. Diagnostics: overlap checks, standardized mean differences before/after (unweighted and weighted), effective sample size, and weight trimming/clipping.
  5. Sensitivity analysis for unobserved confounding (e.g., Rosenbaum bounds). Optionally, alternatives like E-values or Oster’s δ.
  6. How to avoid post-treatment bias (exclude engagement mediators such as clicks/dwell time that occur after exposure).
  7. Variance estimation and uncertainty reporting (SEs, CIs), including recommendations for sandwich/robust SEs, cluster-robust options, and bootstrap.

Also discuss when you would prefer difference-in-differences (DiD) or CUPED, and the assumptions required.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More TikTok•More Data Scientist•TikTok Data Scientist•TikTok Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.