PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Statistics & Math/TikTok

Model overdispersed counts; estimate treatment lift

Last updated: Mar 29, 2026

Quick Overview

This question evaluates modeling and inference for overdispersed, zero‑inflated count data, including estimation of treatment lift (rate ratios), dispersion assessment, standard error quantification, cluster-robust inference, bootstrap resampling, and multiple-comparison correction.

  • Medium
  • TikTok
  • Statistics & Math
  • Data Scientist

Model overdispersed counts; estimate treatment lift

Company: TikTok

Role: Data Scientist

Category: Statistics & Math

Difficulty: Medium

Interview Round: Onsite

Weekly posts per creator are overdispersed and zero‑inflated. In a creator‑level randomized test of a nudge: - Control: n_c=40,000 creators, total posts=72,000 (mean=1.8) - Treatment: n_t=40,000 creators, total posts=75,600 (mean=1.89) - Historical control variance per creator s_c^2≈6.5 (suggesting overdispersion). Answer: 1) Choose an appropriate model (e.g., Negative Binomial with log link). Using var( Y ) = μ + μ^2/k, estimate k from the control statistics and compute the estimated log rate ratio, its standard error, and a 95% CI for the treatment lift. 2) If you instead used a Poisson model, quantify the expected underestimation of SE relative to the NB and discuss when that would inflate Type I error. 3) Outline a cluster‑robust approach if randomization had been by geo (state/clusters), and a nonparametric bootstrap you’d trust here. Be explicit about the resampling unit and how you’d construct the CI for the rate ratio. 4) Given meaningful heterogeneity by creator tenure, propose a pre‑specified analysis (e.g., stratified NB or interaction terms) and how you’d correct for multiple comparisons across 10 geos (e.g., BH‑FDR).

Quick Answer: This question evaluates modeling and inference for overdispersed, zero‑inflated count data, including estimation of treatment lift (rate ratios), dispersion assessment, standard error quantification, cluster-robust inference, bootstrap resampling, and multiple-comparison correction.

Related Interview Questions

  • Explain Type I/II errors vs precision/recall - TikTok (easy)
  • Compute cluster-aware significance and sequential corrections - TikTok (medium)
  • Decide if subgroup increases imply overall increase - TikTok (medium)
  • Control confounding in observational ad lift - TikTok (hard)
  • Act when A/B result is not significant - TikTok (hard)
TikTok logo
TikTok
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
2
0
Loading...

Weekly posts per creator are overdispersed and zero‑inflated. In a creator‑level randomized test of a nudge:

  • Control: n_c=40,000 creators, total posts=72,000 (mean=1.8)
  • Treatment: n_t=40,000 creators, total posts=75,600 (mean=1.89)
  • Historical control variance per creator s_c^2≈6.5 (suggesting overdispersion).

Answer:

  1. Choose an appropriate model (e.g., Negative Binomial with log link). Using var( Y ) = μ + μ^2/k, estimate k from the control statistics and compute the estimated log rate ratio, its standard error, and a 95% CI for the treatment lift.
  2. If you instead used a Poisson model, quantify the expected underestimation of SE relative to the NB and discuss when that would inflate Type I error.
  3. Outline a cluster‑robust approach if randomization had been by geo (state/clusters), and a nonparametric bootstrap you’d trust here. Be explicit about the resampling unit and how you’d construct the CI for the rate ratio.
  4. Given meaningful heterogeneity by creator tenure, propose a pre‑specified analysis (e.g., stratified NB or interaction terms) and how you’d correct for multiple comparisons across 10 geos (e.g., BH‑FDR).

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More TikTok•More Data Scientist•TikTok Data Scientist•TikTok Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.