PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Statistics & Math/Meta

Estimate bots and CI from DAU spike

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in mixture modeling for anomaly detection, parametric and nonparametric inference for mean differences, handling overdispersed count data, and bootstrap resampling under heavy-tailed distributions.

  • medium
  • Meta
  • Statistics & Math
  • Data Scientist

Estimate bots and CI from DAU spike

Company: Meta

Role: Data Scientist

Category: Statistics & Math

Difficulty: medium

Interview Round: Onsite

Daily comments spiked by 20% on day T for a product with DAU = 2,000,000. Historically, per-user comments are ~5.5 on normal days; on day T the mean is 6.6. Suspect bots averaging 500 comments each that day. 1) Under a simple two-component mixture (humans unchanged at 5.5, bots at 500), estimate the number and fraction of bot accounts present on day T; show formulas and a numeric result. 2) If the human mean also drifted to 5.8 that day, re-estimate the bot count. 3) From the day-T DAU, two independent 1% simple random samples are drawn. Compute a 95% confidence interval for the difference of their sample means (Sample A mean minus Sample B mean): (a) assuming per-user comments are Poisson with mean 6.6; (b) assuming Negative Binomial with mean 6.6 and dispersion k = 2. State any approximations (e.g., CLT) and justify them. 4) If the true distribution is heavy-tailed with ~1% of humans having >50 comments, discuss when the normal approximation breaks and outline a bootstrap procedure for the CI. 5) Briefly critique the initial normality assumption and propose a quick diagnostic using only sample aggregates (n, mean, variance).

Quick Answer: This question evaluates proficiency in mixture modeling for anomaly detection, parametric and nonparametric inference for mean differences, handling overdispersed count data, and bootstrap resampling under heavy-tailed distributions.

Related Interview Questions

  • Compute probability an account is fake - Meta (easy)
  • Compute Bayes probability for fake accounts - Meta (easy)
  • Compute probabilities for chatbot response quality - Meta (easy)
  • Compute posterior fake probability using Bayes' rule - Meta (medium)
  • Estimate Portal’s causal lift on video-call usage - Meta (Medium)
Meta logo
Meta
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
7
0
Loading...

Mixture Spike and Mean-Difference Inference for Daily Comments

Context

A product has DAU (daily active users) = 2,000,000. On day T, total comments increased by 20%. Historically, the per-user mean is 5.5 comments; on day T, the mean is 6.6 (= 1.2 × 5.5). You suspect a small fraction of bot accounts that each posted around 500 comments on day T.

Tasks

  1. Two-component mixture model: assume humans remained at 5.5 comments/user and bots posted 500 comments/user. Estimate the number and fraction of bot accounts on day T. Show formulas and a numeric result.
  2. Re-estimate if the human mean drifted up to 5.8 on day T (bots still at 500).
  3. From day-T DAU, two independent simple random samples (SRS) of size 1% are drawn (Sample A and Sample B). Compute a 95% confidence interval (CI) for the difference of sample means, mean(A) − mean(B): a) assuming per-user comments are Poisson with mean 6.6. b) assuming Negative Binomial with mean 6.6 and dispersion k = 2 (use Var = μ + μ²/k). State and justify approximations (e.g., CLT). You may comment on finite population correction (FPC) if relevant.
  4. If the true distribution is heavy-tailed with ~1% of humans having >50 comments, discuss when the normal approximation can break and outline a nonparametric bootstrap procedure to obtain a CI for mean(A) − mean(B).
  5. Briefly critique assuming normality and propose a quick diagnostic using only sample aggregates (n, mean, variance).

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Meta•More Data Scientist•Meta Data Scientist•Meta Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.