PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Statistics & Math/Meta

Model comment counts and detect anomalies

Last updated: Mar 29, 2026

Quick Overview

This question evaluates competency in statistical modeling of heavy-tailed count data, model selection and comparison, and the formulation of robust monitoring metrics for anomaly detection, testing both theoretical understanding and applied data-science skills.

  • hard
  • Meta
  • Statistics & Math
  • Data Scientist

Model comment counts and detect anomalies

Company: Meta

Role: Data Scientist

Category: Statistics & Math

Difficulty: hard

Interview Round: Onsite

Comment counts per post are heavy-tailed. You observe mean=4 and variance=50 at the post-day level. (a) Test whether a Poisson model is appropriate; if not, propose a negative binomial or discrete lognormal and outline parameter estimation. (b) Compare fits via likelihood ratio or Vuong tests and report which tail behavior each captures. (c) Define robust monitoring metrics (e.g., trimmed mean, P50, Gini) and control limits to detect manipulation (e.g., purchased comments). (d) Given a sudden 3× increase in the 99th percentile but stable P50, propose a rule to trigger investigation while minimizing false alarms.

Quick Answer: This question evaluates competency in statistical modeling of heavy-tailed count data, model selection and comparison, and the formulation of robust monitoring metrics for anomaly detection, testing both theoretical understanding and applied data-science skills.

Related Interview Questions

  • Compute probability an account is fake - Meta (easy)
  • Compute Bayes probability for fake accounts - Meta (easy)
  • Compute probabilities for chatbot response quality - Meta (easy)
  • Compute posterior fake probability using Bayes' rule - Meta (medium)
  • Estimate bots and CI from DAU spike - Meta (medium)
Meta logo
Meta
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
2
0

Modeling Heavy-Tailed Comment Counts and Robust Monitoring

You are analyzing daily comment counts at the post–day level. The distribution is heavy-tailed. From a recent period you observe:

  • Sample mean = 4
  • Sample variance = 50

Tasks:

(a) Test whether a Poisson model is appropriate. If not, propose an alternative (negative binomial or discrete lognormal/Poisson–lognormal) and outline how to estimate parameters.

(b) Compare model fits using likelihood-based tests (likelihood ratio for nested models; Vuong test for non-nested models). Explain which tail behavior each model captures.

(c) Define robust monitoring metrics (e.g., trimmed mean, P50/median, Gini) and specify control limits suitable for detecting manipulation (e.g., purchased comments) under heavy tails.

(d) Suppose the daily 99th percentile (P99) suddenly increases 3× while the median (P50) remains stable. Propose a practical rule to trigger investigation while minimizing false alarms.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Meta•More Data Scientist•Meta Data Scientist•Meta Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.