PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Statistics & Math/Meta

Analyze daily comments distribution and sampling

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in statistical inference, descriptive statistics (mean, median, percentiles), sampling theory and the Central Limit Theorem within the Statistics & Math domain for data scientist interviews.

  • medium
  • Meta
  • Statistics & Math
  • Data Scientist

Analyze daily comments distribution and sampling

Company: Meta

Role: Data Scientist

Category: Statistics & Math

Difficulty: medium

Interview Round: Onsite

You are given daily counts of comments per active user. (1) Define and compute mean, median, and 95th percentile (P95) for a day; write explicit formulas, including how to compute P95 from a sorted empirical distribution. (2) Explain the Central Limit Theorem (CLT) and why the sampling distribution of the sample mean tends toward normality; specify required conditions. (3) If you repeatedly sample 200 independent simple random samples of size n from the same day’s user-level comments, what n is sufficient for the sample mean to be approximately normal, and how does the sampling distribution’s variance change with n? (4) Show how the plots of daily mean, median, and P95 change as n increases, and why. (5) If comments/user increased week-over-week, list at least three plausible statistical or product reasons and how you would test each.

Quick Answer: This question evaluates proficiency in statistical inference, descriptive statistics (mean, median, percentiles), sampling theory and the Central Limit Theorem within the Statistics & Math domain for data scientist interviews.

Related Interview Questions

  • Compute probability an account is fake - Meta (easy)
  • Compute Bayes probability for fake accounts - Meta (easy)
  • Compute probabilities for chatbot response quality - Meta (easy)
  • Compute posterior fake probability using Bayes' rule - Meta (medium)
  • Estimate bots and CI from DAU spike - Meta (medium)
Meta logo
Meta
Oct 13, 2025, 9:49 PM
Data Scientist
Onsite
Statistics & Math
3
0

Daily Comments per Active User: Sampling and Inference

You have, for a given day d, the count of comments made by each active user. Let there be m active users on day d, and let X_i be the number of comments for user i (i = 1, 2, …, m). Answer the following:

  1. Define and compute the daily mean, median, and 95th percentile (P95). Provide explicit formulas, including how to compute P95 from a sorted empirical distribution.
  2. Explain the Central Limit Theorem (CLT) and why the sampling distribution of the sample mean tends toward normality. State the required conditions.
  3. Suppose you repeatedly take 200 independent simple random samples of size n from the same day’s user-level comments. What n is sufficient for the sample mean to be approximately normal, and how does the sampling distribution’s variance change with n?
  4. Describe how the plots of the sampling distributions of the daily mean, median, and P95 evolve as n increases, and explain why.
  5. If comments per user increased week-over-week, list at least three plausible statistical or product reasons and how you would test each.

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Meta•More Data Scientist•Meta Data Scientist•Meta Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.