PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Meta

Estimate Fake Accounts Using Data Signals and Sampling

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competency in designing analytics solutions to estimate the prevalence and absolute count of fake accounts using data signals, sampling strategies, statistical classification, and validation techniques.

  • hard
  • Meta
  • Analytics & Experimentation
  • Data Scientist

Estimate Fake Accounts Using Data Signals and Sampling

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Onsite

##### Scenario Social media company wants to estimate the number of fake accounts on the platform. ##### Question How would you design an analytics approach to estimate the proportion or absolute count of fake accounts on Facebook? What data signals, sampling strategy, and validation method would you use? ##### Hints Consider random sampling, supervised classification, manual labeling, capture-recapture, and confidence intervals.

Quick Answer: This question evaluates a candidate's competency in designing analytics solutions to estimate the prevalence and absolute count of fake accounts using data signals, sampling strategies, statistical classification, and validation techniques.

Related Interview Questions

  • Measure scheduled posts feature success - Meta (medium)
  • Estimate ads ranking revenue impact - Meta (medium)
  • How should you evaluate unconnected content? - Meta (medium)
  • Should WhatsApp launch group calls? - Meta (medium)
  • How would you grow Meta products? - Meta (medium)
Meta logo
Meta
Aug 4, 2025, 10:55 AM
Data Scientist
Onsite
Analytics & Experimentation
3
0

Estimating Fake Accounts on a Social Network

Background

A large social platform wants to estimate the proportion and absolute count of fake accounts on the service. "Fake" includes accounts that are inauthentic (bots, impersonations, coordinated inauthentic behavior), excluding clearly legitimate users. Estimates should be produced for a defined time window (e.g., monthly) and a defined population (e.g., all accounts, or active accounts in the last 30 days).

Task

Design an analytics approach to estimate the prevalence (percentage) and count of fake accounts on Facebook. Specify:

  1. Data signals/features you would use.
  2. A sampling strategy for labeling and estimation.
  3. A modeling approach to classify fakes and estimate prevalence.
  4. A validation plan and how you would compute confidence intervals.

Hints: Consider random/stratified sampling, supervised classification with manual labeling, capture–recapture, and confidence intervals.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Meta•More Data Scientist•Meta Data Scientist•Meta Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.