PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Meta

Identify Fake Accounts Using Machine Learning Techniques

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in designing end-to-end machine learning systems for detecting fake accounts, covering problem framing, feature engineering across signals and time windows, modeling choices (supervised, unsupervised, semi/weakly-supervised, graph-based), evaluation metrics and A/B testing, monitoring for data/concept drift, and quantifying business impact. It is commonly asked to assess the ability to balance precision–recall and the business costs of false positives versus false negatives, and falls under the Machine Learning category with a mix of conceptual understanding and practical application.

  • hard
  • Meta
  • Machine Learning
  • Data Scientist

Identify Fake Accounts Using Machine Learning Techniques

Company: Meta

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Onsite

##### Scenario Detecting fake accounts on a social-commerce platform ##### Question How would you identify fake accounts in our user base? What features and modeling approach would you use to build a classifier? How would you evaluate the model’s performance and monitor drift? What is the impact of fake users on the overall ecosystem (buyers, sellers, ads)? We found that 3% of accounts are fake—what conclusions or next steps does this number suggest? ##### Hints Discuss feature engineering, supervised vs. unsupervised methods, precision-recall trade-offs, A/B testing for business impact.

Quick Answer: This question evaluates a data scientist's competency in designing end-to-end machine learning systems for detecting fake accounts, covering problem framing, feature engineering across signals and time windows, modeling choices (supervised, unsupervised, semi/weakly-supervised, graph-based), evaluation metrics and A/B testing, monitoring for data/concept drift, and quantifying business impact. It is commonly asked to assess the ability to balance precision–recall and the business costs of false positives versus false negatives, and falls under the Machine Learning category with a mix of conceptual understanding and practical application.

Related Interview Questions

  • Design and evaluate an ads ranking algorithm - Meta (easy)
  • How would you design a Shop Ads ranking algorithm? - Meta (easy)
  • Derive Linear Regression Solution - Meta (medium)
  • Explain key ML metrics and techniques - Meta (medium)
  • Design an ad recommendation ranking approach - Meta (easy)
Meta logo
Meta
Aug 4, 2025, 10:55 AM
Data Scientist
Onsite
Machine Learning
101
0

Scenario

You are a data scientist at a social‑commerce platform responsible for trust and safety. You need to design a system to detect and mitigate fake accounts (bots, spam, fraud, coordinated inauthentic behavior) while minimizing friction for real users.

Question

  1. Problem framing: How would you define "fake accounts" and decide on actions (auto-block, soft friction, human review)? What are the business costs of false positives vs. false negatives?
  2. Features: What signals would you engineer (signup, device/network, behavior, content, social graph, marketplace/ads) and over what time windows? How would you handle cold start?
  3. Modeling: Would you use supervised, unsupervised, semi/weakly-supervised, or graph-based methods? Describe a classifier architecture you would deploy at scale.
  4. Evaluation: How would you evaluate performance (metrics, splits, PR trade-offs), choose thresholds for different actions, and run A/B tests to measure business impact?
  5. Monitoring and drift: How would you monitor data/concept drift, recalibrate, and retrain? What guardrails would you put in place?
  6. Business impact: What harms do fake users cause across buyers, sellers, and ads ecosystems?
  7. Interpreting prevalence: If 3% of accounts are fake, what conclusions or next steps does that suggest?

Hints: Discuss feature engineering, supervised vs. unsupervised methods, precision–recall trade-offs, and A/B testing for business impact.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Meta•More Data Scientist•Meta Data Scientist•Meta Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.