PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Analytics & Experimentation/Meta

Design metrics and experiment for stolen-post detection

Last updated: Mar 29, 2026

Quick Overview

Evaluates skills in metrics design, diagnostic analysis, and online experiment methodology within Analytics & Experimentation for a Data Scientist position, with a product-level focus on policy-sensitive detection and measurement challenges.

  • easy
  • Meta
  • Analytics & Experimentation
  • Data Scientist

Design metrics and experiment for stolen-post detection

Company: Meta

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: easy

Interview Round: Technical Screen

You work on **Stolen Post Detection** for a social platform (detecting content that is copied/reposted without permission). A new detection algorithm is proposed (e.g., a model producing a stolen-probability score used to downrank, label, or block posts). ## Questions 1) **Problem framing & diagnostics** - What are the key failure modes and risks (false positives vs false negatives) for stolen-post detection? - If stakeholders report “stolen posts are down,” what would you check to validate whether this is real vs an artifact (measurement issues, reporting changes, seasonality, policy changes, spam shifts, etc.)? 2) **Metrics** Propose: - **Primary success metric(s)** (what you ultimately want to improve) - **Diagnostic metrics** (to understand why things moved) - **Guardrail metrics** (to prevent harm) Include at least one metric that handles delayed / noisy ground truth (since “stolen” labels may come from user reports, manual review, or appeals). 3) **Experiment design** Design an online experiment (A/B test or alternative) to evaluate the new algorithm. Address: - Randomization unit (post-level vs author-level vs viewer-level) and why - Interference / network effects (e.g., copied content affects multiple creators) - Exposure definition (who is affected by the change) - Sample size / power considerations at a high level (what drives variance) - Ramp plan and decision criteria 4) **Tradeoffs and decision** If offline metrics improve (e.g., higher precision/recall on labeled data) but online engagement drops, how would you decide what to launch and what follow-ups you’d run?

Quick Answer: Evaluates skills in metrics design, diagnostic analysis, and online experiment methodology within Analytics & Experimentation for a Data Scientist position, with a product-level focus on policy-sensitive detection and measurement challenges.

Related Interview Questions

  • Measure scheduled posts feature success - Meta (medium)
  • Estimate ads ranking revenue impact - Meta (medium)
  • How should you evaluate unconnected content? - Meta (medium)
  • Should WhatsApp launch group calls? - Meta (medium)
  • How would you grow Meta products? - Meta (medium)
Meta logo
Meta
Dec 18, 2025, 12:00 AM
Data Scientist
Technical Screen
Analytics & Experimentation
12
0
Loading...

You work on Stolen Post Detection for a social platform (detecting content that is copied/reposted without permission).

A new detection algorithm is proposed (e.g., a model producing a stolen-probability score used to downrank, label, or block posts).

Questions

  1. Problem framing & diagnostics
    • What are the key failure modes and risks (false positives vs false negatives) for stolen-post detection?
    • If stakeholders report “stolen posts are down,” what would you check to validate whether this is real vs an artifact (measurement issues, reporting changes, seasonality, policy changes, spam shifts, etc.)?
  2. Metrics Propose:
    • Primary success metric(s) (what you ultimately want to improve)
    • Diagnostic metrics (to understand why things moved)
    • Guardrail metrics (to prevent harm)
    Include at least one metric that handles delayed / noisy ground truth (since “stolen” labels may come from user reports, manual review, or appeals).
  3. Experiment design Design an online experiment (A/B test or alternative) to evaluate the new algorithm. Address:
    • Randomization unit (post-level vs author-level vs viewer-level) and why
    • Interference / network effects (e.g., copied content affects multiple creators)
    • Exposure definition (who is affected by the change)
    • Sample size / power considerations at a high level (what drives variance)
    • Ramp plan and decision criteria
  4. Tradeoffs and decision If offline metrics improve (e.g., higher precision/recall on labeled data) but online engagement drops, how would you decide what to launch and what follow-ups you’d run?

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Meta•More Data Scientist•Meta Data Scientist•Meta Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.