PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Meta

Evaluate and Experiment with Harmful Content Detection Model

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competence in machine learning model evaluation and online experiment design for content moderation, testing skills such as handling class imbalance, probabilistic scoring and calibration, threshold selection, slice-based robustness checks, and production experiment planning.

  • medium
  • Meta
  • Machine Learning
  • Data Scientist

Evaluate and Experiment with Harmful Content Detection Model

Company: Meta

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

##### Scenario The platform has a machine-learning model that automatically detects harmful content and flags it for removal or down-ranking. ##### Question Describe how you would evaluate this detection model offline (e.g., on a labeled validation set). Design an online experiment to test the model in production—define hypotheses, variants, success metrics, and guardrails. ##### Hints Cover precision/recall, ROC/PR curves, calibration offline; for online, outline A/B setup, traffic split, primary and guardrail metrics, duration, and significance.

Quick Answer: This question evaluates a candidate's competence in machine learning model evaluation and online experiment design for content moderation, testing skills such as handling class imbalance, probabilistic scoring and calibration, threshold selection, slice-based robustness checks, and production experiment planning.

Related Interview Questions

  • Design and evaluate an ads ranking algorithm - Meta (easy)
  • How would you design a Shop Ads ranking algorithm? - Meta (easy)
  • Derive Linear Regression Solution - Meta (medium)
  • Explain key ML metrics and techniques - Meta (medium)
  • Design an ad recommendation ranking approach - Meta (easy)
Meta logo
Meta
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Machine Learning
71
0

Evaluating a Harmful-Content Detection Model: Offline and Online

Context

You are given a binary classification model that detects harmful content in a social platform and flags items for either removal or down‑ranking. You need to:

  • Evaluate the model offline on a labeled validation set.
  • Design an online experiment to test the model in production.

Assume class imbalance (harmful content is rare), probabilistic model outputs (scores), and that some actions (auto‑remove) can prevent us from observing true labels unless we design around it.

Tasks

  1. Offline evaluation (labeled validation set):
    • Define and compute core metrics (precision, recall, FPR, ROC/PR curves, AUCs).
    • Assess calibration and choose an operating threshold given policy and cost trade‑offs.
    • Check robustness across slices (e.g., language/region) and over time.
  2. Online experiment design:
    • State hypotheses.
    • Define variants (control vs. treatment), including any shadow/canary ramps.
    • Specify randomization unit, traffic split, duration, and significance plan.
    • Define primary success metrics and guardrails (safety, engagement, fairness, latency).
    • Address measurement challenges (delayed/hidden labels due to enforcement).

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Meta•More Data Scientist•Meta Data Scientist•Meta Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.