Evaluate and Experiment with Harmful Content Detection Model
Company: Meta
Role: Data Scientist
Category: Machine Learning
Difficulty: medium
Interview Round: Technical Screen
##### Scenario
The platform has a machine-learning model that automatically detects harmful content and flags it for removal or down-ranking.
##### Question
Describe how you would evaluate this detection model offline (e.g., on a labeled validation set). Design an online experiment to test the model in production—define hypotheses, variants, success metrics, and guardrails.
##### Hints
Cover precision/recall, ROC/PR curves, calibration offline; for online, outline A/B setup, traffic split, primary and guardrail metrics, duration, and significance.
Quick Answer: This question evaluates a candidate's competence in machine learning model evaluation and online experiment design for content moderation, testing skills such as handling class imbalance, probabilistic scoring and calibration, threshold selection, slice-based robustness checks, and production experiment planning.