PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Meta

Evaluate Classifier with Precision, Recall, and Fairness Metrics

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in designing an offline evaluation framework for binary machine-learning classifiers, covering selection of ranking and operating-point metrics, calibration and class-imbalance handling, ground-truth labeling protocols, thresholding under asymmetric costs and capacity constraints, and subgroup fairness analyses; it is in the Machine Learning domain and emphasizes practical application of evaluation and ML systems design. It is commonly asked in technical interviews because it probes both conceptual understanding of statistical and fairness trade-offs and the ability to translate business constraints into measurable evaluation criteria, testing applied reasoning rather than purely theoretical knowledge.

  • medium
  • Meta
  • Machine Learning
  • Data Scientist

Evaluate Classifier with Precision, Recall, and Fairness Metrics

Company: Meta

Role: Data Scientist

Category: Machine Learning

Difficulty: medium

Interview Round: Technical Screen

##### Scenario Data science team must offline-evaluate a classifier that labels videos as harmful. ##### Question What offline evaluation framework would you use? Detail suitable metrics (e.g., precision, recall, PR-AUC), handling class imbalance, ground-truth collection, threshold selection based on business costs, and potential fairness checks. ##### Hints Discuss label skew, cost of false positives vs negatives, and calibration.

Quick Answer: This question evaluates proficiency in designing an offline evaluation framework for binary machine-learning classifiers, covering selection of ranking and operating-point metrics, calibration and class-imbalance handling, ground-truth labeling protocols, thresholding under asymmetric costs and capacity constraints, and subgroup fairness analyses; it is in the Machine Learning domain and emphasizes practical application of evaluation and ML systems design. It is commonly asked in technical interviews because it probes both conceptual understanding of statistical and fairness trade-offs and the ability to translate business constraints into measurable evaluation criteria, testing applied reasoning rather than purely theoretical knowledge.

Related Interview Questions

  • Implement 1NN Embeddings and Forward Pass - Meta (hard)
  • Design and evaluate an ads ranking algorithm - Meta (easy)
  • How would you design a Shop Ads ranking algorithm? - Meta (easy)
  • Derive Linear Regression Solution - Meta (medium)
  • Explain key ML metrics and techniques - Meta (medium)
Meta logo
Meta
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Machine Learning
2
0

Offline Evaluation Framework for a Harmful-Content Video Classifier

Context

You are evaluating a binary classifier that assigns each video a score (interpretable as the probability it violates a harmful-content policy). Harmful videos are rare (label skew). The business incurs different costs for false positives (over-blocking/over-review) and false negatives (missed harm). Moderation capacity may also be limited.

Task

Design an offline evaluation plan that covers:

  1. Metrics
  • Which ranking and operating-point metrics to report (e.g., precision, recall, PR-AUC), including calibration metrics.
  1. Class Imbalance
  • How to evaluate meaningfully under severe label skew and when the evaluation sample is not a simple random draw (e.g., stratified by model score).
  1. Ground-Truth Collection
  • How to collect high-quality labels for harmful content, including rater setup, agreement, sampling, and quality controls.
  1. Threshold Selection and Business Costs
  • How to choose a decision threshold given asymmetric costs of false positives vs. false negatives and potential moderation capacity constraints.
  1. Fairness Checks
  • What subgroup analyses and fairness metrics to run to guard against disparate impact.

Include assumptions where necessary, and provide formulas and examples for thresholding and weighting.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Meta•More Data Scientist•Meta Data Scientist•Meta Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.