PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Statistics & Math/Google

Compute precision under noisy annotators

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of statistical performance metrics and label-noise propagation by requiring computation of precision, recall, and F1 for binary labels under known annotator sensitivity/specificity and majority-vote aggregation, in the Statistics & Math domain.

  • medium
  • Google
  • Statistics & Math
  • Data Scientist

Compute precision under noisy annotators

Company: Google

Role: Data Scientist

Category: Statistics & Math

Difficulty: medium

Interview Round: Technical Screen

Two independent annotators review videos. For truly illegal videos, each annotator labels “illegal” with sensitivity s = 0.80; for legal videos, each says “illegal” with false‑positive rate f = 0.02 (specificity 0.98). The base rate of illegal videos is p = 0.01. - If you flag a video as illegal when at least one annotator says “illegal”, compute the precision (PPV) and recall (TPR). - If you instead require both annotators to say “illegal”, compute PPV and TPR. - Which policy yields the higher F1? Show calculations. Generalize formulas for n annotators with majority vote and arbitrary p, s, and f.

Quick Answer: This question evaluates understanding of statistical performance metrics and label-noise propagation by requiring computation of precision, recall, and F1 for binary labels under known annotator sensitivity/specificity and majority-vote aggregation, in the Statistics & Math domain.

Related Interview Questions

  • Estimate weather’s effect on mental health - Google (easy)
  • Explain Bootstrap and Statistical Inference - Google (hard)
  • Explain Bootstrap and Prove Uniformity - Google (hard)
  • Can bootstrap help reduce variance - Google (medium)
  • Estimate population singletons from a 10% log - Google (Medium)
Google logo
Google
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Statistics & Math
4
0
Loading...

Two-Annotator Labeling Policy: Precision, Recall, F1, and Generalization

You have two independent annotators who review videos and label them as "illegal" or "legal."

  • Sensitivity (true positive rate per annotator): s = 0.80
  • False positive rate per annotator: f = 0.02 (so specificity = 0.98)
  • Base rate (prevalence of illegal videos): p = 0.01

Policies to evaluate:

  1. At-least-one (OR): Flag a video as illegal if at least one annotator says "illegal."
  2. Both (AND): Flag a video as illegal only if both annotators say "illegal."

Tasks:

  1. For the OR policy, compute precision (PPV) and recall (TPR).
  2. For the AND policy, compute PPV and TPR.
  3. Compute the F1 score for each and state which policy has the higher F1. Show your calculations.
  4. Generalize formulas for n annotators with majority vote (threshold t = ceil((n+1)/2)) and arbitrary p, s, f.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Statistics & Math•More Google•More Data Scientist•Google Data Scientist•Google Statistics & Math•Data Scientist Statistics & Math
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.