PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/TikTok

Design Metrics for Content Moderation and Chatbot Evaluation

Last updated: Mar 29, 2026

Quick Overview

This question evaluates data science competencies in metric design and experiment methodology, including sensitivity-aware A/B testing, user-centric outcome selection, and offline/online evaluation for content moderation and chatbot knowledge bases within the Analytics & Experimentation domain.

  • medium
  • TikTok
  • Analytics & Experimentation
  • Data Scientist

Design Metrics for Content Moderation and Chatbot Evaluation

Company: TikTok

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: medium

Interview Round: Technical Screen

##### Scenario Trust & Safety data science: design metrics for a content-moderation A/B test and for evaluating a customer-service chatbot’s knowledge base. ##### Question In a content-moderation A/B test where harmful-content prevalence is low, which short-term, user-centric metrics would you track to detect impact quickly and why? How would you design an experiment and select evaluation metrics to measure the quality and usefulness of a customer-service chatbot’s knowledge base? ##### Hints Consider immediate user actions: report rates, dismissals, session exits, latency; for chatbot, precision/recall of answers, deflection rate, CSAT; discuss experiment design and trade-offs.

Quick Answer: This question evaluates data science competencies in metric design and experiment methodology, including sensitivity-aware A/B testing, user-centric outcome selection, and offline/online evaluation for content moderation and chatbot knowledge bases within the Analytics & Experimentation domain.

Related Interview Questions

  • Define Ultra success metrics and detect suspicious transactions - TikTok (easy)
  • Plan DS approach for biker delivery project - TikTok (easy)
  • Define and critique a user activity metric - TikTok (easy)
  • Design and decompose Trust & Safety risk metrics - TikTok (easy)
  • Analyze promo anomaly and design risk guardrails - TikTok (Medium)
TikTok logo
TikTok
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Analytics & Experimentation
1
0

Scenario

Trust & Safety data science: You are asked to design metrics for two situations: (1) a content‑moderation A/B test where harmful‑content prevalence is low, and (2) evaluation of a customer‑service chatbot’s knowledge base.

Task

  1. Content moderation A/B test (low prevalence): Which short‑term, user‑centric metrics would you track to detect impact quickly, and why? Describe how you would set up the experiment to ensure sensitivity and guardrails.
  2. Chatbot knowledge base: How would you design an experiment and choose evaluation metrics to measure the quality and usefulness of the chatbot’s knowledge base? Cover both offline and online evaluation, and discuss trade‑offs.

Hints

  • Consider immediate user actions (e.g., report rates, dismissals, session exits, latency).
  • For chatbot, consider answer precision/recall, deflection/containment, CSAT, time to resolution.
  • Discuss experiment design (randomization unit, triggering, guardrails) and trade‑offs.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More TikTok•More Data Scientist•TikTok Data Scientist•TikTok Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.