PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/TikTok

Design robust metrics for a feature launch

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's ability to define robust A/B test metrics and guardrails, specify precise units of analysis, denominators, exposure and attribution windows, detect empty or accidental engagement via telemetry, and pre-register a valid analysis plan.

  • hard
  • TikTok
  • Analytics & Experimentation
  • Data Scientist

Design robust metrics for a feature launch

Company: TikTok

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

You are launching a new in-app Quick Reply feature in a messaging app and must define metrics and guardrails for a 1-week A/B test running 2025-08-25 to 2025-09-01. Be precise about denominators, units of analysis, and attribution windows. 1) Define the primary success metric (north-star) that captures meaningful user value from Quick Reply. Provide the exact formula, including: unit (user or user-day), numerator, denominator, inclusion criteria (e.g., exposure definition), and a 24-hour attribution rule from click to reply send. 2) Propose at least two guardrail metrics that protect long-term health (e.g., reply quality/abuse rate, churn, app crashes). For each, specify measurement unit, formula, and acceptable movement thresholds. 3) Suppose overall reply send rate increases, but average conversation length decreases and complaint rate from Ads-acquired users rises. Explain how you would segment and interpret the metrics to avoid Simpson’s paradox and decide whether to ship, hold, or iterate. 4) Define a metric to detect “empty engagement” (e.g., accidental taps or replies that are deleted before send). Describe how to implement it using existing telemetry and how it would influence the launch decision. 5) Lay out a pre-analysis plan: how you will pre-register primary/secondary metrics, define stopping rules, and prevent metric fishing. Include how you would validate that “exposed” truly means the user saw the Quick Reply entry point (not just was eligible).

Quick Answer: This question evaluates a data scientist's ability to define robust A/B test metrics and guardrails, specify precise units of analysis, denominators, exposure and attribution windows, detect empty or accidental engagement via telemetry, and pre-register a valid analysis plan.

Related Interview Questions

  • Define Ultra success metrics and detect suspicious transactions - TikTok (easy)
  • Plan DS approach for biker delivery project - TikTok (easy)
  • Define and critique a user activity metric - TikTok (easy)
  • Design and decompose Trust & Safety risk metrics - TikTok (easy)
  • Analyze promo anomaly and design risk guardrails - TikTok (Medium)
TikTok logo
TikTok
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
1
0

A/B Test Metrics and Guardrails for Quick Reply (1-week)

Context

You are adding a Quick Reply feature (suggested reply chips in the DM composer) to a messaging app and will run a 1-week A/B test from 2025-08-25 to 2025-09-01. Define the success metric and guardrails with precise denominators, units of analysis, and attribution windows.

Tasks

  1. Primary Success Metric (North Star)
    • Define a single primary metric that captures meaningful user value from Quick Reply.
    • Provide: unit of analysis (e.g., user or user-day), numerator, denominator, inclusion criteria (including a precise exposure definition), and a 24-hour attribution rule from click to reply send.
  2. Guardrail Metrics
    • Propose at least two guardrails to protect long-term health (e.g., reply quality/abuse rate, churn/retention, app crashes).
    • For each guardrail, specify: measurement unit, exact formula, and acceptable movement thresholds.
  3. Interpreting Mixed Signals
    • Suppose overall reply send rate increases, but average conversation length decreases and complaint rate from Ads-acquired users rises.
    • Explain how you would segment and interpret the metrics to avoid Simpson’s paradox, and decide whether to ship, hold, or iterate.
  4. Detecting Empty Engagement
    • Define a metric to detect accidental taps or replies deleted before send.
    • Describe how to implement it using existing telemetry and how it would influence the launch decision.
  5. Pre-analysis Plan
    • Describe how you will pre-register primary/secondary metrics, define stopping rules, and prevent metric fishing.
    • Include how you would validate that "exposed" truly means the user saw the Quick Reply entry point (not just was eligible).

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More TikTok•More Data Scientist•TikTok Data Scientist•TikTok Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.