PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/Analytics & Experimentation/TikTok

Diagnose a sudden metric spike or drop

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's competency in anomaly investigation, metric instrumentation validation, causal inference and product analytics, including log-level querying, segmentation, hypothesis generation, and rapid impact estimation.

  • hard
  • TikTok
  • Analytics & Experimentation
  • Data Scientist

Diagnose a sudden metric spike or drop

Company: TikTok

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

Your daily Checkout Conversion Rate (CCR) jumped from 3.2% to 4.5% on 2025-06-12 and stayed elevated for 3 days. CCR is defined as unique purchasers / unique sessions with at least one add_to_cart. Provide a concrete, step-by-step investigation plan to: 1) validate the movement is real vs. instrumentation, including exact sanity checks (e.g., event volume balance, missingness by logger version, null spikes, late-arriving data, bot/outlier filters), 2) localize the movement with a minimal set of cuts that can reveal Simpson’s paradox (device, app/web, geo, acquisition channel, experiment arms, release version, traffic vendor, new vs. returning users), 3) enumerate and rank at least five hypotheses (e.g., pricing change, shipping promo, ad mix shift, experiment ramp, search relevance tweak, fraud filter change), with a quick back-of-envelope impact estimate for each and the exact query or metric you would pull to confirm/refute it, 4) distinguish product-causal vs. mix-driven effects by proposing a counterfactual/holdout or synthetic control and the shortest path to compute it with existing logs, 5) decide next actions (rollback, continue, ramp) and guardrails with concrete thresholds (e.g., revenue/user, refund rate, support tickets), and 6) state the artifacts you would produce in the first 60 minutes (plots/tables) and the exact time-series tests or seasonality checks you would run to avoid false alarms.

Quick Answer: This question evaluates a data scientist's competency in anomaly investigation, metric instrumentation validation, causal inference and product analytics, including log-level querying, segmentation, hypothesis generation, and rapid impact estimation.

Related Interview Questions

  • Define Ultra success metrics and detect suspicious transactions - TikTok (easy)
  • Plan DS approach for biker delivery project - TikTok (easy)
  • Define and critique a user activity metric - TikTok (easy)
  • Design and decompose Trust & Safety risk metrics - TikTok (easy)
  • Analyze promo anomaly and design risk guardrails - TikTok (Medium)
TikTok logo
TikTok
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
2
0

Investigate a 3-Day Jump in Checkout Conversion Rate (CCR)

Context

On 2025-06-12, the daily Checkout Conversion Rate (CCR) increased from 3.2% to 4.5% and stayed elevated for 3 days (through 2025-06-14). CCR is defined as:

CCR = unique purchasers / unique sessions with at least one add_to_cart

Assume we can query event logs and order/payment ledgers with fields like: timestamp, event_name, user_id, session_id, order_id, platform (app/web), device (iOS/Android/Desktop), app_version, logger_version, geo_country, acquisition_channel, experiment_id/variant, traffic_vendor_id, is_new_user, price, currency.

Task

Provide a concrete, step-by-step investigation plan to:

  1. Validate the movement is real vs. instrumentation, including exact sanity checks (event volume balance, missingness by logger version, null spikes, late-arriving data, bot/outlier filters, deduping).
  2. Localize the movement with a minimal set of cuts that can reveal Simpson’s paradox (device, app/web, geo, acquisition channel, experiment arms, release version, traffic vendor, new vs. returning users).
  3. Enumerate and rank at least five hypotheses (e.g., pricing change, shipping promo, ad mix shift, experiment ramp, search relevance tweak, fraud filter change), with a quick back-of-envelope impact estimate for each and the exact query or metric you would pull to confirm/refute it.
  4. Distinguish product-causal vs. mix-driven effects by proposing a counterfactual/holdout or synthetic control and the shortest path to compute it with existing logs.
  5. Decide next actions (rollback, continue, ramp) and guardrails with concrete thresholds (e.g., revenue/user, refund rate, support tickets).
  6. State the artifacts you would produce in the first 60 minutes (plots/tables) and the exact time-series tests or seasonality checks you would run to avoid false alarms.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More TikTok•More Data Scientist•TikTok Data Scientist•TikTok Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.