Diagnose a sudden metric spike or drop
Company: TikTok
Role: Data Scientist
Category: Analytics & Experimentation
Difficulty: hard
Interview Round: Technical Screen
Your daily Checkout Conversion Rate (CCR) jumped from 3.2% to 4.5% on 2025-06-12 and stayed elevated for 3 days. CCR is defined as unique purchasers / unique sessions with at least one add_to_cart. Provide a concrete, step-by-step investigation plan to: 1) validate the movement is real vs. instrumentation, including exact sanity checks (e.g., event volume balance, missingness by logger version, null spikes, late-arriving data, bot/outlier filters), 2) localize the movement with a minimal set of cuts that can reveal Simpson’s paradox (device, app/web, geo, acquisition channel, experiment arms, release version, traffic vendor, new vs. returning users), 3) enumerate and rank at least five hypotheses (e.g., pricing change, shipping promo, ad mix shift, experiment ramp, search relevance tweak, fraud filter change), with a quick back-of-envelope impact estimate for each and the exact query or metric you would pull to confirm/refute it, 4) distinguish product-causal vs. mix-driven effects by proposing a counterfactual/holdout or synthetic control and the shortest path to compute it with existing logs, 5) decide next actions (rollback, continue, ramp) and guardrails with concrete thresholds (e.g., revenue/user, refund rate, support tickets), and 6) state the artifacts you would produce in the first 60 minutes (plots/tables) and the exact time-series tests or seasonality checks you would run to avoid false alarms.
Quick Answer: This question evaluates a data scientist's competency in anomaly investigation, metric instrumentation validation, causal inference and product analytics, including log-level querying, segmentation, hypothesis generation, and rapid impact estimation.