Bytedance Data Scientist Interview Questions
ByteDance Data Scientist interview questions often focus on a blend of practical coding, product-minded analysis, and experimental/statistical reasoning. What’s distinctive about ByteDance interviews is the strong emphasis on metrics and impact: expect SQL and Python problems that test data manipulation and algorithmic clarity, ML/statistics questions that probe experimental design and A/B testing intuition, and product-case prompts that evaluate how you translate data into prioritized business actions. The process is frequently fast-paced and can include an online assessment, several technical rounds, a case or take‑home exercise, and behavioral/hiring‑manager conversations that probe collaboration and stakeholder communication. For effective interview preparation, prioritize clean, reproducible SQL and Python solutions, review core ML concepts and assumptions, and practice end‑to‑end case studies that connect analysis to product decisions. Prepare concise stories that quantify your impact and explain tradeoffs, and rehearse whiteboard or presentation-style walkthroughs of a past project. Time-boxed mock interviews and targeted review of rolling metrics, experiment validity, and model evaluation will help you confidently answer ByteDance Data Scientist interview questions and demonstrate both technical depth and product judgment.
Maximize watched duration under consecutive-sum limit
You have a list of videos in a feed. Video i has duration d[i] (positive integer). A user has an “attention span” limit A. You want to select a subset...
Design and decompose Trust & Safety risk metrics
You are a Data Scientist in a Trust & Safety team for a short-video platform (similar to TikTok/Reels). The team asks: “How would you design risk metr...
Write monthly customer and sales SQL queries
You are analyzing a food-delivery marketplace. Tables Assume the following schema (you may add minor helper CTEs as needed): orders - order_id (BIGINT...
When prioritize precision vs recall
Context You are working on a product team and building (or evaluating) a binary classifier that triggers an action (e.g., show a warning, block conten...
Analyze shopping funnel with joins and windows
Write SQL (PostgreSQL) to analyze a 4-step shopping funnel: view_product → add_to_cart → checkout_start → purchase. Use the schema and sample data bel...
Show ownership in ambiguous creator-growth work
Describe a time you owned an ambiguous growth problem for creators end‑to‑end. Pick one project and cover: 1) the exact business goal and why it matte...
How would you manage precision/recall for fraud detection?
Scenario You own (or significantly contribute to) a production fraud detection system that flags transactions/users as fraud vs legit. - The model out...
Plan DS approach for biker delivery project
You are a Data Scientist supporting a “biker” (delivery rider) product/project for a food-delivery platform. An interviewer gives only a short descrip...
Demonstrate leadership in cross-functional disagreement
Behavioral & Leadership (HR Screen, Data Scientist) Prompt Describe a time you disagreed with a partner team (e.g., product pushing for more aggressiv...
Compute cluster-aware significance and sequential corrections
Cluster-Randomized Tipping UI Experiment: Power, Sequential Testing, and Multiplicity Context: A creator-level (cluster) randomized experiment evaluat...
Design an interference-robust A/B test for monetization
A/B Test Design: New Tipping UI on Creator Posts Context: You are launching a new tipping UI on creator (PGC/OGC) posts to increase creator monetizati...
Rank factors for TikTok market entry
TikTok Market Z Launch Decision Framework (Q4 2025) Context You are a data scientist evaluating whether TikTok should launch in a new country (Market ...
Diagnose a sudden metric spike or drop
Investigate a 3-Day Jump in Checkout Conversion Rate (CCR) Context On 2025-06-12, the daily Checkout Conversion Rate (CCR) increased from 3.2% to 4.5%...
Define and critique a user activity metric
Context You are on a product team and need to define a metric for user activity to be used in dashboards and decision-making. Question 1. Propose 2–4 ...
How do you choose a classification threshold?
Context You built a binary sentiment classification model (e.g., positive vs. negative) and need to deploy it in a product where actions depend on the...
Explain Type I/II errors vs precision/recall
Questions 1. Define Type I error and Type II error in hypothesis testing, and map them to false positives and false negatives. 2. Explain how Type I/I...
Design recommendations objective balancing growth and monetization
Design a Multi-Objective Recommender for Long-Form Content You are designing the ranking objective and measurement plan for a long-form content recomm...
Investigate visit–report correlation causality
Causal Diagnosis: Do More Ad Page Visits Cause More Reports? Context You observe a positive correlation between the number of ad page visits and the p...
Write SQL for geo posting-frequency drops
Using the schema below, write a single ANSI SQL query (window functions allowed) that identifies countries with the largest share of creators whose po...
Design a creator posting-frequency experiment
You’re on the Creator Growth (PGC) team of a short‑video platform. Product proposes a push/email nudge expected to raise creators’ weekly posting freq...