Roblox Data Scientist Interview Questions
Master your tech interview with our curated database of real questions from top companies.
Generate Friendship List with Acceptance Dates Using Pandas
friend_events
| requester_id | accepter_id | request_date | accept_date |
|--------------|-------------|--------------|-------------|
| 1 |...
Evaluate Impact of New Roblox Homepage Tab
Measuring the Impact of Replacing a Homepage Tab. Scenario: Roblox plans to replace an existing homepage tab with a new one across eligible users. The g...
Analyze Recent Orders Dataset with Python/pandas
orders
| order_id | user_id | price | created_at |
|----------|---------|-------|------------|
| 1 | 101 | 20.5 | 2024-01-01 |
| 2 ...
Determine Player Preference for Local Game Creators
Player Preference for Local Creators: Metrics and Causal Estimation Without A/B Tests. Context: You are evaluating whether players on Roblox prefer gam...
Implement Python Function for Statistical Test P-Values
Scenario: You need a utility that calculates p-values for one-sided and two-sided statistical tests. Question: Write a Python function compute_p_value(s...
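As a rough illustration of the one-sided versus two-sided distinction in this question, here is a minimal sketch. The question's own signature is cut off in the listing, so the function name `p_value_from_z`, its `alternative` parameter, and the assumption of a standard normal null distribution are all hypothetical choices made for this example.

```python
from statistics import NormalDist


def p_value_from_z(z: float, alternative: str = "two-sided") -> float:
    """Return the p-value of a z statistic under a standard normal null.

    Hypothetical helper: `alternative` is "two-sided", "greater", or "less".
    """
    cdf = NormalDist().cdf(z)  # P(Z <= z) for Z ~ N(0, 1)
    if alternative == "greater":
        # Upper tail: probability of a statistic at least this large.
        return 1.0 - cdf
    if alternative == "less":
        # Lower tail: probability of a statistic at most this small.
        return cdf
    if alternative == "two-sided":
        # Double the smaller tail; equivalent to 2 * P(Z >= |z|).
        return 2.0 * min(cdf, 1.0 - cdf)
    raise ValueError(f"unknown alternative: {alternative!r}")
```

For a t-test or another reference distribution, the same three branches would apply with that distribution's CDF in place of the normal one.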
Defend a metric choice under scrutiny
Describe a time you chose a non-obvious primary metric (e.g., time-per-session over total time) and were challenged by a senior stakeholder. 1) How di...
Explain an ML project end-to-end with tradeoffs
Pick one of your production ML projects and walk through it end-to-end. Be specific: 1) Problem framing (prediction vs causal decisioning), target def...
Optimize bread-factory pipeline for max profit
You can assemble a production line by choosing modules of three types: Mixers, Ovens, Packers. Each module i has (type, build_cost_i, throughput_i uni...
Match requests and accepts into friendships in SQL
Given two event tables, write a single SQL query (PostgreSQL) to output unique undirected friendships with the earliest valid friendship_date. Schema ...
Determine if players prefer local creators without experiments
You cannot run an A/B test. Define “local creators” as creators whose games use a player’s native language. Design an observational study to estimate ...
Choose best/worst actions under OA pressure
Behavioral Scenario: Time-Boxed OA With Hidden Test Failure. Context: You are a data scientist candidate 70 minutes into a 90-minute online assessment (...
Optimize assembly-line scheduling with changeovers
You are scheduling a simplified car-assembly line with two parallel stations S1 and S2. Each job must be processed on exactly one station, processing ...
Design leakage-free predictive maintenance pipeline
Predict 24-hour Machine Faults from an Hourly Panel (End-to-End Design). Context: You are given a machine–hour panel: one row per machine per hour with ...
Clean and aggregate factory event data in Pandas
You are given three Pandas DataFrames for a factory: (1) events[event_id, machine_id, ts_utc (datetime64[ns, UTC]), event_type in {'start','stop','fau...
Demonstrate fit with quantified stories and motivations
Behavioral & Leadership Technical Screen Prompt (Data Scientist). Provide two stories, one professional and one personal, that demonstrate you’re a stron...
Implement deduped CTR/RPM aggregator over event stream
Implement a Python function to compute per-day, per-campaign CTR and RPM from an event stream with possible out-of-order and duplicate click events. I...
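One possible shape for such an aggregator is sketched below. The question's input specification is cut off in the listing, so the event schema here is an assumption: each event is a dict with hypothetical keys `type` ("impression" or "click"), `campaign_id`, a timezone-aware `ts`, and, for clicks, `event_id` and `revenue`. The sketch also assumes CTR is clicks divided by impressions and RPM is revenue per 1,000 impressions.

```python
from collections import defaultdict
from datetime import timezone


def aggregate_ctr_rpm(events):
    """Return {(utc_date, campaign_id): {"ctr": ..., "rpm": ...}}.

    Assumed (hypothetical) event keys: type, campaign_id, ts, event_id, revenue.
    """
    seen_clicks = set()  # click event_ids already counted
    stats = defaultdict(lambda: {"impressions": 0, "clicks": 0, "revenue": 0.0})

    for e in events:
        # Bucket by the event's own timestamp, so arrival order does not matter.
        key = (e["ts"].astimezone(timezone.utc).date(), e["campaign_id"])
        if e["type"] == "impression":
            stats[key]["impressions"] += 1
        elif e["type"] == "click":
            if e["event_id"] in seen_clicks:
                continue  # duplicate click: skip it entirely
            seen_clicks.add(e["event_id"])
            stats[key]["clicks"] += 1
            stats[key]["revenue"] += e.get("revenue", 0.0)

    result = {}
    for key, s in stats.items():
        imps = s["impressions"]
        result[key] = {
            # None when there are no impressions, to avoid dividing by zero.
            "ctr": s["clicks"] / imps if imps else None,
            "rpm": 1000.0 * s["revenue"] / imps if imps else None,
        }
    return result
```

Because every count is keyed by the event timestamp rather than by arrival time, out-of-order events land in the correct day without any sorting step.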
Compute CTR, RPM, and daily RPM variability in SQL
Write ANSI-Postgres SQL to compute, for each campaign, over the last 7 days inclusive (assume "today" is 2025-09-01, so the window is 2025-08-26 throu...
Design and analyze an ads ranking experiment
Ads Ranking Model: Experiment and Analysis Plan. Context: You are evaluating a new ads ranking model expected to increase revenue but potentially harm u...
Describe resolving revenue–UX metric conflict
Behavioral: Leading a High-Stakes Revenue vs. UX Trade-off. Context: You led a decision where ads revenue goals conflicted with user-experience metrics...
Implement streaming CTR with deduplication
Implement a Python function to compute streaming, per-campaign CTR over a sliding 24-hour window with click de-duplication and late-arriving events. R...
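A sliding-window variant could look roughly like the sketch below. The requirements are cut off in the listing, so the class name `SlidingCTR`, its method names, and the choice of a half-open window `(now - 24h, now]` are assumptions for illustration. Timestamps are kept in sorted lists so that late-arriving events are inserted at the right position.

```python
import bisect
from collections import defaultdict
from datetime import timedelta

WINDOW = timedelta(hours=24)


class SlidingCTR:
    """Hypothetical per-campaign CTR tracker over a trailing 24-hour window."""

    def __init__(self):
        self.impressions = defaultdict(list)  # campaign_id -> sorted timestamps
        self.clicks = defaultdict(list)       # campaign_id -> sorted timestamps
        self.seen_click_ids = set()           # click ids already accepted

    def add_impression(self, campaign_id, ts):
        # insort keeps the list sorted even when ts is older than the last entry.
        bisect.insort(self.impressions[campaign_id], ts)

    def add_click(self, campaign_id, click_id, ts):
        """Record a click; return False if click_id was already seen."""
        if click_id in self.seen_click_ids:
            return False
        self.seen_click_ids.add(click_id)
        bisect.insort(self.clicks[campaign_id], ts)
        return True

    @staticmethod
    def _count(sorted_ts, now):
        # Number of timestamps t with now - WINDOW < t <= now.
        lo = bisect.bisect_right(sorted_ts, now - WINDOW)
        hi = bisect.bisect_right(sorted_ts, now)
        return hi - lo

    def ctr(self, campaign_id, now):
        imps = self._count(self.impressions.get(campaign_id, []), now)
        clicks = self._count(self.clicks.get(campaign_id, []), now)
        return clicks / imps if imps else None
```

This sketch never evicts old timestamps or click ids, so memory grows without bound; a production version would likely drop entries older than some allowed lateness.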