PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Analytics & Experimentation/Roblox

Design and analyze an ads ranking experiment

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a data scientist's experimental-design and causal-inference skills applied to ad ranking, including sample-size estimation, variance-reduction methods, interference reasoning (auctions, budget pacing, frequency caps), and marketplace distortion diagnostics.

  • hard
  • Roblox
  • Analytics & Experimentation
  • Data Scientist

Design and analyze an ads ranking experiment

Company: Roblox

Role: Data Scientist

Category: Analytics & Experimentation

Difficulty: hard

Interview Round: Technical Screen

You are proposing a new ads ranking model that may increase revenue but could harm UX. Design an experiment and analysis plan. Assumptions: Baseline user-day RPM = $1.80, SD = $0.90 at the user-day aggregation; average 2 sessions/user/day and 5 ad requests/session; advertiser budgets pace hourly; the auction is second-price with per-campaign frequency caps. Tasks: A) Define a North Star metric and at least three guardrail metrics, with precise formulas and acceptable movement ranges. Explain trade-offs. B) Choose the randomization unit (request-, session-, or user-level) and justify by addressing interference (auction spillovers, budget pacing, frequency caps) and cross-device identity. C) Provide a sample-size calculation to detect a +2% lift in the primary metric with 80% power and 5% two-sided alpha. State all assumptions (variance unit, clustering, expected correlation from CUPED/stratification) and show your calculation; then describe how you would re-estimate during the test. D) Detail variance reduction you would use (e.g., CUPED with pre-period user RPM, stratification by geo/device/ad vertical) and why each is valid. E) Specify a ramp plan (percentages, duration per stage), early-stop criteria, and how you will handle winner’s curse and novelty effects. F) Describe diagnostics you will run for marketplace distortions (budget reallocation, cannibalization across campaigns, price feedback loops) and how you would correct for them before a full rollout.

Quick Answer: This question evaluates a data scientist's experimental-design and causal-inference skills applied to ad ranking, including sample-size estimation, variance-reduction methods, interference reasoning (auctions, budget pacing, frequency caps), and marketplace distortion diagnostics.

Related Interview Questions

  • How to estimate feature impact on usage time - Roblox (easy)
  • How to estimate a feature’s causal impact on time spent - Roblox (medium)
  • Compute DID estimate and pretrend flag - Roblox (hard)
  • Compute minimum sample size for A/B test - Roblox (hard)
  • Compute DiD and validate parallel trends - Roblox (hard)
Roblox logo
Roblox
Oct 13, 2025, 9:49 PM
Data Scientist
Technical Screen
Analytics & Experimentation
3
0
Loading...

Ads Ranking Model: Experiment and Analysis Plan

Context

You are evaluating a new ads ranking model expected to increase revenue but potentially harm user experience (UX). The marketplace has hourly budget pacing, a second-price auction, and per-campaign frequency caps. Cross-device identity is imperfect.

Assumptions:

  • Baseline user-day RPM (total ad revenue per user per day) = 1.80;standarddeviation(SD)attheuser−daylevel=1.80; standard deviation (SD) at the user-day level = 1.80;standarddeviation(SD)attheuser−daylevel= 0.90.
  • Average 2 sessions per user per day and 5 ad requests per session (≈10 requests/user-day).
  • Advertiser budgets pace hourly; auction is second-price; per-campaign frequency caps apply.

Tasks

A) Define a North Star metric and at least three guardrail metrics, with precise formulas and acceptable movement ranges. Explain trade-offs.

B) Choose the randomization unit (request-, session-, or user-level) and justify by addressing interference (auction spillovers, budget pacing, frequency caps) and cross-device identity.

C) Provide a sample-size calculation to detect a +2% lift in the primary metric with 80% power and 5% two-sided alpha. State all assumptions (variance unit, clustering, expected correlation from CUPED/stratification) and show your calculation; then describe how you would re-estimate during the test.

D) Detail variance reduction you would use (e.g., CUPED with pre-period user RPM, stratification by geo/device/ad vertical) and why each is valid.

E) Specify a ramp plan (percentages, duration per stage), early-stop criteria, and how you will handle winner’s curse and novelty effects.

F) Describe diagnostics you will run for marketplace distortions (budget reallocation, cannibalization across campaigns, price feedback loops) and how you would correct for them before a full rollout.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Analytics & Experimentation•More Roblox•More Data Scientist•Roblox Data Scientist•Roblox Analytics & Experimentation•Data Scientist Analytics & Experimentation
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.