PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/Machine Learning/Upstart

Design a Real-Time Personalized Ad Selection System

Last updated: Mar 29, 2026

Quick Overview

This question evaluates competency in end-to-end real-time ML system design for personalized ad selection, including model decisioning, feature engineering and stores, exploration–exploitation strategies, offline and online evaluation, and scalability and latency constraints within the Machine Learning domain.

  • hard
  • Upstart
  • Machine Learning
  • Data Scientist

Design a Real-Time Personalized Ad Selection System

Company: Upstart

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Technical Screen

##### Scenario You are asked to build a data-driven advertising platform that serves personalized ads to millions of users in real time. ##### Question Design an end-to-end machine-learning system for ad selection: discuss data collection, labeling, feature engineering, model choice, exploration-exploitation strategy, online/offline evaluation, latency constraints, and scalability. ##### Hints CTR prediction with logistic regression/GBDT, counterfactual evaluation, feature store, bandits for exploration.

Quick Answer: This question evaluates competency in end-to-end real-time ML system design for personalized ad selection, including model decisioning, feature engineering and stores, exploration–exploitation strategies, offline and online evaluation, and scalability and latency constraints within the Machine Learning domain.

Related Interview Questions

  • Explain L1 vs L2 and ridge vs lasso - Upstart (easy)
  • Implement PAVA spend-smoothing under no-borrowing constraint - Upstart (hard)
  • Derive logistic regression objective and gradients - Upstart (easy)
  • Address Missing Income Bracket in California Housing Data - Upstart (hard)
  • Design Push-Notification System for Airport Surge Pricing - Upstart (medium)
Upstart logo
Upstart
Aug 4, 2025, 10:55 AM
Data Scientist
Technical Screen
Machine Learning
6
0

End-to-End ML System Design: Real-Time Ad Selection

Context

You need to design a real-time, data-driven ad selection system that personalizes ads for millions of users. Assume:

  • Request volume: millions of users/day; peak QPS in the thousands.
  • Inventory: tens to hundreds of thousands of ads.
  • Strict latency targets: p95 < 100 ms for ad decisioning (excluding network).
  • Business objective: maximize expected value (e.g., clicks or revenue) while respecting advertiser budgets and policies.

Task

Design an end-to-end ML system to select ads at request time. Discuss and justify:

  1. Data collection and logging
  2. Labeling and outcome definition
  3. Feature engineering and a feature store
  4. Model choice and training objectives
  5. Exploration–exploitation strategy (e.g., contextual bandits)
  6. Offline evaluation and counterfactual/replay evaluation
  7. Online evaluation (A/B tests, guardrails)
  8. Latency and reliability constraints
  9. Scalability and architecture (training/serving pipelines)

Hints: CTR prediction with logistic regression or GBDT; counterfactual evaluation (IPS/DR); feature store for train-serve consistency; bandits for exploration.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Upstart•More Data Scientist•Upstart Data Scientist•Upstart Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.