PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/ML System Design/Roblox

Design a real-time game recommendation system

Last updated: Mar 29, 2026

Quick Overview

This question evaluates proficiency in designing scalable, low-latency ML-powered recommendation system architectures, covering real-time data ingestion and streaming feature computation, real-time feature stores, service contracts for candidate generation and ranking, caching and freshness, personalization state, scalability, experimentation, failure recovery, and operational concerns like monitoring, privacy, and security. It is commonly asked to gauge an engineer's ability to reason about trade-offs between latency, freshness, consistency, and cost for global high-throughput platforms; category: ML System Design (recommender systems), domain: systems architecture and operations, level of abstraction: practical application and architectural design rather than model internals.

  • hard
  • Roblox
  • ML System Design
  • Machine Learning Engineer

Design a real-time game recommendation system

Company: Roblox

Role: Machine Learning Engineer

Category: ML System Design

Difficulty: hard

Interview Round: Technical Screen

Design a real-time game recommendation system for a platform with millions of games and tens of millions of users. Focus on the system architecture, not ML model details. Describe: 1) end-to-end request flow and latency SLOs; 2) online/offline data ingestion (user events, game metadata) and schemas; 3) streaming feature computation and a real-time feature store; 4) candidate generation and ranking service interfaces; 5) caching strategy and freshness/TTL; 6) user personalization state and session handling; 7) scalability plans (QPS estimates, partitioning, replication, storage); 8) experimentation and rollout (A/B, feature flags); 9) cold-start strategies; 10) failure modes, consistency, and backfill/lag recovery; 11) privacy, security, and abuse prevention; 12) monitoring, alerting, and cost controls.

Quick Answer: This question evaluates proficiency in designing scalable, low-latency ML-powered recommendation system architectures, covering real-time data ingestion and streaming feature computation, real-time feature stores, service contracts for candidate generation and ranking, caching and freshness, personalization state, scalability, experimentation, failure recovery, and operational concerns like monitoring, privacy, and security. It is commonly asked to gauge an engineer's ability to reason about trade-offs between latency, freshness, consistency, and cost for global high-throughput platforms; category: ML System Design (recommender systems), domain: systems architecture and operations, level of abstraction: practical application and architectural design rather than model internals.

Related Interview Questions

  • Design a Game Recommendation System - Roblox (medium)
  • Design a game genre classifier - Roblox (medium)
  • Design a game recommendation modeling approach - Roblox (easy)
  • Design a Static Audio Detection System - Roblox (hard)
  • Outline the ML inference and labeling pipeline - Roblox (hard)
Roblox logo
Roblox
Jul 16, 2025, 12:00 AM
Machine Learning Engineer
Technical Screen
ML System Design
6
0

System Design: Real‑Time Game Recommendation System (Architecture Focus)

You are designing a real‑time recommendation system for a large gaming platform with millions of games and tens of millions of users. The focus is on system architecture and operations, not the internals of ML models.

Requirements

Describe the following aspects clearly:

  1. End‑to‑end request flow and latency SLOs.
  2. Online/offline data ingestion (user events, game metadata) and schemas.
  3. Streaming feature computation and a real‑time feature store.
  4. Candidate generation and ranking service interfaces.
  5. Caching strategy and freshness/TTL.
  6. User personalization state and session handling.
  7. Scalability plans (QPS estimates, partitioning, replication, storage).
  8. Experimentation and rollout (A/B, feature flags).
  9. Cold‑start strategies.
  10. Failure modes, consistency, and backfill/lag recovery.
  11. Privacy, security, and abuse prevention.
  12. Monitoring, alerting, and cost controls.

Assume:

  • Tens of millions of MAU/DAU, mobile and desktop clients.
  • Real‑time freshness for recent interactions (sub‑second to low seconds).
  • Global user base with multi‑region traffic.
  • The ML models exist but their internals are out of scope; you must define their service contracts and integration points.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More ML System Design•More Roblox•More Machine Learning Engineer•Roblox Machine Learning Engineer•Roblox ML System Design•Machine Learning Engineer ML System Design
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.