PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/ML System Design/Google

Design feedback-driven recommender

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competency in online machine learning and recommender system design, covering contextual bandits, feature engineering for users/items/context, real-time feedback processing, exploration–exploitation strategies, reward definition, offline and online evaluation, and scalable, robust architecture.

  • hard
  • Google
  • ML System Design
  • Machine Learning Engineer

Design feedback-driven recommender

Company: Google

Role: Machine Learning Engineer

Category: ML System Design

Difficulty: hard

Interview Round: Onsite

##### Question Design an online learning or bandit-style recommendation system: at each interaction you must choose 1 item out of 4 candidates to show the user, receive immediate feedback, and update the model so that future selections improve over time. Detail model choice, feature engineering, feedback handling, exploration–exploitation strategy, and offline/online evaluation.

Quick Answer: This question evaluates a candidate's competency in online machine learning and recommender system design, covering contextual bandits, feature engineering for users/items/context, real-time feedback processing, exploration–exploitation strategies, reward definition, offline and online evaluation, and scalable, robust architecture.

Related Interview Questions

  • Design an app-store app recommendation system - Google (medium)
  • Design a chatbot over structured and unstructured data - Google (medium)
  • Design a fraud detection system - Google (medium)
  • Choose Fast or Cheap Models - Google
  • Design ML system for self-driving perception - Google (medium)
Google logo
Google
Jul 29, 2025, 8:05 AM
Machine Learning Engineer
Onsite
ML System Design
16
0

Design: Contextual Bandit Recommendation with Online Learning

You are designing an online learning recommendation system. At each user interaction:

  • You receive exactly 4 candidate items from an upstream candidate generator.
  • You must choose exactly 1 item to show the user.
  • You receive immediate feedback (e.g., click or dwell time).
  • The model must update online so that future selections improve over time.

Provide a design that covers:

  1. Model choice (with justification) for a contextual bandit setup.
  2. Feature engineering for users, items, and context, including handling cold start.
  3. Feedback handling and reward definition, including delayed/implicit signals and logging for learning.
  4. Exploration–exploitation strategy and the selection algorithm.
  5. Offline evaluation methodology and online experimentation/monitoring.

State any minimal assumptions you need (e.g., feedback semantics, latency constraints), and make your design robust to non-stationarity and scale.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More ML System Design•More Google•More Machine Learning Engineer•Google Machine Learning Engineer•Google ML System Design•Machine Learning Engineer ML System Design
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.