PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/Amazon

Design a Machine Learning Recommendation System Pipeline

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's competency in designing end-to-end, low-latency machine learning recommendation pipelines, encompassing data collection and governance, feature engineering and feature stores, training workflows and model refresh strategies, online/offline serving.

  • hard
  • Amazon
  • Machine Learning
  • Data Scientist

Design a Machine Learning Recommendation System Pipeline

Company: Amazon

Role: Data Scientist

Category: Machine Learning

Difficulty: hard

Interview Round: Onsite

##### Scenario Designing a machine-learning-powered recommendation system from data collection to real-time serving. ##### Question Design an end-to-end recommendation system. Discuss data collection, feature pipelines, training workflow, model refresh cadence, online/offline architecture, and meeting real-time latency requirements. ##### Hints Address feedback loops, A/B testing, and fallback logic.

Quick Answer: This question evaluates a candidate's competency in designing end-to-end, low-latency machine learning recommendation pipelines, encompassing data collection and governance, feature engineering and feature stores, training workflows and model refresh strategies, online/offline serving.

Related Interview Questions

  • Predicting the Next Elevator Call Location - Amazon (medium)
  • Explain Transformer and MoE Fundamentals - Amazon (medium)
  • Explain Core ML Interview Concepts - Amazon (hard)
  • Evaluate NLP Classification Models - Amazon (easy)
  • Explain overfitting, regularization, and LLM techniques - Amazon (medium)
Amazon logo
Amazon
Aug 4, 2025, 10:55 AM
Data Scientist
Onsite
Machine Learning
19
0

System Design: End-to-End ML Recommendation System

Scenario

You are building an end-to-end machine-learning-powered recommendation system for a large consumer application (e.g., e-commerce). The system must recommend items on high-traffic surfaces (home feed, product detail pages) with strict real-time latency constraints.

Task

Design the system from data collection to real-time serving. Clearly describe:

  1. Data collection and governance
  • What user/item/context signals to log, how to structure event schemas, identity management, and how to prevent data leakage.
  1. Feature pipelines
  • Batch and streaming feature engineering, a feature store strategy, point-in-time correctness, and training–serving consistency.
  1. Training workflow
  • Labeling strategy, negative sampling, model architectures (e.g., retrieval + ranking), objective functions, experiment tracking, and offline evaluation.
  1. Model refresh cadence
  • How frequently to update embeddings and ranking models; handling cold-start for new users/items.
  1. Online/offline architecture
  • Candidate generation, ranking, re-ranking, caching, vector search/ANN, and how offline components (data lake, orchestration) integrate with online serving.
  1. Real-time latency requirements
  • An end-to-end p95/p99 latency budget and techniques to meet it.

Additionally address

  • Feedback loops and exploration vs. exploitation.
  • A/B testing and experiment guardrails.
  • Fallback logic and graceful degradation.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More Amazon•More Data Scientist•Amazon Data Scientist•Amazon Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.