PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/Machine Learning/LinkedIn

Design a short-video recommender system

Last updated: Apr 11, 2026

Quick Overview

This question evaluates a data scientist's competency in end-to-end machine learning system design for recommender systems, including retrieval and ranking architecture, feature and label design, offline and online evaluation, serving and latency constraints, and monitoring for safety and robustness.

  • easy
  • LinkedIn
  • Machine Learning
  • Data Scientist

Design a short-video recommender system

Company: LinkedIn

Role: Data Scientist

Category: Machine Learning

Difficulty: easy

Interview Round: Technical Screen

## ML System Design — Short-video recommendation Design an end-to-end recommendation system for a short-video feed (TikTok/Reels-style). Walk through the full pipeline: ### 1) Objective and constraints - Define the product goal (e.g., maximize long-term user value). - State constraints: latency budget, freshness, exploration, safety/compliance. ### 2) Labels and training data - What labels would you use (watch time, completion, likes, shares, follows, skips)? - How do you handle **delayed feedback**, **position bias**, and **negative sampling**? - How do you construct training examples (impressions, sessions, user-video pairs)? ### 3) Features - User features (history, embeddings, time-of-day, locale) - Item/video features (content, creator, freshness) - Context features (device, network, entry point) - Cross features / interactions ### 4) “Three-stage” recommendation architecture Describe a standard 3-stage system: 1. **Candidate generation (retrieval)** 2. **Ranking** 3. **Re-ranking / post-processing** (diversity, constraints, business rules) For each stage: - Model family choices (e.g., two-tower, GBDT, deep ranker) - Serving architecture and latency considerations - How you ensure freshness and handle cold start ### 5) Offline evaluation and model selection - Which offline metrics would you use and why? - How do you validate correlation with online metrics? - How do you avoid offline-to-online mismatch? ### 6) Online experimentation - A/B test design, primary/guardrail metrics, ramp strategy - Debugging when offline improves but online regresses ### 7) Safety, robustness, and monitoring - How to prevent harmful content amplification - Monitoring (data drift, performance drift, quality regressions) Be specific and justify trade-offs.

Quick Answer: This question evaluates a data scientist's competency in end-to-end machine learning system design for recommender systems, including retrieval and ranking architecture, feature and label design, offline and online evaluation, serving and latency constraints, and monitoring for safety and robustness.

Related Interview Questions

  • Explain Logistic Regression, Backprop, and Adam - LinkedIn (medium)
  • Explain variance reduction in random forests - LinkedIn (medium)
  • Answer practical ML foundations questions - LinkedIn (medium)
  • Handle imbalance, sampling, and overfitting - LinkedIn (easy)
  • Handle imbalance, validate samples, and avoid overfitting - LinkedIn (easy)
LinkedIn logo
LinkedIn
Feb 1, 2026, 5:26 PM
Data Scientist
Technical Screen
Machine Learning
24
0
Loading...

ML System Design — Short-video recommendation

Design an end-to-end recommendation system for a short-video feed (TikTok/Reels-style). Walk through the full pipeline:

1) Objective and constraints

  • Define the product goal (e.g., maximize long-term user value).
  • State constraints: latency budget, freshness, exploration, safety/compliance.

2) Labels and training data

  • What labels would you use (watch time, completion, likes, shares, follows, skips)?
  • How do you handle delayed feedback , position bias , and negative sampling ?
  • How do you construct training examples (impressions, sessions, user-video pairs)?

3) Features

  • User features (history, embeddings, time-of-day, locale)
  • Item/video features (content, creator, freshness)
  • Context features (device, network, entry point)
  • Cross features / interactions

4) “Three-stage” recommendation architecture

Describe a standard 3-stage system:

  1. Candidate generation (retrieval)
  2. Ranking
  3. Re-ranking / post-processing (diversity, constraints, business rules)

For each stage:

  • Model family choices (e.g., two-tower, GBDT, deep ranker)
  • Serving architecture and latency considerations
  • How you ensure freshness and handle cold start

5) Offline evaluation and model selection

  • Which offline metrics would you use and why?
  • How do you validate correlation with online metrics?
  • How do you avoid offline-to-online mismatch?

6) Online experimentation

  • A/B test design, primary/guardrail metrics, ramp strategy
  • Debugging when offline improves but online regresses

7) Safety, robustness, and monitoring

  • How to prevent harmful content amplification
  • Monitoring (data drift, performance drift, quality regressions)

Be specific and justify trade-offs.

Solution

Show

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More Machine Learning•More LinkedIn•More Data Scientist•LinkedIn Data Scientist•LinkedIn Machine Learning•Data Scientist Machine Learning
PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.