How do I approach ML System Design interview questions?

ML System Design questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master ml system design interviews.

What difficulty level is this interview question?

This is a medium difficulty ML System Design question, commonly asked during Technical Screen rounds at Snapchat.

What role is this question designed for?

This question is commonly asked for Machine Learning Engineer candidates at Snapchat during technical interviews.

Design short-video retrieval with sparse text

Last updated: Mar 29, 2026

Quick Overview

This question evaluates expertise in ML system design, large-scale information retrieval and recommendation engineering, with emphasis on multi-modal video representation, sparse-text handling, indexing/ANN choices, low-latency online serving, and bias mitigation such as popularity bias.

|Home/ML System Design/Snapchat

Design short-video retrieval with sparse text

Snapchat

Feb 3, 2026, 12:00 AM

mediumMachine Learning EngineerTechnical ScreenML System Design

You are designing the candidate-generation (retrieval) and recommendation system for a short-video app.

Constraints and setting:

Users can search with a text query (e.g., “funny cat fails”), and the system should retrieve relevant short videos .
Only ~20% of videos have reliable text metadata (title/description/hashtags). The rest may have only visual/audio signals.
You must support low-latency online retrieval at large scale.

Tasks:

Propose an end-to-end architecture for query-to-video retrieval and how it fits into a full recommender stack (retrieval → ranking → re-ranking).
Explain how you would represent videos (multi-modal features) and queries, and how you would handle the 80% of videos without text.
Describe offline training, online serving, indexing/ANN choices, and how you would evaluate retrieval quality.
Discuss how you would mitigate popularity bias in retrieval/recommendation while keeping relevance and engagement strong.

Submit Your Answer to Earn 20XP

Loading comments...

Browse More Questions

More ML System Design•More Snapchat•More Machine Learning Engineer•Snapchat Machine Learning Engineer•Snapchat ML System Design•Machine Learning Engineer ML System Design

Your design canvas — auto-saved

Design short-video retrieval with sparse text

Last updated: Mar 29, 2026

Quick Overview

|Home/ML System Design/Snapchat

Design short-video retrieval with sparse text

Snapchat

Feb 3, 2026, 12:00 AM

mediumMachine Learning EngineerTechnical ScreenML System Design

You are designing the candidate-generation (retrieval) and recommendation system for a short-video app.

Constraints and setting:

Users can search with a text query (e.g., “funny cat fails”), and the system should retrieve relevant short videos .
Only ~20% of videos have reliable text metadata (title/description/hashtags). The rest may have only visual/audio signals.
You must support low-latency online retrieval at large scale.

Tasks:

Propose an end-to-end architecture for query-to-video retrieval and how it fits into a full recommender stack (retrieval → ranking → re-ranking).
Explain how you would represent videos (multi-modal features) and queries, and how you would handle the 80% of videos without text.
Describe offline training, online serving, indexing/ANN choices, and how you would evaluate retrieval quality.
Discuss how you would mitigate popularity bias in retrieval/recommendation while keeping relevance and engagement strong.

Submit Your Answer to Earn 20XP

Loading comments...

Browse More Questions

More ML System Design•More Snapchat•More Machine Learning Engineer•Snapchat Machine Learning Engineer•Snapchat ML System Design•Machine Learning Engineer ML System Design

Your design canvas — auto-saved