PracHub
QuestionsPremiumLearningGuidesInterview PrepNEWCoaches
|Home/ML System Design/Adobe

Design file-embedding storage system

Last updated: Mar 29, 2026

Quick Overview

This question evaluates understanding of multimodal embedding pipelines, covering ingestion, preprocessing (OCR, transcripts, frame extraction), model selection for text/image/video, scalable inference and GPU/CPU utilization, storage schema for raw assets, metadata and vectors, retrieval and cross-modal search in an ML System Design context.

  • hard
  • Adobe
  • ML System Design
  • Software Engineer

Design file-embedding storage system

Company: Adobe

Role: Software Engineer

Category: ML System Design

Difficulty: hard

Interview Round: Technical Screen

##### Question Design a system that generates embeddings for user-uploaded documents, images, and videos (each ≤ x MB) and stores them in a database; cover ingestion, preprocessing, model selection, scalability, storage schema, and retrieval.

Quick Answer: This question evaluates understanding of multimodal embedding pipelines, covering ingestion, preprocessing (OCR, transcripts, frame extraction), model selection for text/image/video, scalable inference and GPU/CPU utilization, storage schema for raw assets, metadata and vectors, retrieval and cross-modal search in an ML System Design context.

Related Interview Questions

  • Design a multimodal embedding service - Adobe (hard)
  • Design a natural-language AEP Q&A assistant - Adobe (hard)
Adobe logo
Adobe
Aug 4, 2025, 10:55 AM
Software Engineer
Technical Screen
ML System Design
12
0

System Design: Multimodal Embedding Service for User Uploads

Context

You are designing a backend service that, for each user-uploaded asset, generates vector embeddings and stores them for search and retrieval.

  • Supported asset types: documents (PDF/DOCX/TXT), images (PNG/JPEG), and videos (MP4).
  • Maximum upload size per asset: x MB (configurable; assume x is a limit enforced by the API).
  • The system must cover: ingestion, preprocessing, model selection, scalability, storage schema, and retrieval.
  • Assume multi-tenant usage and that asynchronous processing is acceptable (embeddings ready within seconds to minutes).

Requirements

  1. Ingestion: Validate and accept uploads, handle security checks, and persist raw assets.
  2. Preprocessing: Extract text from documents, handle OCR for scanned pages, handle frames/transcripts for videos, and normalize images/videos.
  3. Model Selection: Choose models for text, image, and video embeddings (and speech-to-text where needed); justify trade-offs.
  4. Scalability: Design for horizontal scale, efficient GPU/CPU utilization, and cost control; ensure observability and fault tolerance.
  5. Storage Schema: Define where raw assets, metadata, and vectors live, including partitioning and indexing choices.
  6. Retrieval: Support text, image, or video queries that retrieve relevant documents/images/videos; include cross-modal search.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More ML System Design•More Adobe•More Software Engineer•Adobe Software Engineer•Adobe ML System Design•Software Engineer ML System Design
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.