Design a PDF-to-Markdown Inference API
Company: Mistral AI
Role: Software Engineer
Category: ML System Design
Difficulty: hard
Interview Round: Technical Screen
Design an inference service that converts PDF files to Markdown. You can assume the following building blocks already exist:
- A CPU-intensive function that splits a PDF into individual pages and converts each page into a NumPy array
- A GPU-intensive OCR engine
- A memory-intensive post-processing step that converts OCR outputs into Markdown or assembles final page results
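A strong answer usually starts by pinning down the interfaces between the three building blocks. The sketch below is a minimal, order-preserving sync pipeline under assumed interfaces; `split_pdf`, `ocr_batch`, and `postprocess` are hypothetical stubs standing in for the real CPU, GPU, and memory-intensive components, and the batch size is illustrative.

```python
import numpy as np

# Hypothetical stand-ins for the three building blocks described above
# (names, signatures, and behavior are assumptions, not a real API).

def split_pdf(pdf_bytes: bytes) -> list[np.ndarray]:
    """CPU-intensive: split a PDF into one image array per page."""
    # Stub: pretend every 100 bytes of input is one page.
    n_pages = max(1, len(pdf_bytes) // 100)
    return [np.zeros((32, 32), dtype=np.uint8) for _ in range(n_pages)]

def ocr_batch(pages: list[np.ndarray]) -> list[str]:
    """GPU-intensive: OCR a batch of page images, one text per page."""
    return ["page text" for _ in pages]

def postprocess(page_texts: list[str]) -> str:
    """Memory-intensive: assemble per-page OCR outputs into Markdown."""
    return "\n\n".join(
        f"## Page {i + 1}\n\n{text}" for i, text in enumerate(page_texts)
    )

def convert(pdf_bytes: bytes, batch_size: int = 8) -> str:
    """Synchronous pipeline: split, OCR in GPU-friendly batches, assemble."""
    pages = split_pdf(pdf_bytes)                    # CPU stage
    texts: list[str] = []
    for i in range(0, len(pages), batch_size):      # GPU stage, batched
        texts.extend(ocr_batch(pages[i:i + batch_size]))
    return postprocess(texts)                       # memory stage, in page order
```

Batching pages before the OCR call keeps the GPU saturated, and extending `texts` batch by batch preserves page order without any reordering step; in the interview, the follow-up is how to overlap the CPU and GPU stages rather than run them sequentially.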
Discuss two scenarios:
1. A synchronous API for one very large document, such as a 1000-page PDF, where the user wants the full converted output as quickly as possible
2. An asynchronous API for many concurrent conversion requests, where the client can receive the result later
Explain the API contract, page-level parallelism, CPU and GPU scheduling, batching, result ordering, intermediate storage, fault tolerance, backpressure, and how the system should scale.
Quick Answer: This question evaluates whether a candidate can design a scalable, resource-aware ML inference system: defining a clean API contract, exploiting page-level parallelism, scheduling work across CPU, GPU, and memory-bound stages, batching for GPU efficiency, preserving result order, choosing intermediate storage, and reasoning about fault tolerance, backpressure, and scaling trade-offs.