What difficulty level is this interview question?

This is a hard System Design question, commonly asked in take-home project rounds at Da Vinci Trading.

What role is this question designed for?

This question is commonly asked of Software Engineer candidates at Da Vinci Trading during technical interviews.

Design an in-memory order matching engine | Da Vinci Trading Interview Question

Q: Design an in-memory order matching engine

This question evaluates expertise in high-performance trading system design, covering in-memory order books, latency-sensitive order matching, efficient C++ data structures, correctness under price-time priority, and concurrency and complexity analysis.

You are asked to design an in-memory order book / order matching system for a single electronic trading venue. The implementation language is C++, and the system must meet strict runtime and latency requirements (microseconds to low milliseconds per operation) under high throughput.

Work through the design step by step. For each part below, justify your choices and explain why your approach can meet the latency and throughput bar in C++.

Constraints & Assumptions

Anchor your design to the following (state any additional assumptions explicitly):

Single venue, single machine. The whole order book fits in RAM on one host; no distributed consensus is required for the core matcher.
Throughput target: tens to hundreds of thousands of orders/second.
Latency target: microsecond-scale, low-jitter per-order processing (low milliseconds is the outer bound).
Priority model: strict price–time priority (best price first; ties broken by arrival order).
At minimum, LIMIT orders and cancel-by-order_id must be supported; other order types are an extension you should describe.
This mirrors a real take-home from a low-latency trading firm (Optiver/Da Vinci style), C++ only, roughly a 3-hour budget — so prefer concrete, implementable structures over hand-waving.

Clarifying Questions to Ask

A strong candidate scopes the problem before designing. Consider asking the interviewer:

One symbol or many? Is a single instance responsible for one instrument, or must one engine multiplex many symbols?
Price representation: are prices on a fixed tick grid (and is the tick range bounded around the current price), or arbitrary/sparse?
Order-type surface: which types are in scope now (LIMIT only?) vs. which must be extensible (MARKET, IOC, FOK, GTC, PostOnly)?
Trade-price convention: does a fill execute at the resting (maker) price or the aggressor's price? (Venue policy, and it is correctness-critical.)
Durability requirement: is this purely in-memory (rebuild from the feed on restart), or must we recover exact book + trade state after a crash?
Self-trading: is self-match prevention required, and which policy (cancel-resting / cancel-incoming / cancel-both)?
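
If the answer to the price-representation question is "fixed tick grid", prices can be stored as integer tick counts, which keeps comparisons exact (in IEEE-754 double arithmetic, 0.1 + 0.2 == 0.3 is false). The following is a minimal C++17 sketch under stated assumptions: the tick size of 0.01 (kPriceDecimals) and the function name parse_price_ticks are hypothetical and not part of the question, and overflow handling is left out.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <string_view>

// Assumption: the tick size is 10^-kPriceDecimals (0.01 here).
constexpr int kPriceDecimals = 2;

// Parses a non-negative decimal string such as "101.25" into integer ticks.
// Returns nullopt for malformed input or for more fractional digits than
// kPriceDecimals. Overflow checks are omitted to keep the sketch short.
std::optional<std::int64_t> parse_price_ticks(std::string_view text) {
    std::int64_t whole = 0;
    std::int64_t frac = 0;
    int frac_digits = 0;
    bool seen_dot = false;
    bool seen_digit = false;
    for (const char c : text) {
        if (c == '.') {
            if (seen_dot) return std::nullopt;  // second decimal point
            seen_dot = true;
            continue;
        }
        if (c < '0' || c > '9') return std::nullopt;
        seen_digit = true;
        if (!seen_dot) {
            whole = whole * 10 + (c - '0');
        } else {
            if (frac_digits == kPriceDecimals) return std::nullopt;  // too many digits
            frac = frac * 10 + (c - '0');
            ++frac_digits;
        }
    }
    if (!seen_digit) return std::nullopt;
    std::int64_t scale = 1;
    for (int i = 0; i < kPriceDecimals; ++i) scale *= 10;
    for (; frac_digits < kPriceDecimals; ++frac_digits) frac *= 10;  // "0.1" -> 10 ticks
    assert(frac < scale);
    return whole * scale + frac;
}
```

With this representation the matcher only ever compares and copies 64-bit integers; conversion to and from decimal text happens once, at the gateway.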

What a Strong Answer Covers

Part 1 — Core functionality

The system receives a stream of orders. Each order has at least:

order_id (unique identifier)
side (BUY or SELL)
price
quantity
timestamp or sequence number (arrival time)
type (e.g., LIMIT, MARKET; assume at minimum LIMIT must be supported)

Maintain an order book with two sides:

Bid side: buy limit orders, sorted by descending price, then by time (price–time priority).
Ask side: sell limit orders, sorted by ascending price, then by time (price–time priority).

On the arrival of a new order, the system must:

Attempt to match it against existing orders on the opposite side according to price–time priority.
Generate trades on a match, specifying the fields a trade carries (e.g., trade_id, price, quantity, the involved order ids, timestamp).
Handle both partial fills and complete fills.
If the order is not fully filled and is still valid (e.g., not IOC), insert the remaining quantity into the book.
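
The arrival flow above can be sketched roughly as follows. This is an illustration rather than a reference solution: it assumes C++17, LIMIT orders only, integer tick prices, and fills at the resting (maker) price, which is a venue policy you would confirm with the interviewer. SimpleBook, Order and Trade are hypothetical names, and std::map with std::deque is chosen for brevity, not for latency.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <deque>
#include <functional>
#include <map>
#include <vector>

enum class Side { Buy, Sell };

struct Order {
    std::uint64_t id;
    Side side;
    std::int64_t price;  // integer ticks (assumption)
    std::uint64_t qty;
    std::uint64_t seq;   // arrival sequence number
};

struct Trade {
    std::uint64_t trade_id;
    std::uint64_t maker_id;
    std::uint64_t taker_id;
    std::int64_t price;  // maker's price (assumed venue policy)
    std::uint64_t qty;
    std::uint64_t taker_seq;
};

class SimpleBook {
public:
    // Matches the incoming LIMIT order, then rests any remainder.
    std::vector<Trade> submit(Order in) {
        std::vector<Trade> trades;
        if (in.side == Side::Buy) {
            match(in, asks_, [](std::int64_t limit, std::int64_t level) { return level <= limit; }, trades);
            if (in.qty > 0) bids_[in.price].push_back(in);
        } else {
            match(in, bids_, [](std::int64_t limit, std::int64_t level) { return level >= limit; }, trades);
            if (in.qty > 0) asks_[in.price].push_back(in);
        }
        return trades;
    }

private:
    template <class Levels, class Crosses>
    void match(Order& in, Levels& levels, Crosses crosses, std::vector<Trade>& out) {
        while (in.qty > 0 && !levels.empty()) {
            auto best = levels.begin();                  // best price on the opposite side
            if (!crosses(in.price, best->first)) break;  // no longer marketable
            auto& queue = best->second;
            assert(!queue.empty());                      // empty levels are always erased
            Order& maker = queue.front();                // oldest order at that price
            const std::uint64_t fill = std::min(in.qty, maker.qty);
            out.push_back({next_trade_id_++, maker.id, in.id, best->first, fill, in.seq});
            in.qty -= fill;
            maker.qty -= fill;
            if (maker.qty == 0) queue.pop_front();
            if (queue.empty()) levels.erase(best);
        }
    }

    std::map<std::int64_t, std::deque<Order>, std::greater<std::int64_t>> bids_;  // highest first
    std::map<std::int64_t, std::deque<Order>> asks_;                               // lowest first
    std::uint64_t next_trade_id_ = 1;
};
```

Each loop iteration either fully fills the head maker or exhausts the incoming order, so the loop performs at most one iteration per resting order consumed plus one.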

Part 2 — Order types and constraints

At minimum, support:

Limit orders.
Cancel existing orders by order_id.

Then describe briefly how you would extend to support additional types (e.g., MARKET, IOC, FOK, GTC, and optionally PostOnly) and any constraints each implies.
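
One way to describe the extension is to split each order type into a pre-match admission rule and a rule for leftover quantity. The sketch below assumes that a plain LIMIT order is good-till-cancel and that the caller has already computed would_cross and fillable_qty from the book; admit and remainder_policy are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Assumption: a plain Limit order is good-till-cancel (GTC).
enum class OrderType { Limit, Market, IOC, FOK, PostOnly };
enum class Remainder { Rest, Cancel };

// Pre-match admission check, run before any fill is generated.
//   would_cross:  the order's price crosses the best opposite price
//   fillable_qty: quantity available on the opposite side at acceptable prices
bool admit(OrderType type, bool would_cross, std::uint64_t fillable_qty,
           std::uint64_t order_qty) {
    assert(order_qty > 0);
    switch (type) {
        case OrderType::PostOnly: return !would_cross;               // may only add liquidity
        case OrderType::FOK:      return fillable_qty >= order_qty;  // all or nothing
        default:                  return true;
    }
}

// What happens to quantity left over after the matching loop.
Remainder remainder_policy(OrderType type) {
    switch (type) {
        case OrderType::Limit:
        case OrderType::PostOnly:
            return Remainder::Rest;
        case OrderType::Market:
        case OrderType::IOC:
        case OrderType::FOK:
            return Remainder::Cancel;
    }
    return Remainder::Cancel;  // unreachable for valid enum values
}
```

Under this split, a MARKET order can reuse the LIMIT matching loop by treating its limit price as unbounded, and only the two small functions above change per type.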

Part 3 — Performance and complexity constraints

The system must support very high throughput (tens to hundreds of thousands of orders/second) with low latency per operation. State your target time complexity for each of:

Insertion of a limit order.
Matching an incoming order (scanning and removing top-of-book orders on the opposite side).
Cancel by order_id.

You may assume the order book fits in memory on a single machine.

Part 4 — Data structures and design

Propose specific C++-oriented data structures to:

Maintain price levels on each side (bids/asks) in sorted order.
Maintain FIFO order of orders within a price level.
Support O(1) or near-O(1) cancellation by order_id.

Discuss how you would organize these structures (e.g., maps from price to queues, auxiliary hash maps for order lookup, custom allocators, object pools, etc.).
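
A possible organization, sketched under assumptions: a sorted map from price (in ticks) to a level, a doubly linked list per level for FIFO order, and a hash map from order_id to the order's position. IndexedBook, Level and Locator are hypothetical names. The standard containers are used here for clarity; a production design would likely replace them with pooled nodes or a flat price ladder to avoid per-order heap allocation.

```cpp
#include <cassert>
#include <cstdint>
#include <iterator>
#include <list>
#include <map>
#include <optional>
#include <unordered_map>

enum class Side { Buy, Sell };

struct RestingOrder {
    std::uint64_t id;
    std::uint64_t qty;
};

struct Level {
    std::list<RestingOrder> fifo;  // front = oldest, so time priority is list order
    std::uint64_t total_qty = 0;
};

class IndexedBook {
    using Levels = std::map<std::int64_t, Level>;  // keyed by price in ticks

public:
    bool add(std::uint64_t id, Side side, std::int64_t price, std::uint64_t qty) {
        if (qty == 0 || index_.count(id) != 0) return false;
        Levels& levels = side_levels(side);
        const auto level_it = levels.try_emplace(price).first;  // O(log L)
        Level& level = level_it->second;
        level.fifo.push_back({id, qty});
        level.total_qty += qty;
        index_.emplace(id, Locator{side, level_it, std::prev(level.fifo.end())});
        return true;
    }

    bool cancel(std::uint64_t id) {
        const auto found = index_.find(id);  // O(1) on average
        if (found == index_.end()) return false;
        const Locator loc = found->second;
        Level& level = loc.level->second;
        assert(level.total_qty >= loc.order->qty);
        level.total_qty -= loc.order->qty;
        level.fifo.erase(loc.order);  // O(1): list iterators stay valid
        if (level.fifo.empty()) side_levels(loc.side).erase(loc.level);
        index_.erase(found);
        return true;
    }

    std::optional<std::int64_t> best_bid() const {
        if (bids_.empty()) return std::nullopt;
        return bids_.rbegin()->first;  // highest bid
    }
    std::optional<std::int64_t> best_ask() const {
        if (asks_.empty()) return std::nullopt;
        return asks_.begin()->first;  // lowest ask
    }
    std::uint64_t level_qty(Side side, std::int64_t price) const {
        const Levels& levels = (side == Side::Buy) ? bids_ : asks_;
        const auto it = levels.find(price);
        return it == levels.end() ? 0 : it->second.total_qty;
    }

private:
    struct Locator {
        Side side;
        Levels::iterator level;                   // map iterators survive other inserts/erases
        std::list<RestingOrder>::iterator order;  // position inside the level's FIFO
    };
    Levels& side_levels(Side side) { return side == Side::Buy ? bids_ : asks_; }

    Levels bids_;
    Levels asks_;
    std::unordered_map<std::uint64_t, Locator> index_;
};
```

Storing both iterators in the locator is what makes cancel independent of book depth: neither the level nor the order has to be searched for.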

Part 5 — Concurrency and threading

Assume the system may receive orders from multiple network threads. Describe how you would make the core matching logic thread-safe and low-contention while preserving strict price–time ordering. Consider options such as:

A single-threaded matching engine fed by message queues.
Sharding per symbol.
Lock-free queues or carefully scoped locking.
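
For the single-threaded matcher option, each network thread could hand orders to the matcher through a bounded single-producer/single-consumer ring. The sketch below is one common way to write such a ring with C++ atomics; SpscRing is a hypothetical name, the 64-byte alignment is an assumed cache-line size, and T is assumed to be default-constructible and cheap to copy. With several producers, one ring per producer plus a sequence number assigned at a single point is one way to keep arrival order well defined.

```cpp
#include <array>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <optional>

template <typename T, std::size_t Capacity>
class SpscRing {
    static_assert(Capacity > 0 && (Capacity & (Capacity - 1)) == 0,
                  "Capacity must be a power of two");

public:
    // Called only by the single producer thread.
    bool try_push(const T& value) {
        const std::size_t head = head_.load(std::memory_order_relaxed);
        const std::size_t tail = tail_.load(std::memory_order_acquire);
        assert(head - tail <= Capacity);
        if (head - tail == Capacity) return false;  // full
        slots_[head & (Capacity - 1)] = value;
        head_.store(head + 1, std::memory_order_release);  // publish the slot
        return true;
    }

    // Called only by the single consumer (matcher) thread.
    std::optional<T> try_pop() {
        const std::size_t tail = tail_.load(std::memory_order_relaxed);
        const std::size_t head = head_.load(std::memory_order_acquire);
        if (tail == head) return std::nullopt;  // empty
        T value = slots_[tail & (Capacity - 1)];
        tail_.store(tail + 1, std::memory_order_release);  // free the slot
        return value;
    }

private:
    std::array<T, Capacity> slots_{};
    alignas(64) std::atomic<std::size_t> head_{0};  // written by the producer only
    alignas(64) std::atomic<std::size_t> tail_{0};  // written by the consumer only
};
```

Because the book itself is touched by one thread only, the matching path needs no locks at all; the ring is the only shared state.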

Part 6 — Failure handling and robustness

What happens on process restart? Do you need to recover the order book state? If so, how might you snapshot or log state efficiently?
How do you ensure each order is processed exactly once and that trades are not duplicated when recovering?
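
One commonly used approach is to journal every input command with a monotonically increasing sequence number, snapshot the book together with the last applied sequence number, and on restart replay only the journal entries after the snapshot. Provided the matcher is deterministic, the replay regenerates the same trades with the same ids. The sketch below shows only the sequence check; SequencedEngine, Command and replay are hypothetical names, and the vector of applied ids stands in for the real matcher.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct Command {
    std::uint64_t seq;  // assigned once, before journaling (assumption)
    std::uint64_t order_id;
};

enum class ApplyResult { Applied, Duplicate, Gap };

class SequencedEngine {
public:
    // snapshot_seq: sequence number of the last command reflected in the snapshot.
    explicit SequencedEngine(std::uint64_t snapshot_seq = 0) : last_applied_(snapshot_seq) {}

    ApplyResult apply(const Command& cmd) {
        if (cmd.seq <= last_applied_) return ApplyResult::Duplicate;  // already processed
        if (cmd.seq != last_applied_ + 1) return ApplyResult::Gap;    // journal entries missing
        applied_.push_back(cmd.order_id);  // stand-in for the real matcher call
        last_applied_ = cmd.seq;
        return ApplyResult::Applied;
    }

    std::uint64_t last_applied() const { return last_applied_; }
    const std::vector<std::uint64_t>& applied() const { return applied_; }

private:
    std::uint64_t last_applied_;
    std::vector<std::uint64_t> applied_;
};

// Replays a journal on top of a snapshot; returns how many commands were applied.
std::size_t replay(SequencedEngine& engine, const std::vector<Command>& journal) {
    const std::uint64_t start = engine.last_applied();
    std::size_t applied = 0;
    for (const Command& cmd : journal) {
        const ApplyResult result = engine.apply(cmd);
        if (result == ApplyResult::Gap) break;  // stop rather than apply out of order
        if (result == ApplyResult::Applied) ++applied;
    }
    assert(engine.last_applied() - start == applied);  // sequence advanced one per command
    return applied;
}
```

Replaying the same journal twice is harmless under this scheme, because every entry at or below the last applied sequence number is skipped as a duplicate.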

Part 7 — API and output

Sketch a simple API for the engine (C++ interfaces or function signatures) to:

Submit a new order.
Cancel an order.
Query the current top-of-book (best bid/ask) and possibly full depth.

Show what the engine outputs when matches occur (trade events, order-status updates, etc.).
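
As a starting point, the interface and its output events might look like the sketch below. Every name here (MatchingEngine, NewOrder, TradeEvent, and so on) is hypothetical and chosen only for illustration; the design assumes C++17, integer tick prices, and an engine that appends events to a caller-supplied vector so that the hot path does not allocate per call.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <variant>
#include <vector>

enum class Side { Buy, Sell };
enum class OrderType { Limit, Market, IOC, FOK, PostOnly };
enum class RejectReason { DuplicateId, UnknownId, InvalidQty, WouldCross };

struct NewOrder {
    std::uint64_t order_id;
    Side side;
    OrderType type;
    std::int64_t price_ticks;
    std::uint64_t qty;
};

// Output events, emitted in the order they occur.
struct TradeEvent {
    std::uint64_t trade_id;
    std::uint64_t maker_order_id;
    std::uint64_t taker_order_id;
    std::int64_t price_ticks;
    std::uint64_t qty;
    std::uint64_t seq;
};
struct OrderAccepted  { std::uint64_t order_id; std::uint64_t resting_qty; };
struct OrderCancelled { std::uint64_t order_id; std::uint64_t cancelled_qty; };
struct OrderRejected  { std::uint64_t order_id; RejectReason reason; };
using Event = std::variant<TradeEvent, OrderAccepted, OrderCancelled, OrderRejected>;

struct LevelView { std::int64_t price_ticks; std::uint64_t total_qty; };
struct TopOfBook {
    std::optional<LevelView> best_bid;
    std::optional<LevelView> best_ask;
};

// True if bid >= ask, which should never hold once matching has completed.
inline bool is_crossed(const TopOfBook& top) {
    return top.best_bid && top.best_ask &&
           top.best_bid->price_ticks >= top.best_ask->price_ticks;
}

// Debug-build sanity check an implementation might run after each command.
inline void check_not_crossed(const TopOfBook& top) { assert(!is_crossed(top)); }

class MatchingEngine {
public:
    virtual ~MatchingEngine() = default;
    // Each call appends the events it produced to `out`.
    virtual void submit(const NewOrder& order, std::vector<Event>& out) = 0;
    virtual void cancel(std::uint64_t order_id, std::vector<Event>& out) = 0;
    virtual TopOfBook top_of_book() const = 0;
    virtual std::vector<LevelView> depth(Side side, std::size_t max_levels) const = 0;
};
```

A submit that crosses the book would typically yield one TradeEvent per fill followed by an OrderAccepted for any resting remainder, while a cancel yields either OrderCancelled or OrderRejected with UnknownId.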

Part 8 — Complexity analysis

Analyze the time and space complexity of your design:

Average and worst-case complexities for add, match, cancel, and query best bid/ask.
Discuss the trade-offs you made between simplicity and performance.

Follow-up Questions

Be ready for deeper probes after your main design:

Scale-up: how does the design change if a single symbol's order rate is 10–100× higher, or if one engine must handle thousands of symbols? What breaks first?
Self-match prevention: walk through your STP policy and prove the matching loop still terminates when the aggressor and the head maker share an account.
FOK under self-liquidity: if the incoming account already has resting orders in the crossable range, why might a naive feasibility check (summing level totals) let a FOK pass and then under-fill — and how do you fix it?
Numeric correctness: why ban floating-point prices on the matching path, and how do you represent price exactly instead?
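
The FOK probe above can be made concrete with a small sketch. It assumes a BUY fill-or-kill order, an STP policy under which the incoming order never trades against its own account's resting orders, and an ask side stored as a sorted map from price ticks to a FIFO of orders. Resting, AskLevels and both function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <vector>

struct Resting {
    std::uint64_t account;
    std::uint64_t qty;
};
using AskLevels = std::map<std::int64_t, std::vector<Resting>>;  // price ticks -> FIFO

// Naive check: sums everything at or below the buy limit, own orders included.
std::uint64_t naive_fillable(const AskLevels& asks, std::int64_t limit) {
    std::uint64_t total = 0;
    for (const auto& [price, orders] : asks) {
        if (price > limit) break;  // levels are sorted ascending
        for (const Resting& o : orders) total += o.qty;
    }
    return total;
}

// Corrected check: counts only liquidity the incoming account may trade against,
// and stops early once `needed` is covered.
std::uint64_t fillable_excluding_self(const AskLevels& asks, std::int64_t limit,
                                      std::uint64_t account, std::uint64_t needed) {
    assert(needed > 0);
    std::uint64_t total = 0;
    for (const auto& [price, orders] : asks) {
        if (price > limit) break;
        for (const Resting& o : orders) {
            if (o.account == account) continue;  // STP would block this fill
            total += o.qty;
            if (total >= needed) return total;
        }
    }
    return total;
}
```

In the scenario from the question, the naive sum passes the FOK because it counts the account's own resting quantity, and the order then under-fills once STP skips those orders. Walking individual orders costs more than summing level totals; keeping per-account totals per level is one possible way to recover the cheaper check.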

Explain your design choices step-by-step and justify why your approach can meet strict runtime and latency requirements in C++.
