How do I approach System Design interview questions?

System Design questions require understanding of core concepts and practice. PracHub provides solutions with explanations to help you master system design interviews.

What difficulty level is this interview question?

This is a medium difficulty System Design question, commonly asked during Technical Screen rounds at Microsoft.

What role is this question designed for?

This question is commonly asked for Software Engineer candidates at Microsoft during technical interviews.

Design a URL Shortener (High-Level and Low-Level Design)

Q: Design a URL Shortener (High-Level and Low-Level Design)

This question evaluates the ability to design a large-scale distributed system, covering both high-level architecture and low-level implementation details such as key generation and data modeling. It is commonly used to assess practical system design skills, including capacity estimation, read/write scaling trade-offs, and storage schema decisions. The scenario tests conceptual reasoning about distributed key-value systems alongside applied design of concrete APIs and data structures.

Design a URL shortening service similar to TinyURL or Bitly. The service accepts a long URL and returns a much shorter alias; when anyone visits the short alias, the service redirects them to the original long URL. Walk through both the high-level architecture and the low-level design, and be ready to justify your key-generation strategy and storage choices.

Constraints & Assumptions

Read-heavy workload: assume roughly a 100:1 read:write ratio (redirects vastly outnumber creations).
Scale assumption: about 500M new short URLs per month (~200 writes/sec average); at 100:1 that is ~20,000 redirects/sec average, with peaks several times higher.
Short codes should be as short as practical (target ~7 characters) drawn from [a-zA-Z0-9] (base-62).
A short link, once created, must keep resolving to the same long URL for its lifetime.
Links may have an optional expiration; assume a long default TTL (e.g., a few years) unless the user sets one.
Redirects must be low-latency (target p99 < ~100 ms) and highly available — a redirect should essentially never return a 5xx.
Optional features to consider but not over-build: custom/vanity aliases and per-link click analytics.

Clarifying Questions to Ask

What is the expected scale (new URLs per day, total stored) and the read:write ratio?
Do short links expire, and if so who controls the TTL?
Do we need custom/vanity aliases, and must generated codes be hard to guess (security/enumeration concern)?
What is the allowed character set and maximum length for a short code?
Do we need click analytics (counts, geo, referrer) on the redirect path?
What are the availability, latency, and consistency targets?

Part 1 — High-Level Design

Lay out the functional and non-functional requirements, do a back-of-the-envelope capacity estimation, define the public API, and sketch the high-level architecture — explicitly separating the write (create) path from the read (redirect) path.

Clarifying Questions for this Part

Should redirects use 301 (permanent) — browser/CDN-cacheable, cheaper, but you lose per-click analytics — or 302 (temporary) so every click reaches your servers and can be counted?

What This Part Should Cover Premium

Part 2 — Low-Level Design

Design the short-code generation and the data model. Compare key-generation strategies, handle collisions and custom aliases, choose a storage engine with justification, and sketch the core classes/interfaces.

What This Part Should Cover Premium

What a Strong Answer Covers Premium

Follow-up Questions

How do you guarantee two concurrent creates never issue the same code? (KGS, DB unique constraint + retry, counter ranges per server.)
At this scale, how do you expire and garbage-collect old or expired links?
How would you add per-link click analytics without adding latency to the redirect path?
As the datastore outgrows one node, how do you shard/partition it, and how does that interact with your key scheme?

Constraints & Assumptions

Read-heavy workload: assume roughly a 100:1 read:write ratio (redirects vastly outnumber creations).
Scale assumption: about 500M new short URLs per month (~200 writes/sec average); at 100:1 that is ~20,000 redirects/sec average, with peaks several times higher.
Short codes should be as short as practical (target ~7 characters) drawn from [a-zA-Z0-9] (base-62).
A short link, once created, must keep resolving to the same long URL for its lifetime.
Links may have an optional expiration; assume a long default TTL (e.g., a few years) unless the user sets one.
Redirects must be low-latency (target p99 < ~100 ms) and highly available — a redirect should essentially never return a 5xx.
Optional features to consider but not over-build: custom/vanity aliases and per-link click analytics.

Clarifying Questions to Ask

What is the expected scale (new URLs per day, total stored) and the read:write ratio?
Do short links expire, and if so who controls the TTL?
Do we need custom/vanity aliases, and must generated codes be hard to guess (security/enumeration concern)?
What is the allowed character set and maximum length for a short code?
Do we need click analytics (counts, geo, referrer) on the redirect path?
What are the availability, latency, and consistency targets?

Part 1 — High-Level Design

Clarifying Questions for this Part

Should redirects use 301 (permanent) — browser/CDN-cacheable, cheaper, but you lose per-click analytics — or 302 (temporary) so every click reaches your servers and can be counted?

What This Part Should Cover Premium

Part 2 — Low-Level Design

What This Part Should Cover Premium

What a Strong Answer Covers Premium

Follow-up Questions

How do you guarantee two concurrent creates never issue the same code? (KGS, DB unique constraint + retry, counter ranges per server.)
At this scale, how do you expire and garbage-collect old or expired links?
How would you add per-link click analytics without adding latency to the redirect path?
As the datastore outgrows one node, how do you shard/partition it, and how does that interact with your key scheme?

Design a URL Shortener (High-Level and Low-Level Design)

Quick Overview

Design a URL Shortener (High-Level and Low-Level Design)

Constraints & Assumptions

Clarifying Questions to Ask

Part 1 — High-Level Design

Clarifying Questions for this Part

What This Part Should Cover Premium

Part 2 — Low-Level Design

What This Part Should Cover Premium

What a Strong Answer Covers Premium

Follow-up Questions

Submit Your Answer to Earn 20XP

Design a URL Shortener (High-Level and Low-Level Design)

Quick Overview

Design a URL Shortener (High-Level and Low-Level Design)

Constraints & Assumptions

Clarifying Questions to Ask

Part 1 — High-Level Design

Clarifying Questions for this Part

What This Part Should Cover Premium

Part 2 — Low-Level Design

What This Part Should Cover Premium

What a Strong Answer Covers Premium

Follow-up Questions

Submit Your Answer to Earn 20XP