PracHub
QuestionsCoachesLearningGuidesInterview Prep
|Home/System Design/Confluent

Design a distributed key-value store at scale

Last updated: May 26, 2026

Quick Overview

Design a distributed key-value store at scale evaluates requirements, scale assumptions, API/data design, architecture, trade-offs, failure modes, and rollout in a realistic interview setting. A strong answer states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

  • hard
  • Confluent
  • System Design
  • Software Engineer

Design a distributed key-value store at scale

Company: Confluent

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Technical Screen

Design a globally distributed key-value store optimized for read-heavy workloads. Address: ( 1) OS-level performance considerations (threads vs. async I/O, context switching, memory management, filesystem tuning); ( 2) storage layout and indexing choices, including compaction and write amplification trade-offs; ( 3) partitioning and sharding strategies, key distribution, and rebalancing; ( 4) replication and caching layers (write/read paths, coherence, TTLs, invalidation); ( 5) consistency models and CAP trade-offs, including client-visible guarantees; ( 6) failure detection, fault isolation, leader election, and recovery; ( 7) hotspot mitigation, backpressure, and rate limiting; ( 8) capacity planning, SLAs/SLOs, and observability. For each component, justify design choices and discuss performance, complexity, and resource trade-offs.

Quick Answer: Design a distributed key-value store at scale evaluates requirements, scale assumptions, API/data design, architecture, trade-offs, failure modes, and rollout in a realistic interview setting. A strong answer states assumptions, handles edge cases, explains trade-offs, and shows how to validate the result clearly.

Related Interview Questions

  • Design an RSS News Feed Service - Confluent (medium)
  • Design a News Feed and Mail Service - Confluent (medium)
  • Design RSS Feed and Temporary Mail - Confluent (medium)
  • Design a temporary email service - Confluent (hard)
|Home/System Design/Confluent

Design a distributed key-value store at scale

Confluent logo
Confluent
Jul 26, 2025, 12:00 AM
hardSoftware EngineerTechnical ScreenSystem Design
25
0

Design a distributed key-value store at scale

System Design Prompt: Globally Distributed, Read-Optimized Key-Value Store

Context and minimal assumptions

Design a globally distributed key-value (KV) store optimized for read-heavy workloads. Assume:

  • Workload: 90–95% reads, 5–10% writes; point lookups dominate, range scans are rare.
  • Data model: opaque values by key; keys up to ~1 KB; median value ~1 KB (max 1 MB).
  • Scale: tens of TB per region; multi-tenant; multi-region across continents.
  • Latency targets: in-region p99 read ≤ 5–10 ms; global p99 read ≤ 150 ms via nearest region.
  • Availability/durability: ≥ 99.99% availability; RPO 0 within a region; cross-region replication tolerated to be async by default.

Task

Propose an end-to-end design and justify your choices. Address the following areas with performance, complexity, and resource trade-offs:

  1. OS-level performance considerations (threads vs async I/O, context switching, memory management, filesystem and kernel tuning).
  2. Storage layout and indexing (on-disk format, compaction strategy, write amplification trade-offs).
  3. Partitioning and sharding (key distribution, shard sizing, rebalancing strategy).
  4. Replication and caching layers (write/read paths, coherence, TTLs, invalidation strategies).
  5. Consistency models and CAP/PACELC trade-offs (client-visible guarantees and tunable options).
  6. Failure detection, fault isolation, leader election, recovery and repair.
  7. Hotspot mitigation, backpressure, and rate limiting (per-tenant fairness and overload control).
  8. Capacity planning, SLAs/SLOs, and observability (metrics, tracing, alerting; how to validate the design).

Be explicit about assumptions, call out pitfalls/edge cases, and use small numeric examples where helpful.

Constraints & Assumptions

  • Preserve the scope, facts, inputs, and requested outputs from the prompt above.
  • If the prompt leaves a detail unspecified, state a reasonable assumption before relying on it.
  • Keep the answer interview-ready: concise enough to present, but concrete enough to implement or evaluate.

Clarifying Questions to Ask

  • Clarify users, core use cases, read/write patterns, scale, latency, availability, and data retention.
  • State explicit assumptions before making sizing or architecture decisions.
  • Prioritize the functional path first, then address reliability, security, observability, and rollout.

What a Strong Answer Covers

  • A scoped requirements summary with concrete non-goals and success metrics.
  • API, data model, architecture, consistency, capacity, and operations.
  • Reasoned trade-offs among simple and scalable designs, including bottlenecks and failure modes.
  • A validation, monitoring, migration, and launch plan appropriate for the risk level.

Follow-up Questions

  • What breaks first at 10x traffic or data volume?
  • How would you degrade gracefully during dependency failures?
  • What metrics and alerts would prove the design is healthy after launch?

Submit Your Answer to Earn 20XP

Sign in to leave a comment

Loading comments...

Browse More Questions

More System Design•More Confluent•More Software Engineer•Confluent Software Engineer•Confluent System Design•Software Engineer System Design

Your design canvas — auto-saved

PracHub

Master your tech interviews with 8,000+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • AI Coding Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.