This question evaluates knowledge of distributed storage and system design concepts, including data modeling, partitioning and consistent hashing, replication and consensus, write/read path trade-offs (e.g., LSM-tree vs B-Tree), compaction, caching, failure handling, and support for efficient range scans in a scalable key–value store.
You are asked to design a distributed key–value (KV) store that supports the following operations at scale:
Assume billions of keys, multi-terabyte datasets, horizontal scalability, and high availability across multiple availability zones. Low p99 latency is desired for point lookups and sequential range scans.
Discuss and justify design choices for the following topics:
Provide a high-level architecture and call out key trade-offs, pitfalls, and how you will validate the design under load.
Login required