PracHub
QuestionsPremiumCoachesLearningGuidesInterview Prep
|Home/System Design/Google

Design distributed log storage service

Last updated: Mar 29, 2026

Quick Overview

This question evaluates a candidate's expertise in designing distributed storage systems, covering partitioned append-only logs, high-throughput writes and reads, replication, retention, indexing, compaction, leader/follower roles, and recovery mechanisms.

  • hard
  • Google
  • System Design
  • Software Engineer

Design distributed log storage service

Company: Google

Role: Software Engineer

Category: System Design

Difficulty: hard

Interview Round: Technical Screen

How would you design storage for distributed data logs supporting append-only writes, retention policies, partitioning, replication, and high-throughput sequential reads? Detail segment management, compaction, indexing, leader and follower roles, and recovery.

Quick Answer: This question evaluates a candidate's expertise in designing distributed storage systems, covering partitioned append-only logs, high-throughput writes and reads, replication, retention, indexing, compaction, leader/follower roles, and recovery mechanisms.

Related Interview Questions

  • Design a Security Monitoring Framework - Google (medium)
  • Design an Online Coding Judge Platform - Google (medium)
  • Design Calendar Event Conflict Handling - Google (medium)
  • Design a pub-sub replay system - Google (hard)
  • How to host many domains on one IP? - Google (medium)
Google logo
Google
Sep 6, 2025, 12:00 AM
Software Engineer
Technical Screen
System Design
6
0

Design a Distributed Append-Only Log Storage System

You are asked to design the storage layer of a distributed, partitioned, replicated append-only log service that supports:

  • Append-only writes with high throughput
  • Retention policies (by time and/or size)
  • Partitioning for horizontal scalability
  • Replication for fault tolerance
  • High-throughput sequential reads

Detail the following components and behaviors:

  1. Segment management: active vs. sealed segments, rollover conditions, deletion, and preallocation.
  2. Compaction: when and how to compact, tombstones/deletes, write amplification tradeoffs.
  3. Indexing: offset-to-file-position index, time index, sparsity, and rebuild.
  4. Leader/follower roles: write and read paths, quorum/ack semantics, high watermark, lag handling.
  5. Recovery: crash recovery, index rebuild, truncation, leader election safety, and follower catch-up.

Assume a large-scale multi-node deployment with commodity disks and network, and that clients produce and consume records identified by monotonically increasing offsets within each partition.

Solution

Show

Submit Your Answer

Sign in to leave a comment

Loading comments...

Browse More Questions

More System Design•More Google•More Software Engineer•Google Software Engineer•Google System Design•Software Engineer System Design
PracHub

Master your tech interviews with 8,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.