PracHub
QuestionsPremiumLearningGuidesCheatsheetNEWCoaches
|Home/System Design/LinkedIn

Design an exception monitoring system with top‑K

Last updated: Mar 29, 2026

Quick Overview

This question evaluates the ability to design a scalable, low-latency exception monitoring system focusing on streaming ingestion, event grouping/fingerprinting, data modeling for raw and aggregated stores, retention policies, and payload privacy.

  • medium
  • LinkedIn
  • System Design
  • Software Engineer

Design an exception monitoring system with top‑K

Company: LinkedIn

Role: Software Engineer

Category: System Design

Difficulty: medium

Interview Round: Onsite

## System Design: Exception Monitoring with Top-K Design an **exception monitoring system** for a microservices environment. ### Core requirements - Services emit exception events (message, stack trace, service name, environment, version, timestamp, severity, request context). - The system should enable on-call engineers to: - View **Top K exceptions** over a time window (e.g., last 5/15/60 minutes), grouped/deduplicated by “same exception.” - Filter by service, environment (prod/staging), deployment version, region. - Drill down into a group to see recent samples and aggregated stats. ### Non-functional requirements - High write throughput, low-latency queries for Top K. - Handle duplicates, retries, bursts (incident storms). - Retain raw data for debugging (e.g., 7–30 days) and aggregated metrics longer. - Protect sensitive data in payloads. ### Clarifications to address - How exceptions are **collected** from services. - How events are **grouped** (fingerprinting) and how you store/query efficiently. - What the **database schema** / key columns look like for both raw events and aggregates. Deliverables: high-level architecture, data flow, storage choices, and APIs used by UI/on-call tooling.

Quick Answer: This question evaluates the ability to design a scalable, low-latency exception monitoring system focusing on streaming ingestion, event grouping/fingerprinting, data modeling for raw and aggregated stores, retention policies, and payload privacy.

Related Interview Questions

  • Review a Web Application Architecture - LinkedIn (easy)
  • Scale a Distributed Randomized Multiset - LinkedIn (medium)
  • Design a Top-K Ranking Service - LinkedIn (easy)
  • Design a Global Calendar Service - LinkedIn (medium)
  • Design a malicious-URL checking service using an isMalicious API - LinkedIn (medium)
LinkedIn logo
LinkedIn
Oct 13, 2025, 12:00 AM
Software Engineer
Onsite
System Design
4
0
Loading...

System Design: Exception Monitoring with Top-K

Design an exception monitoring system for a microservices environment.

Core requirements

  • Services emit exception events (message, stack trace, service name, environment, version, timestamp, severity, request context).
  • The system should enable on-call engineers to:
    • View Top K exceptions over a time window (e.g., last 5/15/60 minutes), grouped/deduplicated by “same exception.”
    • Filter by service, environment (prod/staging), deployment version, region.
    • Drill down into a group to see recent samples and aggregated stats.

Non-functional requirements

  • High write throughput, low-latency queries for Top K.
  • Handle duplicates, retries, bursts (incident storms).
  • Retain raw data for debugging (e.g., 7–30 days) and aggregated metrics longer.
  • Protect sensitive data in payloads.

Clarifications to address

  • How exceptions are collected from services.
  • How events are grouped (fingerprinting) and how you store/query efficiently.
  • What the database schema / key columns look like for both raw events and aggregates.

Deliverables: high-level architecture, data flow, storage choices, and APIs used by UI/on-call tooling.

Solution

Show

Comments (0)

Sign in to leave a comment

Loading comments...

Browse More Questions

More System Design•More LinkedIn•More Software Engineer•LinkedIn Software Engineer•LinkedIn System Design•Software Engineer System Design
PracHub

Master your tech interviews with 7,500+ real questions from top companies.

Product

  • Questions
  • Learning Tracks
  • Interview Guides
  • Resources
  • Premium
  • For Universities
  • Student Access

Browse

  • By Company
  • By Role
  • By Category
  • Topic Hubs
  • SQL Questions
  • Compare Platforms
  • Discord Community

Support

  • support@prachub.com
  • (916) 541-4762

Legal

  • Privacy Policy
  • Terms of Service
  • About Us

© 2026 PracHub. All rights reserved.