This question evaluates understanding of real-time streaming system architecture and stateful analytics, covering event-time processing and late/out-of-order handling, ingestion and partitioning strategies, state management and checkpointing, fault tolerance, and algorithms for top-k aggregation.
You are given an unbounded stream of purchase events with schema:
Design a system to compute and serve the top-k most purchased items in real time, supporting:
Discuss and justify choices for:
Login required