This question evaluates relational data modeling and data engineering competencies for high-volume event analytics, including schema design, partitioning and clustering strategies, duplicate and late-arriving event handling, and normalization versus denormalization trade-offs.

You are designing the event store for a high-volume consumer app. The schema must support common product analytics and growth metrics efficiently, including:
Assume at-least-once delivery from clients/services, possible late-arriving events, and high cardinality in event parameters. The store will be queried in a columnar data warehouse or relational engine that supports partitioning and clustering.
Design a relational schema to store user events so that the metrics above can be computed efficiently. Provide:
Login required
Related Questions