This question evaluates a candidate's ability to design scalable analytics schemas and event-driven data flows for high-throughput, near-real-time tracking, testing competencies in data modeling, ingestion architecture, and query performance.
You are building an analytics data store to track user behavior and commerce events so that product and business teams can run reliable, fast queries. Assume web and mobile clients emit events at scale (millions per day). The latency target for analytics is minutes, not milliseconds.
Describe the schema, event model, and data flow for an analytical database that can efficiently answer the above and similar pre-written queries.
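As one possible starting point for the event model, the sketch below shows a single event envelope shared by web and mobile clients, deduplicated on a client-generated ID and grouped into daily partitions by event time. Everything here is an assumption made for illustration and does not come from the question: the `Event` fields, the `partition_key` and `ingest` helpers, the daily partition grain, and the in-memory `store`, which stands in for a real analytical table.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Any, Dict, Iterable, List, Set


@dataclass(frozen=True)
class Event:
    """Hypothetical event envelope; field names are illustrative assumptions."""
    event_id: str          # client-generated unique ID, used for deduplication
    event_type: str        # e.g. "page_view", "add_to_cart", "purchase"
    user_id: str
    platform: str          # "web" or "mobile"
    occurred_at: datetime  # event time in UTC
    properties: Dict[str, Any] = field(default_factory=dict)


def partition_key(event: Event) -> str:
    """Derive a daily partition from event time (assumed partition grain)."""
    return event.occurred_at.strftime("%Y-%m-%d")


def ingest(events: Iterable[Event],
           store: Dict[str, List[Event]],
           seen_ids: Set[str]) -> int:
    """Append events to their partitions, skipping already-seen event IDs.

    Returns the number of events accepted. Deduplicating on event_id makes
    the load idempotent if the upstream delivers an event more than once.
    """
    accepted = 0
    for event in events:
        if event.event_id in seen_ids:
            continue  # duplicate delivery, ignore
        seen_ids.add(event.event_id)
        store.setdefault(partition_key(event), []).append(event)
        accepted += 1
    return accepted


# Usage: one micro-batch containing a duplicate delivery of event "e1".
store: Dict[str, List[Event]] = {}
seen: Set[str] = set()
ts = datetime(2024, 1, 15, 12, 30, tzinfo=timezone.utc)
batch = [
    Event("e1", "purchase", "u1", "web", ts, {"order_total_cents": 1999}),
    Event("e1", "purchase", "u1", "web", ts, {"order_total_cents": 1999}),
    Event("e2", "page_view", "u2", "mobile", ts),
]
accepted_count = ingest(batch, store, seen)
```

A minutes-level latency target leaves room for micro-batch loading like this, and partitioning by event date keeps time-bounded queries from scanning the full history. A real design would also need to handle late-arriving events and bound the size of the deduplication set.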