This question evaluates a candidate's competence in designing large-scale analytics backends, covering data ingestion, event and metric schemas, near-real-time and historical analytics, metric computation and slicing, data freshness and deduplication, privacy and access control, operational monitoring, and the API/query layer.
Design the backend of an internal analytics system for a conversational AI product similar to ChatGPT. The goal is to power an analytical metrics dashboard used by product managers and backend engineers.
The dashboard itself and the LLM serving stack are out of scope. Focus on the data platform and backend services that collect data from existing production databases and service logs, compute product and reliability metrics, and serve those metrics to the dashboard.
Your design should address:
Assume the system must support large-scale traffic and multi-month historical retention.