This question evaluates system design competency in distributed systems, focusing on scalability, multi-tenant isolation, rate-limiting algorithms, consistency and failover trade-offs, and operational concerns such as observability and safe degradation.
Design a multi-tenant, large-scale distributed Rate Limiter service used by many internal teams.
The service should allow product teams to enforce request limits (e.g., per user, per API key, per IP, per tenant) at very high QPS.
Explain a concrete architecture (components + data model + API), and walk through: