This question evaluates a candidate's ability to design a low-latency, high-QPS, multi-tenant frequency capping service, testing competencies in identity resolution, hierarchical cap modeling, window semantics, counter and storage architecture, consistency versus latency trade-offs, real-time decisioning APIs, configuration/versioning, backfill, reporting, privacy/compliance, and failure handling. It is commonly asked to assess systems-design proficiency in distributed systems, databases, caching, API design, and operational reliability; it belongs to the System Design domain and emphasizes practical application with system-level conceptual reasoning rather than purely algorithmic detail.
You are designing a service that ensures a user does not see the same ad creative or campaign more than a configured number of times within specified time windows (for example, 3/day per campaign, 10/week across an advertiser). The service must support real-time ad decisioning at high QPS with low latency, multi-tenant advertiser onboarding, and reliable reporting.
Assume an ad-supported video application with: high read QPS during ad selection, sub-10 ms p99 decision latency, multi-region deployment, and strict privacy/compliance requirements. The system integrates with an ad server/selector that provides candidate ads and expects an allow/deny decision before rendering.
Design the frequency capping service and compare it to a generic rate limiter. Explain the extra components required for advertiser configuration and onboarding.
Cover the following topics:
Also include a brief comparison to a generic rate limiter and highlight the additional components needed for advertiser onboarding and configuration.
Login required