This question evaluates system design and distributed systems competency with an emphasis on caching strategies for DAG-based computed views, testing understanding of cache placement, invalidation, consistency models, versioning, and cost-aware eviction.
You operate a cloud query engine where logical views are defined over base tables and/or other views. These definitions form a Directed Acyclic Graph (DAG): leaves are base tables; internal nodes are computed views (e.g., projections, filters, joins, aggregations). Queries typically target one or more root views.
Assume:
Design a caching strategy to reduce query latency. Address the following:
Discuss design trade-offs and include mechanisms such as versioning keys, triggers, and cost models.
Login required