This question evaluates a candidate's competency in ML system design and distributed systems, covering model lifecycle management, versioning, integrity verification, efficient rollout, local caching, security, observability, and fault recovery.
Design a system that distributes machine learning model artifacts from centralized storage to a large fleet of inference servers.
The system should support:
Describe the main components, host-side behavior, APIs, and scaling strategy.