This question evaluates a candidate's competency in designing production-grade ML systems, covering deployment architecture, online and batch inference, model and feature versioning, MLOps/CI-CD, monitoring and observability, rollout and safety strategies, and data/feedback instrumentation.
Assume you are designing a user-facing AI-powered feature for a web/mobile product. Some decisions must return predictions in real time (online inference), while others can be scheduled periodically (batch inference). Propose a production-grade design that addresses reliability, safety, privacy, and continuous improvement.
Be explicit about trade-offs and provide practical guardrails and fallbacks.
Login required