This question evaluates a candidate's competency in ML system design for recommendation systems, including choices of recommendation models, handling implicit feedback and cold-start, low-latency serving architecture, and storage/schema design for users, items, and interactions.
You are asked to design an end-to-end recommendation service that suggests items to users. The service should include choices for the recommendation model, the serving architecture with multiple application servers behind a reverse proxy, and a storage schema for users, items, and interactions.
Assume: millions of users and items, primarily implicit feedback (views, clicks, purchases), and a p95 online latency target under 150 ms for the recommendation endpoint.
Login required