This question evaluates a candidate's competency in online machine learning and recommender system design, covering contextual bandits, feature engineering for users/items/context, real-time feedback processing, exploration–exploitation strategies, reward definition, offline and online evaluation, and scalable, robust architecture.
You are designing an online learning recommendation system. At each user interaction:
Provide a design that covers:
State any minimal assumptions you need (e.g., feedback semantics, latency constraints), and make your design robust to non-stationarity and scale.
Login required