This question evaluates a candidate's competency in designing production-grade vector semantic search systems, including embedding model selection and training, ANN indexing and sharding, hybrid keyword-plus-semantic retrieval and reranking, caching and latency optimization, multi-tenant isolation, and freshness-aware update pipelines.
You own the vector semantic search layer for an AI assistant (e.g., Copilot). Users query across enterprise documents and/or product knowledge. Current issues:
Design a plan to optimize the end-to-end vector search system.