Design a low-latency RAG system | OpenAI Interview Question