This question evaluates a candidate's ability to design scalable LLM-powered search systems, including retrieval, indexing, long-context management, and integration of LLMs with document storage, and is categorized under ML system design.
You are asked to design an LLM-powered search system that lets users query a large corpus of documents (e.g., internal wikis, PDFs, logs, and web pages) and receive natural-language answers.
A key challenge is that both documents and user queries can be very long, often exceeding the context window (maximum token length) of the underlying large language model (LLM). For example, a user might paste multiple pages of logs or a long contract as part of their query.
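One common way to handle input that exceeds the context window is to split it into overlapping, token-bounded chunks before indexing or prompting. The following is a minimal sketch of that idea, not a prescribed solution. It assumes whitespace-separated words as a stand-in for the model's real tokenizer, and the function names `chunk_tokens` and `chunk_text` and the default sizes (512 and 64) are hypothetical, chosen only for illustration.

```python
from typing import List


def chunk_tokens(tokens: List[str], max_tokens: int, overlap: int) -> List[List[str]]:
    """Split a token list into windows of at most max_tokens tokens.

    Consecutive windows share `overlap` tokens so that text cut at a
    boundary still appears with some surrounding context in the next window.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must satisfy 0 <= overlap < max_tokens")

    step = max_tokens - overlap
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_tokens])
        # Stop once a window reaches the end, to avoid a redundant tail chunk.
        if start + max_tokens >= len(tokens):
            break
    return chunks


def chunk_text(text: str, max_tokens: int = 512, overlap: int = 64) -> List[str]:
    """Chunk raw text, using whitespace splitting as an assumed tokenizer."""
    return [" ".join(c) for c in chunk_tokens(text.split(), max_tokens, overlap)]
```

The same routine could be applied to both sides of the problem: long documents at indexing time and long pasted queries (logs, contracts) at request time. In a real system the whitespace split would be replaced by the tokenizer of the chosen LLM, so that chunk sizes match the model's actual token counts.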
Design the system with a focus on:
Describe your design in detail: