This question evaluates expertise in designing and training Retrieval-Augmented Generation (RAG) systems, including retriever, evaluator (reranker/verifier/filter), and generator components, with emphasis on model architecture choices, training objectives, data preparation under privacy and document-permission constraints, and evaluation strategies for grounded answers with citations. It is commonly asked to probe advanced ML system design and operationalization skills for mitigating hallucination, stale or conflicting sources, and long-document retrieval; the category is ML System Design and the level is practical application-focused with detailed modeling and training considerations.
Design an enterprise GPT-style assistant that allows employees to ask questions about internal company documents (policies, wikis, specs, tickets, PDFs, etc.). The core approach is Retrieval-Augmented Generation (RAG).
The interviewer is primarily focused on machine learning choices and training rather than generic infrastructure.
Assume the system must respect document-level permissions, and responses should be grounded in retrieved sources with citations.
Login required