This question evaluates understanding of chunking strategies for Retrieval-Augmented Generation systems, testing competencies in information retrieval, embedding and indexing trade-offs, document-structure-aware segmentation, and semantic chunking within the ML System Design and NLP domains.
You are building a Retrieval-Augmented Generation (RAG) system that uses an LLM plus a vector database. Before creating embeddings and indexing documents, you must split long documents into chunks.
Describe how you would design the chunking strategy. In your answer, discuss:
Login required