Definition
The process of determining the optimal size for breaking down large documents into smaller segments (chunks) for effective retrieval in RAG systems.
Why it matters (in Poovi’s context)
Affects the balance between providing sufficient context for the LLM and avoiding the ‘lost in the middle’ problem within limited context windows.
Key properties or components
- Context Window Limits
- Information Loss
- Semantic Cohesion
- Experimental Tuning
Contradictions or debates
None.