Summary
Introduce a chunking strategy optimized for technical documentation with code blocks and API references.
Problem
Generic character-based splitting can cut through code blocks, function signatures, and semantic boundaries, which reduces retrieval quality.
Expected Solution
Build a custom chunker that is structure-aware (headings, code fences, lists, tables, API sections), preserves useful context windows, and improves retrieval precision.
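A minimal sketch of what "structure-aware" could mean in practice, assuming the documentation is Markdown-like with `#` headings and triple-backtick fences. The function name `chunkTechnicalDoc`, the `maxChars` parameter, and the chunk fields `heading`, `text`, and `hasCode` are hypothetical and not taken from the existing code. Lists, tables, and API sections are not handled here; they are treated as ordinary prose.

```javascript
// Hypothetical sketch: split Markdown-like text into chunks without ever
// cutting through a fenced code block or merging across a heading.
function chunkTechnicalDoc(text, maxChars = 1200) {
  const blocks = [];   // atomic units: one prose paragraph or one code fence
  let buffer = [];
  let inFence = false;
  let heading = null;  // most recent heading seen outside a fence

  const flush = (type) => {
    if (buffer.length === 0) return;
    blocks.push({ type, heading, text: buffer.join("\n") });
    buffer = [];
  };

  for (const line of text.split("\n")) {
    if (line.trimStart().startsWith("```")) {
      if (!inFence) {
        flush("prose");      // close any open paragraph
        buffer.push(line);   // opening fence
        inFence = true;
      } else {
        buffer.push(line);   // closing fence
        flush("code");
        inFence = false;
      }
      continue;
    }
    if (inFence) {
      buffer.push(line);     // keep blank lines and '#' comments inside code
      continue;
    }
    const match = /^(#{1,6})\s+(.*)$/.exec(line);
    if (match) {
      flush("prose");
      heading = match[2].trim();
      continue;
    }
    if (line.trim() === "") {
      flush("prose");        // blank line ends a paragraph
      continue;
    }
    buffer.push(line);
  }
  flush(inFence ? "code" : "prose"); // an unterminated fence is kept as code

  // Pack blocks greedily. A block is never split, so a code fence longer
  // than maxChars becomes one oversized chunk rather than two broken ones.
  const chunks = [];
  let current = null;
  for (const block of blocks) {
    const fits =
      current !== null &&
      current.heading === block.heading &&
      current.text.length + block.text.length + 2 <= maxChars;
    if (fits) {
      current.text += "\n\n" + block.text;
      current.hasCode = current.hasCode || block.type === "code";
    } else {
      current = {
        heading: block.heading,
        text: block.text,
        hasCode: block.type === "code",
      };
      chunks.push(current);
    }
  }
  return chunks;
}
```

Keeping the heading as a field rather than inside the chunk text is one possible design choice: it lets the retrieval layer decide whether to prepend the heading as context at query time.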
Scope
- backend/chatWorker.js
- backend/utils/ragUtilities.js
- Chunk metadata fields and retrieval payload structure
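As a starting point for discussion, the metadata and payload structure might look like the sketch below. Every field name here (`id`, `source`, `heading`, `hasCode`, `position`, `total`) and the helper `toRetrievalPayload` are assumptions for illustration; the issue does not define a schema, and the current payload in the scoped files may differ.

```javascript
// Hypothetical sketch: wrap chunks in a retrieval payload with metadata.
// Input chunks are assumed to look like { heading, text, hasCode }.
function toRetrievalPayload(chunks, sourcePath) {
  return chunks.map((chunk, index) => ({
    id: `${sourcePath}#${index}`,        // stable per-document chunk id
    text: chunk.text,
    metadata: {
      source: sourcePath,
      heading: chunk.heading ?? null,    // nearest heading, if any
      hasCode: Boolean(chunk.hasCode),   // lets retrieval boost code answers
      position: index,                   // allows fetching neighbouring chunks
      total: chunks.length,
    },
  }));
}
```

Storing `position` and `total` is one way to preserve a context window: the retriever can pull the chunks immediately before and after a hit without re-chunking the document.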
Acceptance Criteria