Add BM25 retrieval and RRF fusion #31

Open
jorgenosberg wants to merge 20 commits into main from feat/bm25-rrf

Conversation

@jorgenosberg

Owner

Adds BM25 lexical retrieval and Reciprocal Rank Fusion (RRF) alongside the existing dense Chroma path. A new SQLite/FTS5 store at .docstra/index.db indexes chunk content and symbol names. FusionRetriever queries both FTS5 sources (chunks and symbols) and the dense path in parallel, then fuses the three ranked lists with RRF (k=60). It replaces the regex-based HybridRetriever at every call site. Knobs live under user_config.retrieval (rrf_k, fts_chunks_top_k, fts_symbols_top_k). Closes #19.
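
For reviewers unfamiliar with RRF, here is a minimal sketch of the fusion step, assuming each source returns a list of chunk ids ordered best first. The function name `rrf_fuse` and the sample ids are hypothetical and are not taken from this PR; the actual `FusionRetriever` may differ in details such as tie-breaking and truncation. Each id scores the sum of `1 / (k + rank)` over the lists it appears in, with 1-based ranks.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Sequence, Tuple


def rrf_fuse(
    rankings: Sequence[Sequence[str]],
    k: int = 60,
    top_n: Optional[int] = None,
) -> List[Tuple[str, float]]:
    """Fuse ranked id lists with Reciprocal Rank Fusion (hypothetical helper)."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based, so the top hit contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score; break ties by id so the output is deterministic.
    fused = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return fused if top_n is None else fused[:top_n]


# Illustrative inputs standing in for the dense path and the two FTS5 sources.
dense = ["c3", "c1", "c7"]
fts_chunks = ["c1", "c9", "c3"]
fts_symbols = ["c1", "c4"]

fused = rrf_fuse([dense, fts_chunks, fts_symbols], k=60)
for doc_id, score in fused:
    print(f"{doc_id}\t{score:.6f}")
```

Because RRF uses only ranks, the BM25 scores and the dense similarity scores never need to be normalized against each other. A larger `k` flattens the difference between top and lower ranks, which is what the `rrf_k` knob would control under this reading.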
