Skip to content

docs: TDD — AI inference KV & Rust-native RAG vectors (research)#189

Merged
stevei101 merged 1 commit into
developfrom
docs/tdd-inference-kv-rag-vectors
Jun 4, 2026
Merged

docs: TDD — AI inference KV & Rust-native RAG vectors (research)#189
stevei101 merged 1 commit into
developfrom
docs/tdd-inference-kv-rag-vectors

Conversation

@stevei101
Copy link
Copy Markdown
Contributor

Summary

Adds the research-group technical design document for storage layers used in AI inference and RAG.

Doc: docs/TDD-ai-inference-storage-and-rag-vectors.md

Tracks #188.

Contents

  • KV recommendations (Valkey default, Dragonfly at scale)
  • Rust-native RAG: fastembed/candle/ort + Qdrant/LanceDB/HNSW
  • Trait seams (EmbeddingModel, VectorStore)
  • Alignment with existing oxidizedRAG features and oxidizedgraph OPEN #18
  • ADRs, roadmap, open questions for research review

Test plan

Made with Cursor

Research-group design doc covering Valkey/Dragonfly for inference caches
and fastembed/candle/Qdrant/LanceDB/HNSW options for oxidizedRAG.

Tracking: #188
Co-authored-by: Cursor <cursoragent@cursor.com>
@stevei101 stevei101 merged commit 12821b1 into develop Jun 4, 2026
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants