Historical design documentation for The OASIS Project. These documents capture architectural decisions, completed feature designs, and implementation records that shaped the project. They are preserved here for reference after being retired from active repositories.
DAWN Archive
Design documents from the DAWN voice assistant, organized by subsystem.
SmartThings OAuth integration (blocked at AWS WAF)
Memory Subsystem
Memory has its own subdirectory at dawn/memory/ — see the memory README for an annotated index. Entries below are the canonical pointers from the top-level Atlas index.
Living snapshot. Current state of the memory subsystem — recently shipped, benchmark position, short/medium/long-term workstreams in priority order. Read first when starting memory-focused work.
bge-small-en-v1.5-int8 model swap + tech-debt cache-invalidation fix + ID-based extraction filter + per-user embedding recompute worker (schema v41). Lifted LoCoMo overall +7.9pp, LongMemEval R@5 +1.4pp. Cross-encoder reranker was Feature 2 of the original plan — investigated and reverted (see RERANKER_INVESTIGATION).
Cross-encoder reranker investigation: implemented (ms-marco-MiniLM-L-6-v2 int8 ONNX with CUDA EP, integrated across the memory and RAG paths, 5 config keys), then reverted after empirical results and a literature review showed no net benefit on conversational data and only marginal lift on LongMemEval at 10× latency. Kept artifacts: the shared WordPiece tokenizer (memory_embed_tokenizer) and the rerank_shootout.py test harness.
LoCoMo cat-3 failure-mode profiling, session-neighbor boost (Tier 2 quick win, +3.0pp dialog overall / +20.0pp cat-3), and memory-pipeline bench mode (Tier 1, Phase 0/1/1.5): end-to-end LoCoMo evaluation against extracted memory at production parity, the recall_reach metric, and a Haiku 4.5 result of 0.742 / 0.646 cat-3 (+9.3pp / +20pp over the dialog baseline). Identifies the retrieval vs answer-support framing for closing the gap to leaders.
Frozen reference of published numbers and methodology observations from competing memory-retrieval systems on LongMemEval, LoCoMo, and ConvoMem. No DAWN-side numbers — durable competitor research that doesn't rot with shipments. Includes the MemPalace methodology audit and the retrieval-vs-end-to-end-QA distinction.
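For readers unfamiliar with the "R@5" and "pp" shorthand used in the entries above, here is a minimal sketch of how such numbers are commonly computed. It assumes the fraction-of-relevant-items definition of recall@k; individual benchmarks (LongMemEval included) may define it differently, for example as a per-question hit rate. The function names `recall_at_k` and `pp_delta` are illustrative only and are not taken from the DAWN codebase.

```python
from typing import Iterable, Sequence


def recall_at_k(ranked_ids: Sequence[str], relevant_ids: Iterable[str], k: int = 5) -> float:
    """Fraction of relevant items that appear in the top-k retrieved results.

    Assumed definition; a benchmark may instead score 1.0 if any relevant
    item is in the top k.
    """
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = relevant.intersection(ranked_ids[:k])
    return len(hits) / len(relevant)


def pp_delta(before: float, after: float) -> float:
    """Difference between two scores in [0, 1], in percentage points (pp)."""
    return (after - before) * 100.0


if __name__ == "__main__":
    # Made-up example data, not DAWN results.
    ranked = ["m1", "m2", "m3", "m4", "m5", "m6"]
    print(recall_at_k(ranked, ["m2", "m6"], k=5))
    print(round(pp_delta(0.60, 0.65), 1))
```

A percentage-point delta is an absolute difference between two scores, not a relative improvement, so a move from 0.60 to 0.65 is +5pp regardless of the starting value.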