@@ -33,10 +33,15 @@ Completed since the prior snapshot:
33336 . Deterministic dummy SHA-256 embedder exists for pre-model hotpath testing (` embeddings/DeterministicDummyEmbeddingBackend.ts ` ).
34347 . Benchmark harness exists for dummy embedder throughput baselining (` npm run benchmark:dummy ` ).
35358 . Baseline adaptive provider selection exists with capability filtering + benchmark-based winner choice (` embeddings/ProviderResolver.ts ` , ` embeddings/EmbeddingRunner.ts ` ).
36+ 9 . External capability verification completed for real-provider planning:
37+ - Transformers.js is ONNX Runtime-backed and directly exposes `webnn`, `webgpu`, and `wasm` device paths.
38+ - Transformers.js does not currently expose `webgl` as a direct device; `webgl` should remain an explicit ORT adapter path.
3639
3740Next focus:
38411 . Wire resolved model profiles into runtime ingest/query entry points.
39- 2 . Add real embedding providers (ONNX/Transformers/WebNN/WebGPU/WebGL/WASM) to resolver candidate sets.
42+ 2 . Add real embedding providers to resolver candidate sets, split by runtime family:
43+ - Transformers.js provider (`webnn/webgpu/wasm`)
44+ - Explicit ORT WebGL provider (`webgl`)
40453 . Add browser/electron runtime test lanes to match merge-gate policy.
4146
4247## 1. Design
@@ -150,18 +155,23 @@ Performance budget targets for v1:
150155
151156Graceful degradation:
1521571 . ` webgpu ` preferred.
153- 2 . ` webgl ` fallback .
154- 3 . ` webnn ` optional path for matmul-friendly ops .
158+ 2 . ` webnn ` optional path for matmul-friendly ops .
159+ 3 . ` webgl ` fallback via explicit ORT adapter path .
1551604 . ` wasm ` guaranteed baseline.
156161
162+ Implementation note (verified 2026-03-11):
163+ 1 . Transformers.js path currently maps to ` webnn/webgpu/wasm ` (no direct ` webgl ` device key).
164+ 2 . Keep ` webgl ` in architecture through the explicit ORT adapter backend.
165+
157166### 1.9 Current gap analysis from repo snapshot
158167Observed blockers in current PoC files:
159- 1 . Embedding runtime modules (provider resolver + runner) are still missing.
160- 2 . Ingest/query orchestrators are not yet wired to resolved ` ModelProfile ` values.
161- 3 . Browser/Electron runtime test lanes are not yet implemented in scripts/CI.
162- 4 . Shader and backend files compile but are not yet integrated into a full vertical runtime path.
168+ 1 . Embedding runtime modules exist (` ProviderResolver ` + ` EmbeddingRunner ` ), but only baseline/dummy-provider flow is wired.
169+ 2 . Real provider adapters are not yet wired (Transformers.js for ` webnn/webgpu/wasm ` ; explicit ORT adapter for ` webgl ` ).
170+ 3 . Ingest/query orchestrators are not yet wired to resolved ` ModelProfile ` values.
171+ 4 . Browser/Electron runtime test lanes are not yet implemented in scripts/CI.
172+ 5 . Shader and backend files compile but are not yet integrated into a full vertical runtime path.
163173
164- These are Phase 0 blockers and should be fixed before feature work .
174+ These are the remaining vertical-slice blockers before broader feature expansion .
165175
166176## 2. Implementation Plan
167177
@@ -318,7 +328,7 @@ Priority legend:
3183286 . Add corruption recovery tooling for vector store and metadata store.
3193297 . Add schema migration tests across multiple versions.
3203308 . Add large-corpus stress tests for memory and latency.
321- 9 . Add adaptive runtime policy based on backend capability .
331+ 9 . Extend adaptive runtime policy with real providers and runtime telemetry persistence .
32233210 . Add resource governance controls for Daydreamer CPU budget.
32333311 . Improve ranking quality with optional rerank stage.
32433412 . Add developer docs with architecture diagrams and troubleshooting.
0 commit comments