Skip to content

feat(vscode): add VS Code extension with chat, completions, and agent tools#975

Open
ruthlessproductions wants to merge 1 commit intojundot:mainfrom
ruthlessproductions:claude/naughty-goldberg-88af10
Open

feat(vscode): add VS Code extension with chat, completions, and agent tools#975
ruthlessproductions wants to merge 1 commit intojundot:mainfrom
ruthlessproductions:claude/naughty-goldberg-88af10

Conversation

@ruthlessproductions
Copy link
Copy Markdown

Admittedly this is a big feature add that was vibe coded and probably has a number of issues. Mostly did it for my own benefit and thought I'd share in case it's useful.

Adds a full VS Code extension for oMLX with:

  • Inline code completions (Copilot-style, debounced)
  • Streaming chat sidebar with conversation history
  • Agentic tool loop: execute_code, run_command, read/write files, list_files, open_preview, start_server
  • Model quick-pick switcher showing loaded/unloaded state
  • Status bar with connection health and tokens-per-second
  • Settings panel (server URL, API key, model parameters)
  • Auto-selects first loaded model on startup
  • Apache 2.0 SPDX headers on all source files

Installation:
cd extensions/vscode
npm install
npm run build

Press F5 in VS Code to launch Extension Development Host

Or install the pre-built VSIX directly:

Extensions: Install from VSIX… → select extensions/vscode/vscode-omlx-0.1.0.vsix

Test results
3,723 tests pass. 8 pre-existing failures unrelated to this PR:

5× test_grammar.py — optional xgrammar dependency not installed
2× test_admin_* — preserve_thinking/turboquant_skip_last fields missing from test fixtures
1× test_accuracy_benchmark — benchmark count mismatch (16 vs 12)
Notes
All source files carry SPDX-License-Identifier: Apache-2.0 headers
Extension communicates with oMLX via the existing /v1/ and /admin/api/ endpoints — no server changes required
Bundled with esbuild (CommonJS, external: ['vscode']), no runtime npm dependencies

vscode-omlx-0.1.0.vsix.zip

… tools

Adds a full VS Code extension for oMLX with:
- Inline code completions (Copilot-style, debounced)
- Streaming chat sidebar with conversation history
- Agentic tool loop: execute_code, run_command, read/write files,
  list_files, open_preview, start_server
- Model quick-pick switcher showing loaded/unloaded state
- Status bar with connection health and tokens-per-second
- Settings panel (server URL, API key, model parameters)
- Auto-selects first loaded model on startup
- Apache 2.0 SPDX headers on all source files

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant