Control your entire computer with natural language, voice, and hand gestures.
An open-source, privacy-first AI agent that plans, executes, and verifies complex multi-step tasks.
π Visit the Official Website (helioxos.dev) π
Quick Start β’ JARVIS Mode β’ Features β’ Architecture β’ Security β’ Troubleshooting β’ Contributing
Unlike simple command runners, Heliox OS is a true agentic system inspired by robust autonomous architectures like OpenClaw, running a continuous ReAct loop with a modular multi-agent orchestrator:
- Gateway Hub & Memory β LLM evaluates persistent memory context before reasoning.
- Planner β Converts natural language into a structured multi-step action plan.
- Agent Orchestrator β Routes each action to the correct specialist agent (System, Code, Web, Monitor, Communication).
- Specialist Agents β Five domain experts execute actions via native OS APIs (never flimsy GUI automation).
- Verifier β Post-execution verification confirms the action succeeded and feeds results back into the loop.
- Reflector β Self-improvement engine learns from successes and failures.
- Security β Five-tier permission system with confirmation gates and rollback support.
Heliox OS has achieved true proactive autonomy, transitioning from a reactive assistant to an invisible, always-on background intelligence system natively integrated into your OS:
- π§ Proactive Suggestion Engine: Learns your daily workflows and pattern-matches your screen context to silently surface UI action suggestions (e.g., offering to summarize a long thread or launch an IDE when browsing issues) before you ask.
- β‘ Fire-and-Forget Autonomous Jobs: Spawn complex multi-step background tasks that decompose, execute, and verify completely independent of the UI or main event loop.
- ποΈ Always-On Screen Awareness: Automatically bootstrapped computer vision that tracks your contextual state cross-platform, natively bridging exactly what you see into the LLM planner.
- π€ Continuous Voice Listener: Real-time push-free 'Hey Heliox' ambient wake-word dispatch for frictionless task execution.
- π€ 30+ Hand Gestures & Air Drawing: Control your PC via webcam with static poses (Palm, Pinch) and motion gestures (Two-Finger Swipe).
- π Arc Reactor UI & Ambient HUD: Animated, immersive Tauri overlays responding contextually to system actions.
Heliox OS is now fully neuro-adaptive, integrating Meta's TRIBE v2 Cognitive Engine directly into the operating logic:
- Neural Cognitive HUD: Tracks real-time Saliency and Brain Load via screen vision buffer, mapped onto the Svelte UI.
- Dynamic TTS Stress-Pacing: JARVIS automatically slows down voice generation if you are engaged in high cognitive-load tasks.
- Neuro-Safe Destructive Gate: High-risk actions (e.g., recursive deletes) evaluate cognitive stress first. If you are distracted, a strict 10-second auditory confirmation gate holds the process.
- Subconscious Persona Fingerprint: The Subconscious background loop learns your neural visual plasticity constraints and encodes them to
persona.mdto align UIs to your brain. - Attention-Optimized Notifications: The notification pipeline dynamically buffers trivial background alerts until the system detects a "cortical transition" (a low-load resting state).
- ReAct Neural Cost Estimator: Tasks predict aggregate cognitive demand proactively logic. If executing a plan risks exceeding mental bandwidth, JARVIS pauses.
- JARVIS Intent Classifier: Fully integrated native intent fusion classifying spoken commands against current workload intensity.
Heliox OS now pushes the boundaries with 7 revolutionary biologically-inspired AI features powered by Meta's TRIBE v2 neural model:
Track user's physiological patterns over weeks (time-of-day productivity, stress cycles). Create personalized "cognitive fingerprints" that predict optimal interaction times. Implements a closed-loop feedback system where user responses refine the TRIBE predictions.
from pilot.cognitive.biometric_loop import BiometricLearningLoop
loop = BiometricLearningLoop(user_id="user")
loop.record_cognitive_sample(attention=0.7, stress=0.3, load=0.5)
recommendation = loop.get_interaction_recommendation()
print(recommendation.recommended, recommendation.interaction_type)Instead of reactive commands, Heliox proactively suggests actions based on predicted cognitive state. Example: "You've been on this task for 2 hours with increasing stress β want me to schedule a break?"
from pilot.cognitive.ambient_intelligence import AmbientIntelligenceEngine
ambient = AmbientIntelligenceEngine(biometric_loop)
await ambient.update_cognitive_state(attention=0.8, stress=0.6, load=0.7)
# Automatically generates proactive suggestions based on trendsExtend beyond screen vision β integrate webcam eye-tracking, audio tone analysis, keyboard/mouse dynamics. Build a unified "neural workspace" that predicts what the user will need before they ask.
from pilot.cognitive.neural_bridge import NeuralBridge
bridge = NeuralBridge()
bridge.record_keystroke()
bridge.record_mouse_move(x, y)
workspace = bridge.compute_workspace()
print(workspace.cognitive_state, workspace.predicted_need)When load > 80%, automatically surface "memory anchors" β key info from recent actions. Let Heliox absorb cognitive burden by remembering complex multi-step workflows.
from pilot.cognitive.cognitive_offload import CognitiveOffloader
offloader = CognitiveOffloader()
offloader.update_load(0.85) # Triggers offload surface
surface = offloader.get_offload_surface()
print(surface["anchors"], surface["workflows"])Move from static persona.md to a living "neural avatar" that changes daily based on cognitive patterns. The AI's communication style adapts: concise when stressed, detailed when relaxed.
from pilot.cognitive.evolving_persona import EvolvingPersonaEngine
persona = EvolvingPersonaEngine(user_id="user")
persona.record_interaction(attention=0.7, stress=0.3, load=0.5)
greeting = persona.get_greeting() # "Good afternoon! Ready to tackle anything?"
print(persona.get_ui_config()) # Adapts UI based on stateIf TRIBE detects high load on desktop, suggest continuing on mobile with context transfer. Build a "cognitive state cloud" that follows the user across devices.
from pilot.cognitive.cognitive_handoff import CognitiveHandoffEngine
handoff = CognitiveHandoffEngine(device_name="desktop")
handoff.capture_snapshot(attention=0.8, stress=0.6, load=0.7)
handoff.sync_to_cloud()
suggestion = handoff.get_handoff_suggestion(load=0.85, stress=0.3)Design the cognitive pipeline to be model-agnostic β swap TRIBE for future neural models. Create standard cognitive APIs that other developers can build on.
from pilot.cognitive.quantum_cognitive import create_pipeline, QuantumCognitivePipeline
pipeline = create_pipeline()
output = await pipeline.predict("user working on complex task")
print(output.attention_score, output.stress_level, output.cognitive_load)
# Switch models at runtime
pipeline.set_active_model("gpt_neuro") # Future model supportAll features are wrapped in a single interface:
from pilot.cognitive.hub import CognitiveHub
hub = CognitiveHub()
state = await hub.analyze("user is coding")
# All features accessible
print(f"Attention: {state.attention}, Stress: {state.stress}")
print(f"Optimal: {state.optimal_interaction}")
print(f"Overloaded: {state.is_overloaded}")
print(hub.get_greeting()) # Adaptive greeting
print(hub.get_offload_surface()) # Memory anchorsThe QuantumCognitivePipeline automatically uses TRIBE v2 when available:
Available Models:
- Meta TRIBE v2 (tribe_v2)
Available: True
Capabilities: ATTENTION_PREDICTION, STRESS_DETECTION, LOAD_ESTIMATION
Avg Latency: ~50ms
When TRIBE is unavailable, it gracefully falls back to heuristic models.
Heliox OS uses a modular multi-agent architecture where specialized agents collaborate to solve complex tasks:
| Agent | Domain | Key Skills |
|---|---|---|
| π₯οΈ System Agent | OS Operations | Files, processes, services, power, input control, screen vision, triggers |
| π» Code Agent | Development | Code generation, execution, debugging, dev tooling (git, pip, npm) |
| π Web Agent | Web & APIs | Browser automation, scraping, HTTP requests, downloads |
| π Monitor Agent | Monitoring | CPU, RAM, disk, network monitoring with threshold alerts |
| π‘ Communication Agent | Messaging | Email, Slack, Discord, webhooks, desktop notifications |
| π Calendar Agent | Scheduling | .ics parsing, CalDAV sync, event management |
How it works: The Planner generates an action plan β the Orchestrator analyzes each action type β routes to the correct specialist β agents execute in sequence β results merge for verification.
Dynamic Spawning: Agents can be created on-demand at runtime via the agent_spawn API endpoint.
| Task | Type | Status |
|---|---|---|
| Web scrape Wikipedia + word frequency analysis | Web + Code | β |
| Background CPU trigger with voice alert | System Monitor | β |
| Screenshot OCR + text reversal + file tree | Vision + Code | β |
| Multi-page web comparison (Python vs JS) | Web + Analysis | β |
| Create project scaffold + run unit tests | File + Code | β |
| REST API fetch + JSON parse + formatted table | API + Code | β |
| CSV data pipeline + financial analysis | Data + Code | β |
| And more... | β |
| Platform | Status |
|---|---|
| Windows 10/11 | β Full support |
| Ubuntu / Debian | β Full support |
| macOS | β Full support |
| Fedora / Arch | β Via dnf/pacman |
file_read Β· file_write Β· file_delete Β· file_move Β· file_copy Β· file_list Β· file_search Β· directory_summary Β· file_permissions
process_list Β· process_kill Β· process_info
shell_command Β· shell_script (multi-line bash/powershell/python)
code_execute β Run Python, PowerShell, Bash, or JavaScript with auto-fix on failure
browser_navigate Β· browser_extract Β· browser_extract_table Β· browser_extract_links
screenshot Β· screen_ocr Β· screen_analyze
package_install Β· package_remove Β· package_update Β· package_search
Auto-detects: winget, choco, brew, apt, dnf, pacman
system_info Β· cpu_usage Β· memory_usage Β· disk_usage Β· network_info Β· battery_info
window_list Β· window_focus Β· window_close Β· window_minimize Β· window_maximize
volume_get Β· volume_set Β· volume_mute
brightness_get Β· brightness_set Β· screenshot
power_shutdown Β· power_restart Β· power_sleep Β· power_lock Β· power_logout
wifi_list Β· wifi_connect Β· wifi_disconnect
clipboard_read Β· clipboard_write
schedule_create Β· schedule_list Β· schedule_delete Β· trigger_create
env_get Β· env_set Β· env_list
download_file
service_start Β· service_stop Β· service_restart Β· service_enable Β· service_disable Β· service_status
gnome_setting_read Β· gnome_setting_write Β· dbus_call
registry_read Β· registry_write
calendar_parse Β· calendar_sync Β· calendar_create_event Β· calendar_list_events Β· calendar_delete_event
open_url Β· open_application Β· notify
graph TD
User(["User Input: Voice, Text, Gestures"]) --> Gateway
subgraph "Frontend Gateway - Tauri + Svelte"
Gateway["WebSocket Hub"]
GUI["Desktop Window"]
HUD["Ambient System HUD"]
VC["Voice Controller"]
GC["Hand Gesture Controller"]
RPV["ReAct Pipeline Visualizer"]
TVS["Thought Visualization π§ "]
Gateway --- GUI
GUI --- HUD
GUI --- VC
GUI --- GC
GUI --- RPV
RPV --- TVS
end
Gateway --> Fusion
subgraph "Multimodal Fusion"
Fusion["Intent Fusion Engine"]
Fusion --> |"voice + gesture"| FusedIntent["Fused Intent"]
end
FusedIntent --> Daemon
subgraph "Agent Runtime - Python"
Daemon["Agent Server / ReAct Loop"] --> Memory[("Long-term Memory")]
Memory --> |"Vector + Semantic"| ChromaDB[("ChromaDB")]
Daemon --> Decomposer["Task Decomposer"]
Decomposer --> Planner["LLM Planner"]
Planner --> PromptImprover["Prompt Improver"]
PromptImprover --> |"reuse strategies"| Planner
Planner --> Router{"Model Router"}
Router --> Ext_LLM("Gemini / OpenAI / Claude")
Router --> Int_LLM("Ollama")
Planner --> Sandbox["Simulation Sandbox"]
Sandbox --> |"risk report"| Security["Security Gate"]
Security --> Orchestrator["Agent Orchestrator"]
end
subgraph "Multi-Agent System"
Orchestrator --> SA["System Agent"]
Orchestrator --> CA["Code Agent"]
Orchestrator --> WA["Web Agent"]
Orchestrator --> MA["Monitor Agent"]
Orchestrator --> COMM["Communication Agent"]
end
subgraph "Plugin Ecosystem"
PluginReg["Plugin Registry"]
PluginReg -->|"tools"| Orchestrator
P1["developer-tools"]
P2["media-control"]
P3["home-assistant"]
P1 --- PluginReg
P2 --- PluginReg
P3 --- PluginReg
end
SA --> Executor["System Executor"]
CA --> Executor
WA --> Executor
COMM --> Executor
MA --> BG["Background Tasks"]
Executor --> Verifier["Verifier"]
BG --> Verifier
Verifier --> Reflector["Reflector"]
Reflector --> PromptImprover
Reflector --> Memory
Reflector --> SkillReg["Skill Registry"]
subgraph "Subconscious Layer"
SubAgent["Subconscious Agent"]
SubAgent -->|"persona rules"| Planner
Reflector --> SubAgent
SubAgent --> PersonaFile["~/.heliox/persona.md"]
end
subgraph "Screen Awareness"
ScreenVision["Screen Vision Agent"]
ScreenVision -->|"context"| Planner
ScreenVision --> AppDetect["Active App Detector"]
ScreenVision --> DiffEngine["Screenshot Diff"]
end
subgraph "Reasoning Telemetry"
Daemon -.-> |"events"| ReasoningEmitter["Reasoning Emitter"]
ReasoningEmitter -.-> |"WebSocket"| TVS
end
Heliox OS implements 13 research-level features that push beyond typical AI agents:
| # | Feature | Status | Module |
|---|---|---|---|
| 1 | Persistent Long-Term Memory (Vector + Semantic) | β | memory/store.py + ChromaDB |
| 2 | Self-Reflection Loop | β | agents/reflector.py |
| 3 | Tool Discovery / Skill Registry | β | agents/reflector.py (skill_registry table) |
| 4 | Task Decomposition Engine | β | agents/decomposer.py |
| 5 | Autonomous Background Agents | β | agents/background.py + monitor_agent.py |
| 6 | Multi-Agent Collaboration | β | agents/orchestrator.py (5 specialists) |
| 7 | Real-Time Reasoning Visualization | β | reasoning/events.py + ReActPipeline.svelte |
| 8 | Simulation Sandbox | β | agents/sandbox.py |
| 9 | Self-Improving Prompt System | β | agents/prompt_improver.py |
| 10 | Plugin Ecosystem | β | plugins/__init__.py |
| 11 | Flagship Plugins (Developer, Media, IoT) | β | plugins/developer/, plugins/media/, plugins/homeassistant/ |
| 12 | Subconscious Agent (Persona Learning) | β | agents/subconscious.py |
| 13 | Screen Vision (Continuous Screen Awareness) | β | agents/screen_vision.py |
Complex goals are automatically broken into dependency-aware subtask trees:
User: "Build a Flask API for todo list"
β 1. [system] Create project folder
β 2. [system] Install Flask (depends: 1)
β 3. [code] Generate API code (depends: 1)
β 4. [code] Create requirements.txt (depends: 2)
β 5. [code] Run tests (depends: 3, 4)
Before executing dangerous commands, the sandbox produces an impact report:
β οΈ Simulation Report:
Risk: HIGH
Impact: 154 files affected (wildcard)
Warnings:
- β οΈ Plan contains destructive actions
- π Plan requires elevated privileges
- β»οΈ 2 action(s) are NOT reversible
Recommendation: β οΈ HIGH RISK β Confirm impact
Successful reasoning chains are stored and reused:
- Keyword-indexed prompt templates with success/failure rates
- Automatic strategy matching for similar future tasks
- Rolling improvement β the agent gets better over time
Heliox OS ships with 3 flagship plugins and supports community-built extensions:
| Plugin | Type | Capabilities |
|---|---|---|
| developer-tools | Code | Jira tickets, git clone, branch, commit, push, GitHub PRs |
| media-control | System | Spotify (play/pause/skip), system volume, YouTube, media keys |
| home-assistant | IoT | Smart lights, switches, thermostats, scenes, device discovery |
Drop custom plugins into ~/.heliox/plugins/ β they're auto-discovered at startup
after Ed25519 signature verification. Production registries can verify against
bundled trusted public keys; local plugin packages can include
plugin.ed25519.pub with plugin.ed25519.sig. Unsigned, untrusted, or tampered
plugins are rejected before their manifest or code is loaded.
{
"name": "my-plugin",
"version": "1.0.0",
"tools": [{"name": "my_tool", "inputs": ["arg1"], "action_type": "api_call"}]
}A background agent that runs every 30 minutes to review the day's actions and learn user preferences:
- Clusters behavioral patterns ("always writes Python", "prefers dark mode")
- Extracts actionable rules with confidence scores
- Writes a
~/.heliox/persona.mdthat is injected into planner context - Supports manual preference setting via
persona_add_preferenceAPI - Categories:
preference,habit,constraint,style
Continuous computer-vision loop that gives the agent awareness of what the user sees:
- Takes screenshots every 2 seconds, hashes for change detection
- Detects the active application and window title cross-platform
- Maintains a rolling context buffer of recent screen states
- When user says "summarize this" or "close that", the planner already knows the target
- Optional LLM-powered screen description for advanced awareness
The easiest way to get started is to download the pre-compiled installer for your operating system.
- Go to the GitHub Releases page.
- Download the installer for your OS:
- Windows:
Heliox OS_x64-setup.exe - macOS (Apple Silicon):
Heliox OS_aarch64.dmg - macOS (Intel):
Heliox OS_x86_64.dmg - Linux:
.AppImageor.deb
- Windows:
- Install the app.
- Open Heliox OS and enter your API Key (e.g., Gemini, OpenAI, Claude) in the Settings tab.
Note: The Python backend requires Python 3.11+ installed on your system. You must start the local daemon manually for now.
Error Messages: Root Cause: PyTorch requires CUDA runtime DLLs in system PATH
Fix:
-
Install Visual C++ Redistributable:
- Download: https://support.microsoft.com/en-us/help/2977003
- Install the x64 version
-
Reinstall PyTorch matching your CUDA version:
# For CUDA 12.1 (most common)
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
# For CUDA 11.8
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
# For CPU only
pip install torch torchvision torchaudio- Verify installation:
python -c "import torch; print(torch.cuda.is_available())"Error Messages: CUDA runtime version mismatch CUDA version is insufficient for this model RuntimeError: CUDA out of memory
Root Cause: PyTorch CUDA version doesn't match installed NVIDIA CUDA version
Fix:
- Check your CUDA version:
nvidia-smi- Check PyTorch CUDA version:
python -c "import torch; print(torch.version.cuda)"- Match your CUDA version using this table:
| NVIDIA CUDA | Installation Command |
|---|---|
| 12.4+ | pip install torch --index-url https://download.pytorch.org/whl/cu124 |
| 12.1 | pip install torch --index-url https://download.pytorch.org/whl/cu121 |
| 11.8 | pip install torch --index-url https://download.pytorch.org/whl/cu118 |
| CPU Only | pip install torch |
- Reinstall PyTorch:
pip uninstall torch torchvision torchaudio -y
pip cache purge
pip install torch --index-url https://download.pytorch.org/whl/cu121Requirements:
- Python 3.11+
- CUDA 11.8 or 12.1
- PyTorch installed first
- Visual C++ 2019 or newer
Error Messages: TRIBE v2 requires CUDA version 11.8 or higher ImportError: cannot import name 'TribeModel' from 'tribev2'
Installation Steps:
- Install PyTorch first:
pip install torch --index-url https://download.pytorch.org/whl/cu121- Clone and install tribev2 from facebookresearch:
git clone https://github.com/facebookresearch/tribev2.git
cd tribev2
pip install -e .- Verify installation:
python -c "from tribev2 import TribeModel; print('TRIBE v2 loaded successfully')"- Load a pretrained model:
from tribev2 import TribeModel
model = TribeModel.from_pretrained("facebook/tribev2", cache_folder="./cache")Error Messages: CUDA is not available Cannot find nvcc compiler
Fix:
-
Open Environment Variables:
- Press
Win + Xβ Click "System" - Click "Advanced system settings"
- Click "Environment Variables" button
- Press
-
Add CUDA to PATH:
- Find Path variable and click Edit
- Add these two paths:
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.1\binC:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.1\libnvvp
- (Replace v12.1 with your CUDA version)
-
Restart terminal and verify:
nvcc --versionError Message: RuntimeError: CUDA out of memory. Tried to allocate X.XX GiB
Solutions:
- Reduce batch size in your code:
batch_size = 32 # Instead of 64 or 128- Clear GPU memory:
python -c "import torch; torch.cuda.empty_cache()"- Check GPU memory:
python -c "import torch; print(torch.cuda.get_device_properties(0))"- Use CPU if needed:
device = "cpu" # Instead of "cuda"Error Message: error: Microsoft Visual C++ 14.0+ is required
Fix:
-
Download Visual Studio Build Tools:
- Go to: https://visualstudio.microsoft.com/downloads/
- Click "Build Tools for Visual Studio 2022"
-
Run the installer and select:
- β Desktop development with C++
- β Windows 11 SDK
-
Complete installation and restart your computer
Check your setup with this command:
python -c "import torch; print('PyTorch:', torch.__version__); print('CUDA Available:', torch.cuda.is_available()); print('CUDA Version:', torch.version.cuda if torch.cuda.is_available() else 'N/A')"Still having issues?
Try a clean reinstall:
pip cache purge
pip install --upgrade pip setuptools wheel
pip install torch --index-url https://download.pytorch.org/whl/cu121For more information, see PyTorch Installation Guide
Note: Windows contributors can use the automated
setup.ps1script for environment setup.
If PowerShell blocks the script, check the Windows setup instructions in CONTRIBUTING.md.
If you want to contribute or modify Heliox OS, build it from the source code:
1. Install the Python daemon:
git clone https://github.com/VyomKulshrestha/Heliox-OS.git
cd Heliox OS/daemon
pip install -e ".[full,dev]"2. Choose your LLM:
- Local (Ollama):
ollama pull llama3.1:8b->ollama serve - Cloud (Gemini/OpenAI/Claude): Add your API key in the app GUI.
3. Run the daemon:
cd daemon
python -m pilot.server4. Run the frontend:
cd tauri-app/ui
npm install
npm run dev"Show me my system info"
"Take a screenshot and read the text on screen"
"Go to Wikipedia's page on AI and summarize the first 3 paragraphs"
"Create a Python project with tests and run them"
"Kill the process using the most CPU"
"Monitor my CPU and alert me when it goes above 80%"
"Download a file and show me a tree of the folder"
"List all .py files on my Desktop"
"Set my volume to 50%"
"Create a CSV with sales data and analyze it"
"What's my IP address?"
"Install Firefox"
Warning
PLEASE READ BEFORE USE: SYSTEM COMPROMISE RISK Heliox OS is an autonomous agent with the ability to execute code, delete files, and run terminal commands directly on your host operating system. While we have provided sandbox measures, the AI has real system access. Do NOT run Heliox OS with root/Administrator privileges unless absolutely necessary. We are not responsible for accidental data loss caused by LLM hallucinations.
- All AI outputs pass through structured schema validation before execution
- Five-tier permission system (read-only through root-level)
- Confirmation required for system-modifying and destructive actions
- Snapshot-based rollback via Btrfs or Timeshift (Linux)
- Append-only audit log for all executed actions
- Command whitelist with optional unrestricted mode
- Encrypted API key storage via platform keyring (GNOME Keyring / Windows Credential Manager)
- API keys are NEVER logged, included in plans, or sent to local LLMs
| Tier | Level | Auto-Execute | Examples |
|---|---|---|---|
| 0 - Read Only | π’ | Yes | file_read, system_info, clipboard_read |
| 1 - User Write | π‘ | Yes | file_write, clipboard_write, env_set |
| 2 - System Modify | π | Needs Confirm | package_install, service_restart, wifi_connect |
| 3 - Destructive | π΄ | Needs Confirm | file_delete, process_kill, power_shutdown |
| 4 - Root Critical | β | Needs Confirm | root operations, disk operations |
Config file: ~/.config/pilot/config.toml
[model]
provider = "ollama" # "ollama" | "cloud"
ollama_model = "llama3.1:8b"
cloud_provider = "gemini" # "gemini" | "openai" | "claude"
[security]
root_enabled = false
confirm_tier2 = true
unrestricted_shell = false
snapshot_on_destructive = true
[server]
host = "127.0.0.1"
port = 8785
[proxy]
http = "http://proxy.example.com:8080"
https = "http://proxy.example.com:8080"
no_proxy = "localhost,127.0.0.1"A: First, verify you are using Python 3.11+. Check your version with python --version. If it still fails, check the logs at ~/.local/state/heliox-os/pilot.log for specific error messages. Ensure that port 8785 is not being used by another application.
A: Heliox OS stores API keys securely in your system keyring (GNOME Keyring/libsecret on Linux, Credential Manager on Windows) or an encrypted vault.enc file. It does not use .env files. Ensure you've added the key via the Settings tab in the UI. If the issue persists, check if libsecret-1-dev is installed (on Linux).
A: You can change your model provider in the ~/.config/heliox-os/config.toml file. Under the [model] section, set provider = "cloud" and specify your cloud_provider (e.g., "gemini", "openai", or "claude").
[model]
provider = "cloud"
cloud_provider = "gemini"A: Ensure that portaudio19-dev and python3-pyaudio are installed on your system. You may also need to grant your terminal or the Heliox app permission to access the microphone in your system settings.
A: Any standard USB or integrated webcam supported by your OS will work. Heliox OS uses OpenCV for vision tasks. If your camera isn't detected, ensure no other application is currently using it. Note that there is no CAMERA_INDEX configuration variable; the system automatically attempts to find the default camera.
A: Heliox OS uses port 8785 for the API and 8786 for mesh networking. If these ports are occupied, you can identify and stop the conflicting process:
Linux/macOS:
lsof -i :8785
kill -9 <PID>Windows:
netstat -ano | findstr :8785
taskkill /PID <PID> /FA: If the UI fails to launch, try clearing the npm cache and reinstalling dependencies:
cd tauri-app/ui
npm install
npm run devEnsure all frontend dependencies are installed successfully before starting the app.
A: Many Windows users encounter missing DLLs or CUDA version mismatches when installing the cognitive engine. Follow these steps:
- Missing DLLs (
msvcp140.dll,vcruntime140_1.dll): Install the Microsoft Visual C++ Redistributable. This is required for PyTorch's C++ extensions. - CUDA Mismatch: Ensure your PyTorch installation matches your system's CUDA version. Run
nvidia-smito check your driver's CUDA version, then reinstall PyTorch if necessary:# Example: For CUDA 12.1 pip install torch --index-url https://download.pytorch.org/whl/cu121 # For CPU-only (no GPU) pip install torch --index-url https://download.pytorch.org/whl/cpu
- OSError [WinError 126]: This usually indicates a missing dependency for
torchortorchaudio. Ensure you are using Python 3.11+ (64-bit) and try reinstalling the daemon dependencies:cd daemon pip install -e ".[full]"
We love contributions! Whether it's adding a new gesture, fixing a bug, or building a new plugin, check out our guides to get started.
- Read our Contributing Guide to set up your dev environment.
- Check the Good First Issues tab to find beginner-friendly tasks.
- Review our Code of Conduct.
- Join the community discussions in GitHub Discussions.
MIT
