Releases: markyip/Privox

v1.2 Release

26 May 05:33

Privox Release Notes

v1.2 (GPU Stability & VRAM Fix Update)

Release date: 2026-05-26

This update resolves a crash that affected users with NVIDIA GPUs in the 10–12 GB range (e.g. RTX 3080, 4070, 4070 Ti) when using Qwen-ASR alongside a refiner model. It also fixes a separate issue where VRAM was not fully released during the VRAM Saver idle state.

🐛 Bug Fixes

  • Fixed CUDA Out-of-Memory crash on startup (Qwen-ASR + 10–12 GB GPU): When both the grammar/refiner model and the Qwen-ASR model needed to share a mid-range GPU, the refiner could fill all available VRAM before the ASR model had a chance to load. Privox now automatically calculates how much VRAM the ASR model will need and reserves that headroom when loading the refiner, so both models can coexist without crashing.

  • Fixed VRAM not released during idle (VRAM Saver): After a recent update introduced smarter GPU layer placement via device_map, a subtle issue caused VRAM to remain occupied even after the VRAM Saver triggered and the app showed "Idle (VRAM Free)". The root cause was that the dispatch hooks installed by Hugging Face Accelerate intercept .cpu() calls on device-mapped models, silently preventing them from moving to CPU. Privox now correctly removes these hooks before offloading, ensuring GPU memory is fully freed at idle.

  • Fixed VRAM Saver timeout setting not accepting 0 (disabled): The Settings UI now correctly allows setting the VRAM Saver timer to 0 to disable it entirely, instead of clamping the value to a minimum of 5 seconds.
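
The headroom reservation described in the first fix can be sketched roughly as follows. This is an illustration only, not Privox's actual code: the function name, the 0.5 GiB safety margin, and the example sizes are all assumptions. On a real system the free-memory figure could come from torch.cuda.mem_get_info(), which returns a (free, total) pair in bytes.

```python
GIB = 1024 ** 3


def refiner_vram_budget(free_bytes, asr_estimate_bytes, safety_margin_bytes=GIB // 2):
    """Return how many bytes the refiner may occupy so the ASR model still fits.

    All three inputs are hypothetical: free VRAM right now, the estimated
    footprint of the ASR model, and a margin for CUDA context and scratch
    buffers. The result never goes below zero.
    """
    budget = free_bytes - asr_estimate_bytes - safety_margin_bytes
    return max(budget, 0)


# Example: a 10 GiB card with an (assumed) 4 GiB ASR model leaves 5.5 GiB
# for the refiner after the default margin is subtracted.
example_budget = refiner_vram_budget(10 * GIB, 4 * GIB)
```

A budget of zero would signal that the refiner has to stay on CPU (or load fewer layers on GPU) for the ASR model to fit.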

🔧 Improvements

  • Smarter VRAM allocation for Qwen-ASR: The ASR model now loads with an explicit VRAM budget (device_map="auto" + max_memory) on GPU systems, distributing model layers within available headroom rather than attempting a single large transfer that could fail.
  • VRAM flush between model loads: A garbage-collection and CUDA cache flush step was added between the grammar model and ASR model load sequences, ensuring any scratch memory held by the refiner is returned to the driver before ASR initialises.
  • Improved ASR cleanup on backend switch: Switching ASR models in Settings now correctly frees GPU memory from the previous model, including any accelerate dispatch hooks.
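
A minimal sketch of what these three improvements might look like in code, assuming a PyTorch model loaded through Hugging Face Accelerate. The helper names and the 16 GiB CPU cap are hypothetical; remove_hook_from_module, torch.cuda.empty_cache and gc.collect are existing library calls, but how Privox actually combines them is not shown in these notes.

```python
import gc

GIB = 1024 ** 3


def build_max_memory(gpu_budget_bytes, cpu_budget_bytes=16 * GIB, gpu_index=0):
    """Per-device byte caps in the dict shape used with device_map="auto"."""
    return {gpu_index: gpu_budget_bytes, "cpu": cpu_budget_bytes}


def flush_vram():
    """Collect garbage, then hand cached CUDA blocks back to the driver.

    Returns True only if a CUDA cache flush actually ran.
    """
    gc.collect()
    try:
        import torch  # optional here so the sketch also loads on CPU-only setups
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    torch.cuda.empty_cache()
    return True


def release_model(model):
    """Strip Accelerate dispatch hooks (if any), move the model to CPU, flush."""
    try:
        from accelerate.hooks import remove_hook_from_module
    except ImportError:
        remove_hook_from_module = None
    if remove_hook_from_module is not None:
        # recurse=True also clears hooks attached to submodules.
        remove_hook_from_module(model, recurse=True)
    model.cpu()
    return flush_vram()
```

The mapping returned by build_max_memory would be passed as max_memory when loading the model, and flush_vram would run between the refiner load and the ASR load.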

v1.1 Release

15 May 14:12

v1.1 (Windows Experience Update)

Release date: 2026-05-15

This update focuses on making Privox faster, more reliable, and easier to use on Windows. We've combined several recent improvements into this single "v1.1" release to ensure the best experience for all users.

🚀 Performance & Speed

  • Lightning Fast AI: We've optimized the app to take full advantage of modern NVIDIA graphics cards (including the RTX 40 and 50 series). This allows the AI to process your speech almost instantly without slowing down your computer.
  • Smoother Transcription: The system is now much more efficient at turning your voice into text, providing a snappier feel especially on newer hardware.

🛠️ Reliability & "Just Works"

  • Fixed Startup Issues: We resolved a common technical problem that caused the app to occasionally fail to start on certain Windows systems. It should now launch reliably every time.
  • Better Typing & Pasting: When you finish speaking, Privox is now smarter about finding your active window and pasting your text exactly where your cursor is.
  • Modern AI Engine: Upgraded core transcription engine (llama-cpp-python) to v0.3.23, enabling official Windows binary support and faster, more stable AI operations.
  • Cleaner Installation: The app now does a better job of keeping your computer tidy by automatically cleaning up temporary background files.
  • Refined Installer UI: Unified fonts and styling across the installer, and implemented correct taskbar icon association for a more professional look.
  • Improved Hotkey Reliability: Implemented physical key-state verification to prevent the hotkey from getting "stuck" or missed during high system load.
  • Pre-warm Models: Added a new "Pre-warm Models on Startup" feature (enabled by default) so the AI is ready to transcribe instantly without a first-use delay.
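
The physical key-state verification mentioned above can be pictured as a periodic reconciliation step. The sketch below is an assumption about the general technique, not the shipped implementation: the class and method names are invented, and the probe is injected so the logic stays platform-independent. On Windows such a probe could wrap the Win32 GetAsyncKeyState function.

```python
class HotkeyState:
    """Tracks a hotkey via events and corrects drift using a physical probe."""

    def __init__(self, is_physically_down):
        # is_physically_down: zero-argument callable returning True while
        # the key is really held (hypothetical hook into the OS).
        self._probe = is_physically_down
        self.pressed = False

    def on_key_down(self):
        self.pressed = True

    def on_key_up(self):
        self.pressed = False

    def reconcile(self):
        """Compare tracked state with the real key state.

        Returns True if the tracked state was wrong and has been corrected,
        for example after a key-up event was dropped under heavy load.
        """
        actual = bool(self._probe())
        if actual != self.pressed:
            self.pressed = actual
            return True
        return False
```

Calling reconcile() on a short timer means a missed key-up event leaves the hotkey "stuck" for at most one timer interval.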

🛡️ Privacy & Security

  • Strictly Local: We've further enhanced our privacy protections to ensure that your dictation never leaves your computer.
  • Silent & Unobtrusive: The app now runs more smoothly in the background, keeping itself out of your way while you work.

Privox v1.0.0 - Initial Release

26 Feb 16:47

Privox is a private, local-first voice input assistant for Windows that transcribes your speech and intelligently refines it using state-of-the-art AI models.

✨ Key Features

🎙️ High-Precision Transcription

  • Faster-Whisper Engine: Blazing fast transcription using the latest AI models.
  • True Multilingual Support: Specialized logic for English, Cantonese, Traditional Chinese, Japanese, Korean, and more.
  • Intelligent LID (Language Detection): Automatically identifies the language you are speaking with high-confidence safety thresholds.
  • Optimized Performance: Parallel model loading and optimized wake-up sequences ensure minimal latency when starting or resuming from idle.

✍️ Intelligent Refinement

  • Smart Refiner: Uses Llama-3.2-3B to polish your speech into grammatically perfect text while removing filler words like "uh" and "um".
  • Hallucination Safeguards: Multi-layered protection prevents the AI from adding unintended text or "hallucinating" on short inputs.
  • Anti-Assistant Protection: Persona guards ensure the AI stays focused on refining your text rather than acting as a chatbot.
  • Clean Output: Automatic meta-commentary stripping removes trailing AI "notes" or internal explanations.
  • Custom Personas: Choose how Privox writes—act like a Technical Writer, Engineer, or Lawyer with a single click.
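
As a rough illustration of the safeguards listed above, the sketch below strips a trailing "Note:"-style remark and rejects refinements that grow suspiciously relative to the spoken input. The regular expression, the thresholds, and both function names are assumptions made for this example; the actual rules Privox applies are not described in these notes.

```python
import re

# Matches a trailing block that starts on its own line with a typical
# meta-commentary lead word (hypothetical list) and runs to the end.
_TRAILING_NOTE = re.compile(
    r"\n\s*\(?(?:note|explanation|changes made)\b.*\Z",
    re.IGNORECASE | re.DOTALL,
)


def strip_trailing_notes(text):
    """Remove a trailing AI remark such as 'Note: I removed filler words.'"""
    return _TRAILING_NOTE.sub("", text).strip()


def accept_refinement(raw, refined, max_growth=1.5, min_words=3):
    """Fall back to the raw transcript when the refinement looks invented.

    Very short inputs are returned untouched, and a refinement with more
    than max_growth times the word count of the input is discarded.
    """
    raw_words = raw.split()
    if len(raw_words) < min_words:
        return raw
    if len(refined.split()) > max_growth * len(raw_words):
        return raw
    return refined
```

A line of genuine dictation that begins with "Note" would also be stripped by this pattern, so a production version would need a more careful rule.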

🛡️ Private & Secure

  • 100% Local: All processing happens on your computer. Your privacy is guaranteed.
  • Privacy-First Logging: User text and refined output are never written to logs—only diagnostic metadata is tracked.
  • Legitimacy Metadata: Embedded Windows Version Info and application manifests reduce false-positive flags from antivirus software.
  • Smart Auto-Stop: Advanced silence detection and safety timers ensure the microphone stops recording when you do.
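
Silence detection with a safety timer, as in the Smart Auto-Stop item, is commonly built on per-frame energy. The following is a generic sketch of that approach with made-up thresholds and names; it does not claim to reflect the detector Privox ships.

```python
import math


def rms(frame):
    """Root-mean-square level of one audio frame (samples in -1.0..1.0)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def frames_until_stop(frames, silence_threshold=0.01, silence_frames=50, max_frames=2000):
    """Return how many frames are consumed before recording should stop.

    Recording stops after silence_frames consecutive quiet frames, or after
    max_frames in total (the safety timer), whichever comes first.
    """
    quiet_run = 0
    count = 0
    for frame in frames:
        count += 1
        if rms(frame) < silence_threshold:
            quiet_run += 1
        else:
            quiet_run = 0  # speech resets the silence counter
        if quiet_run >= silence_frames or count >= max_frames:
            break
    return count
```

With 30 ms frames, the default of 50 quiet frames corresponds to about 1.5 seconds of silence, and 2000 frames caps a recording at about one minute.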

✍️ Premium Experience

  • Liquid Glass UI: A clean, monotone aesthetic with centered progress updates and professional typography.
  • One-Key Magic: Single hotkey operation for seamless start/stop recording.

🛠️ Effortless Local Setup

  • Smart Installer (bootstrap.py): Automatically manages portable Python environments and GPU/CUDA dependencies.
  • GPU Orchestration: Built-in support for NVIDIA GPUs with automatic fallback and repair for CPU mode.
  • Disk Safety: Verifies space and system requirements before downloading large AI assets.
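
A pre-download disk check of the kind described under Disk Safety can be done with the standard library alone. This is a hedged sketch: the function name is invented, and the 15 GiB default is taken from the approximate figure in the system requirements rather than from bootstrap.py itself.

```python
import shutil

GIB = 1024 ** 3


def has_enough_disk(path, required_bytes=15 * GIB):
    """Return True if the volume holding `path` has the required free space."""
    return shutil.disk_usage(path).free >= required_bytes
```

An installer would typically call this on the target install directory and abort with a clear message before starting any large model download.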

🖥️ System Requirements

  • OS: Windows 10/11.
  • GPU: NVIDIA GPU (CUDA 12+) recommended; CPU support available.
  • Disk: ~15GB free space for the high-quality local AI "brains".