🦭 Shadow Companion

TTS echo for language shadowing with Handy.

How it works

You read an article aloud
Handy transcribes your speech → saves to history database
This companion watches Handy's DB → TTS speaks your words back
You shadow the native pronunciation, repeat
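
The watch-and-speak loop above can be sketched roughly as follows. This is a minimal illustration, not the code in shadow.py: it polls SQLite rather than using kqueue, and the table name `transcriptions` with its `id` and `text` columns is hypothetical, since Handy's real schema is not described here. `speak` stands in for whatever hands text to the TTS worker.

```python
import sqlite3


def fetch_new_rows(conn, last_id, table="transcriptions"):
    """Return (id, text) rows with id > last_id, oldest first.

    The table and column names are hypothetical placeholders.
    """
    query = f"SELECT id, text FROM {table} WHERE id > ? ORDER BY id"
    return conn.execute(query, (last_id,)).fetchall()


def drain(conn, last_id, speak):
    """Pass each new transcription to speak() and return the newest id seen."""
    for row_id, text in fetch_new_rows(conn, last_id):
        speak(text)
        last_id = row_id
    return last_id
```

Calling `drain` repeatedly (on a timer, or whenever the database file changes) and carrying `last_id` forward means each transcription is spoken once.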

Supports two TTS providers:

Kokoro — Built-in voices, adjustable speed, fast startup
NeuTTS Air — Voice cloning from your own reference audio, streaming playback

Subprocess worker architecture

TTS runs in a child process for full memory reclamation:

Models load lazily on first utterance (not at startup)
After 10 minutes idle, the worker is killed — the OS reclaims all model memory instantly
For NeuTTS, torch is loaded only in a short-lived encoder subprocess that produces ref_codes.npy and then exits. The long-lived worker uses the ONNX codec and GGUF backbone with no torch at all (~1.5 GB vs ~3 GB of memory)
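
The lazy-start and idle-kill behaviour described above can be sketched as below. This is a minimal illustration under assumptions, not the implementation in shadow.py: the class name `LazyWorker` and its methods are hypothetical, and the worker command is whatever you pass in.

```python
import subprocess
import sys
import time


class LazyWorker:
    """Spawns a child process on first use and kills it after an idle period."""

    def __init__(self, cmd, idle_timeout=600.0):
        self.cmd = cmd
        self.idle_timeout = idle_timeout  # 600 s matches the 10 minutes above
        self.proc = None
        self.last_used = 0.0

    def ensure_running(self):
        # Lazy start: nothing is spawned until the first utterance arrives.
        if self.proc is None or self.proc.poll() is not None:
            self.proc = subprocess.Popen(self.cmd)
        self.last_used = time.monotonic()
        return self.proc

    def reap_if_idle(self, now=None):
        # Call periodically; returns True if the worker was killed.
        now = time.monotonic() if now is None else now
        if self.proc is None or now - self.last_used < self.idle_timeout:
            return False
        self.proc.kill()
        self.proc.wait()  # reap the child so the OS can release its memory
        self.proc = None
        return True
```

Because the models live in the child process, killing it returns all of their memory to the OS at once, which an in-process unload cannot guarantee.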

Rust TTS Worker (NeuTTS)

NeuTTS inference runs in a native Rust binary by default — faster cold start (~5s vs tens of seconds), lower memory, no Python interpreter overhead. The Rust worker is a drop-in replacement for the Python worker subprocess; shadow.py automatically uses it when the binary exists and the provider is neutts.

Default: Rust binary (tts-worker-rs/target/release/tts-worker) is used when present
Fallback: If the binary is missing or SHADOW_RUST_TTS=0 is set, the Python worker runs instead
Kokoro always uses the Python worker (not affected)
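
The selection rules above amount to a small decision function. The sketch below is an assumption about how such a check could look, not the code in shadow.py; the function name `use_rust_worker` and its parameters are hypothetical.

```python
import os
from pathlib import Path

# Path taken from the README; resolved relative to the repository root.
RUST_WORKER = Path("tts-worker-rs/target/release/tts-worker")


def use_rust_worker(provider, binary=RUST_WORKER, env=None):
    """Return True if the Rust worker should be used for this provider."""
    env = os.environ if env is None else env
    if provider != "neutts":
        return False  # Kokoro always uses the Python worker
    if env.get("SHADOW_RUST_TTS") == "0":
        return False  # explicit opt-out falls back to the Python worker
    return Path(binary).is_file()  # missing binary also falls back
```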

Build the Rust worker:

cd tts-worker-rs && cargo build --release

Requires: Rust 1.94+, CMake (for llama.cpp vendored build). First build takes a few minutes; subsequent builds are fast.

Prerequisites

macOS 13+ (kqueue for DB watching, Perry for menubar)
Python 3.10+
Handy — STT app that provides the transcription database
Node.js — for the Raycast extension (optional)

Python packages

Package	Required by	Purpose
`pykokoro`	Kokoro	TTS engine + ONNX inference
`sounddevice`	Both	Audio playback
`numpy`	Both	Audio array handling
`pyperclip`	Kokoro	Clipboard access
`Pillow`	Menubar	Icon generation
`spacy` (+ `en_core_web_sm`)	Kokoro	Text segmentation
`neutts`	NeuTTS	Voice cloning TTS engine
`llama-cpp-python`	NeuTTS	GGUF model inference
`onnxruntime`	NeuTTS	Codec decoder

External tools (optional)

Tool	Purpose
Perry	macOS menubar progress tracker
Raycast	Keyboard-driven control extension

Setup

cd ~/shadow-companion
python3 -m venv .venv
source .venv/bin/activate

# Core dependencies (required for Kokoro)
pip install pykokoro pyperclip sounddevice numpy Pillow spacy

# Download English language model
python -m spacy download en_core_web_sm

# For NeuTTS voice cloning (optional)
pip install neutts llama-cpp-python onnxruntime

Usage

Start the server (background)

python shadow.py serve
python shadow.py serve --provider neutts   # Use NeuTTS voice cloning

Control commands

python shadow.py status                     # Check if running
python shadow.py stop                       # Stop server
python shadow.py restart                    # Restart server
python shadow.py set-voice am_adam          # Change voice (Kokoro, hot-reloads)
python shadow.py set-speed 0.85             # Change speed (Kokoro only)
python shadow.py set-provider kokoro        # Switch TTS engine (requires restart)
python shadow.py set-provider neutts        # Switch to NeuTTS (requires restart)

Or run in foreground

python shadow.py                            # Default: kokoro, am_michael, 1.0x speed
python shadow.py --voice am_adam
python shadow.py --speed 0.85
python shadow.py --provider neutts

NeuTTS Voice Cloning

NeuTTS Air clones a voice from a short reference recording. This lets you practice shadowing with a voice similar to your own — or a native speaker's.

Setup

python shadow.py setup-voice

This records ~15 seconds of speech and saves:

~/.shadow-companion/my-voice.wav — reference audio
~/.shadow-companion/my-voice.txt — reference transcript

Using a native speaker's voice

Replace the reference files with a native speaker's recording (10-15s of clean speech):

cp native-speaker.wav ~/.shadow-companion/my-voice.wav
echo "The transcript of that recording" > ~/.shadow-companion/my-voice.txt

Then restart: python shadow.py restart

Notes

NeuTTS uses streaming playback (infer_stream()) for low-latency audio — chunks play as they're generated
Speed control is not available with NeuTTS
Provider changes require a server restart
Only GGUF backbones are supported for streaming (Q8 by default)
Voice cloning reference is encoded in a separate short-lived subprocess that loads torch, saves ref_codes.npy, then exits — the long-lived TTS worker never loads torch
The ONNX codec decoder (neuphonic/neucodec-onnx-decoder) is used in the worker instead of the PyTorch codec, saving ~2.4 GB of resident memory

Daily Tracking

A macOS menubar meter built with Perry that shows your daily shadowing time. Completely optional — the core shadow companion works without it.

What it tracks

TTS playback duration — how long you spent listening/shadowing (primary metric)
STT recording duration — how long you spoke into Handy (secondary, shown in verify)
Current local day only (not lifetime)
Writes ~/.shadow-companion/daily-progress.json for the menubar app to read
TTS play log stored in ~/.shadow-companion/tts-play-log.json

CLI commands

python shadow.py progress              # Compute and print today's progress
python shadow.py verify                # Detailed audit: TTS time, STT time, per-recording breakdown
python shadow.py set-daily-target 30   # Set daily target in minutes (default: 60)

Menubar app

A compact Perry-native tray icon in your macOS menu bar:

🔋 Battery mode — 5-slice visual (click to toggle)
📝 Text mode — shows minutes as X/Y
Tooltip — hover for exact progress
Context menu — click for config, progress file, toggle mode, quit
No Dock icon — lives only in the menubar

Setup

# 1. Generate icon PNGs
python generate_icons.py

# 2. Compute initial progress
python shadow.py progress

# 3. Compile the Perry app (requires Perry)
perry compile src/main.ts -o dist/shadow-meter

# 4. Run it
./dist/shadow-meter

Or use the build script:

./build.sh
open "dist/Shadow Meter.app"

How it updates

The server logs TTS playback duration after each utterance and rewrites daily-progress.json whenever it processes a new Handy recording, and also every 5 minutes. The menubar app polls this file every 15 seconds.

Verifying accuracy

python shadow.py verify

Shows TTS playback time and STT recording time, lists every counted recording with its WAV duration and whether the file still exists, prints totals, and cross-checks them against daily-progress.json.

Daily target

Default is 60 minutes. Change it:

python shadow.py set-daily-target 45   # 45 minutes

Progress formula: tts_playback_duration_today / daily_target_duration, clamped 0–1. Battery slices = ceil(progress × 5).
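
As a worked sketch of the formula above (the function names are hypothetical, and both durations are converted to seconds so the units match):

```python
import math


def progress_fraction(tts_seconds, target_minutes=60):
    """TTS playback time divided by the daily target, clamped to 0..1."""
    target_seconds = target_minutes * 60
    if target_seconds <= 0:
        return 0.0  # guard against a zero or negative target
    return min(1.0, max(0.0, tts_seconds / target_seconds))


def battery_slices(progress, total=5):
    """Number of filled battery slices: ceil(progress x 5)."""
    return math.ceil(progress * total)
```

With a 45-minute target, 22.5 minutes of playback (1350 seconds) gives a progress of 0.5 and therefore 3 of 5 slices. Because of the ceiling, any non-zero playback lights at least one slice.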

Raycast Extension

Control Shadow Companion from Raycast:

cd ~/shadow-companion/raycast-extension
npm install
npm run dev

Four commands available:

Control Server — Start/stop/restart + see status, provider, and daily progress
Switch Voice — Pick from all Kokoro voices (shows NeuTTS info when that provider is active)
Switch Provider — Switch between Kokoro and NeuTTS (with restart action)
Adjust Speed — Set speech speed, 0.5x–2.0x (Kokoro only; shows "not available" for NeuTTS)

Available Kokoro voices

Male	Female
am_michael (default)	af_heart
am_adam	af_nicole
am_eric	af_sarah
am_liam	af_bella
am_onyx	af_river
am_puck	af_sky

Troubleshooting

CoreML slow on M2: Use --provider kokoro (the default). It runs the CPU ONNX path, which is faster (~4-5× realtime) because CoreML supports only ~45% of Kokoro's ONNX nodes and the resulting graph partitioning adds overhead.

NeuTTS audio choppy: Fixed — streaming now uses sounddevice.OutputStream with a queue-based callback for gapless chunk playback. If issues persist, check that llama-cpp-python and onnxruntime are up to date.
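
For readers curious what a queue-based callback looks like, here is a minimal sketch. It is not the code in shadow.py: it uses sounddevice's `RawOutputStream` (the raw-bytes sibling of `OutputStream`) so the buffering logic needs no numpy, the names `ChunkFeeder` and `open_stream` are hypothetical, and 24 kHz mono 16-bit PCM is an assumed format.

```python
import queue


class ChunkFeeder:
    """Turns variable-sized PCM chunks into exactly-sized callback buffers."""

    def __init__(self):
        self.chunks = queue.Queue()
        self._leftover = b""

    def put(self, pcm):
        # Called by the producer as each streamed chunk is generated.
        self.chunks.put(pcm)

    def fill(self, nbytes):
        # Called from the audio callback; must never block.
        out = self._leftover
        while len(out) < nbytes:
            try:
                out += self.chunks.get_nowait()
            except queue.Empty:
                break
        self._leftover = out[nbytes:]  # keep the surplus for the next callback
        out = out[:nbytes]
        return out + b"\x00" * (nbytes - len(out))  # pad with silence on underrun


def open_stream(feeder, samplerate=24000):
    import sounddevice as sd  # imported lazily; needs a working audio device

    def callback(outdata, frames, time, status):
        outdata[:] = feeder.fill(len(outdata))

    return sd.RawOutputStream(samplerate=samplerate, channels=1,
                              dtype="int16", callback=callback)
```

Carrying the leftover bytes across callbacks is what keeps chunk boundaries from producing gaps; silence is emitted only when the generator genuinely falls behind.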

"Could not find Handy's history.db": Make sure Handy is installed and has been used at least once. Or specify the path: python shadow.py --db /path/to/history.db

Server not responding: Check the log at ~/.shadow-companion/server.log

Menubar shows stale data: Run python shadow.py progress to recompute, or just wait — the server refreshes progress on every new recording and every 5 minutes.

NeuTTS reference audio missing: Run python shadow.py setup-voice to record your reference audio. Or place a WAV file at ~/.shadow-companion/my-voice.wav with a matching .txt transcript.
