Feature: Add SenseVoice support via Sherpa-ONNX for on-device ASR

## Motivation

ElatoAI runs real-time voice AI on ESP32. [SenseVoice](https://github.com/FunAudioLLM/SenseVoice) (8K+ stars) is available through [Sherpa-ONNX](https://github.com/k2-fsa/sherpa-onnx), which already supports embedded/IoT platforms, including ESP32-compatible inference.

## Why SenseVoice for ElatoAI

- **Non-autoregressive**: Single forward pass, minimal compute per chunk
- **SenseVoice-Small** (234M parameters): 50+ languages with automatic language detection
- **ONNX format**: Runs via Sherpa-ONNX on embedded devices
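- **Audio event detection**: Tags events such as music, applause, and laughter; voice activity detection itself is usually handled by a separate VAD model (for example, Silero VAD in Sherpa-ONNX)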
- **Emotion detection**: Detect user emotions from speech — useful for companion AI
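
To make the language and emotion points concrete, here is a minimal sketch of how the firmware or server could split the leading tags off a SenseVoice transcript. It assumes the raw text is prefixed with tags in the order language, emotion, event, ITN flag (for example `<|en|><|HAPPY|><|Speech|><|withitn|>`), which is how FunASR emits SenseVoice output before post-processing. The `SenseVoiceTags` struct and both function names are hypothetical and not part of any SenseVoice or Sherpa-ONNX API.

```c
#include <string.h>

#define TAG_MAX 16

/* Hypothetical container for the leading tags; not part of any SenseVoice API. */
typedef struct {
    char language[TAG_MAX];
    char emotion[TAG_MAX];
    char event[TAG_MAX];
    const char *text; /* points into the input, just past the parsed tags */
} SenseVoiceTags;

/* Copy the body of the "<|...|>" tag at *p into dst and advance *p past it.
 * Returns 1 on success, 0 if *p does not start with a complete tag. */
static int next_tag(const char **p, char *dst, size_t cap) {
    if (strncmp(*p, "<|", 2) != 0) return 0;
    const char *start = *p + 2;
    const char *end = strstr(start, "|>");
    if (end == NULL) return 0;
    size_t len = (size_t)(end - start);
    if (len >= cap) len = cap - 1; /* truncate overly long tags */
    memcpy(dst, start, len);
    dst[len] = '\0';
    *p = end + 2;
    return 1;
}

/* Assumed tag order: language, emotion, event, then an optional ITN flag.
 * Returns 1 if the first three tags were found, 0 otherwise. */
static int parse_sensevoice_tags(const char *raw, SenseVoiceTags *out) {
    const char *p = raw;
    char itn[TAG_MAX];
    if (!next_tag(&p, out->language, TAG_MAX)) return 0;
    if (!next_tag(&p, out->emotion, TAG_MAX)) return 0;
    if (!next_tag(&p, out->event, TAG_MAX)) return 0;
    next_tag(&p, itn, TAG_MAX); /* optional ITN flag, ignored here */
    out->text = p;
    return 1;
}
```

A companion persona could then branch on `emotion` (for instance, softening its reply when the tag is `SAD`) while passing only `text` on to the LLM.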

## Integration via Sherpa-ONNX

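Sherpa-ONNX provides a C/C++ API suitable for embedded targets:

```c
// C API for embedded devices
#include "sherpa-onnx/c-api/c-api.h"

// `config` is a SherpaOnnxOfflineRecognizerConfig filled in with the SenseVoice model and tokens paths
const SherpaOnnxOfflineRecognizer *recognizer =
    SherpaOnnxCreateOfflineRecognizer(&config);
// Process audio frames → get text
```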
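
The "process audio frames" step needs a format conversion first: microphones on the ESP32 typically deliver signed 16-bit PCM, while the Sherpa-ONNX C API takes `float` samples normalized to the range [-1, 1]. A minimal sketch of that conversion follows; the helper name `pcm16_to_float` is hypothetical and not part of Sherpa-ONNX.

```c
#include <stddef.h>
#include <stdint.h>

/* Convert signed 16-bit PCM samples to floats in [-1, 1).
 * Hypothetical helper: pcm16_to_float is not part of Sherpa-ONNX. */
static void pcm16_to_float(const int16_t *in, float *out, size_t n) {
    for (size_t i = 0; i < n; ++i) {
        /* Divide by 32768 so that INT16_MIN maps exactly to -1.0f. */
        out[i] = (float)in[i] / 32768.0f;
    }
}
```

The resulting buffer is what would be handed to the recognizer's stream (in the C API, via `SherpaOnnxAcceptWaveformOffline`), along with the capture sample rate.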

For server-side processing (when the ESP32 sends audio to a server):

```bash
pip install funasr vllm
funasr-server --device cuda  # OpenAI-compatible at :8000
```

## References

- FunASR: https://github.com/modelscope/FunASR
- Sherpa-ONNX (C/C++): https://github.com/k2-fsa/sherpa-onnx
