release: stabilize TTS microservice integration, voice system, and UI improvements#46
Merged
Conversation
Introduce voice-language and RVC-based voice features across the app. Adds a new DB migration (migrate_add_voice_language.py) and adds voice_language column to the User model (database/models.py); also updates the existing migration for voice_enabled. DatabaseManager gained getters/setters for user voice language and refactors for user creation/message saving. Core changes: settings reorganized, NLPEngine made resilient when transformers/torch are unavailable, Application now initializes a VoiceProcessor (when enabled) and registers VoiceHandler. Commands and responses updated to expose /voicelang and interactive language selection (callbacks), and new libs/rvc_python plus utility helpers were added. Run the new migration script from project root to update the database before enabling voice features.
Move voice generation to an external TTS microservice and remove the bundled rvc_python library. Key changes: - .env.example: add TTS_SERVICE_URL, reorganize voice/NLP/DB settings and defaults. - README: document new Alya-TTS microservice-based voice setup and update voice feature text. - Removed libs/rvc_python package and related API/CLI/config files to stop bundling RVC internals. - Added utils/tts_queue.py and integrated dispatching to queue TTS jobs rather than inlining heavy audio work. - handlers/voice.py: refactored to translate and dispatch TTS via the queue, use DEFAULT_LANGUAGE, and avoid direct file-based TTS handling. - handlers/conversation.py & core/bot.py: removed inline voice processing from conversation flow and updated handler wiring. - database/database_manager.py: get_user_voice_language now falls back to user's language_code when voice_language is unset. - handlers/admin.py: removed deployment manager/status command and cleaned up registration/logging. - handlers/response/lang.py & handlers/commands.py: trimmed supported languages and updated voicelang labels. - core/nlp.py: clarified torch <2.6 warning message. Overall this refactor isolates audio workloads to a microservice, simplifies the bot code, and removes the bundled RVC implementation in favor of a separate TTS service and queued processing.
feat: add animated loading placeholder for AI response generation
feat: integrate TTS voice responses and improve response loading UX
feat: integrate TTS voice handling, improve loading UX, and stabilize runtime dependencies
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Overview
This PR promotes the latest stable changes from
developmenttoproduction, focusing on TTS architecture, voice system improvements, UI enhancements, and dependency stabilization.Key Changes
Voice System & TTS Architecture
voice_helpers!ttscommand with improved dispatch flowAdmin & Command Improvements
/voiceadd/voiceremove/addadmin/removeadminRVC & Voice Model Refactor (Deprecated Path Cleanup)
UI & Interaction Improvements
System & Backend Improvements
ChatActionSenderwith improved error handlingContextManagerDatabase & Structure Updates
userstable:moodandvoicecolumnsDependency & Environment Stabilization
faiss-cputo1.7.3torchaudiominimum to2.6.0pip <24.1omegaconfCleanup
Notes
Impact
Migration Notes
usersfields (mood,voice)