release: stabilize TTS microservice integration, voice system, and UI improvements by Afdaan · Pull Request #46 · Afdaan/Alya-Bot-Telegram

Afdaan · 2026-03-19T11:05:13Z

Overview

This PR promotes the latest stable changes from development to production, focusing on TTS architecture, voice system improvements, UI enhancements, and dependency stabilization.

Key Changes

Voice System & TTS Architecture

Introduced structured voice handling via voice_helpers
Integrated !tts command with improved dispatch flow
Added support for:
- Replied voice messages
- Group-based TTS triggers
Fully migrated TTS processing to external microservice:
- https://github.com/Afdaan/Alya-TTS
- Reduces load on main service and improves scalability
Removed bundled RVC and local voice model dependencies
Added voice and mood support in database schema

Admin & Command Improvements

Enhanced admin and voice commands to support:
- User mentions
- Reply-based targeting
Supported commands:
- /voiceadd
- /voiceremove
- /addadmin
- /removeadmin
Eliminates the need to manually input user IDs
Improves usability and reduces friction for admin operations

RVC & Voice Model Refactor (Deprecated Path Cleanup)

Removed internal RVC dependencies and local model handling
Cleaned up forced module reload logic and internal lib overrides
Transitioned architecture to microservice-based voice processing

UI & Interaction Improvements

Added animated loading UI with centralized helper
Improved loading timing, punctuation, and responsiveness
Added timeout handling to prevent stuck states
Enhanced error handling during async operations
Improved message editing and response flow

System & Backend Improvements

Introduced ChatActionSender with improved error handling
Refactored context handling using ContextManager
Improved handling of persona prompts with language awareness
Precomputed persona traits and relationship context for efficiency

Database & Structure Updates

Extended users table:
- Added mood and voice columns
Refactored DB imports and migration flow

Dependency & Environment Stabilization

Downgraded faiss-cpu to 1.7.3
Adjusted torchaudio minimum to 2.6.0
Pinned pip <24.1
Removed omegaconf
Cleaned up and simplified deployment steps

Cleanup

Removed unused test files
Removed redundant Python installation steps
Simplified internal module structure

Notes

This release finalizes migration to microservice-based TTS architecture
Improves system stability, scalability, and maintainability
Reduces resource usage on the main bot service

Impact

More scalable and maintainable voice system
Reduced CPU and memory load on main service
Better admin experience with mention/reply support
Improved UX with smoother loading and responses
More stable dependency and deployment pipeline

Migration Notes

Ensure TTS microservice is deployed and accessible:
- https://github.com/Afdaan/Alya-TTS
Apply database migrations for new users fields (mood, voice)
Verify environment dependencies match updated constraints

Introduce voice-language and RVC-based voice features across the app. Adds a new DB migration (migrate_add_voice_language.py) and adds voice_language column to the User model (database/models.py); also updates the existing migration for voice_enabled. DatabaseManager gained getters/setters for user voice language and refactors for user creation/message saving. Core changes: settings reorganized, NLPEngine made resilient when transformers/torch are unavailable, Application now initializes a VoiceProcessor (when enabled) and registers VoiceHandler. Commands and responses updated to expose /voicelang and interactive language selection (callbacks), and new libs/rvc_python plus utility helpers were added. Run the new migration script from project root to update the database before enabling voice features.

Move voice generation to an external TTS microservice and remove the bundled rvc_python library. Key changes: - .env.example: add TTS_SERVICE_URL, reorganize voice/NLP/DB settings and defaults. - README: document new Alya-TTS microservice-based voice setup and update voice feature text. - Removed libs/rvc_python package and related API/CLI/config files to stop bundling RVC internals. - Added utils/tts_queue.py and integrated dispatching to queue TTS jobs rather than inlining heavy audio work. - handlers/voice.py: refactored to translate and dispatch TTS via the queue, use DEFAULT_LANGUAGE, and avoid direct file-based TTS handling. - handlers/conversation.py & core/bot.py: removed inline voice processing from conversation flow and updated handler wiring. - database/database_manager.py: get_user_voice_language now falls back to user's language_code when voice_language is unset. - handlers/admin.py: removed deployment manager/status command and cleaned up registration/logging. - handlers/response/lang.py & handlers/commands.py: trimmed supported languages and updated voicelang labels. - core/nlp.py: clarified torch <2.6 warning message. Overall this refactor isolates audio workloads to a microservice, simplifies the bot code, and removes the bundled RVC implementation in favor of a separate TTS service and queued processing.

feat: add animated loading placeholder for AI response generation

feat: integrate TTS voice responses and improve response loading UX

…aint

…kflows

feat: integrate TTS voice handling, improve loading UX, and stabilize runtime dependencies

Afdaan and others added 30 commits February 9, 2026 19:52

feat: Enhance voice model setup and processing

6e6d91d

feat: Refactor requirements.txt for improved organization and clarity

1949d22

fix: lower torchaudio minimum to 2.6.0

685d334

fix: downgrade faiss-cpu to 1.7.3

469da08

fix: Pin pip <24.1 and uninstall omegaconf

a0f7903

fix: precompute persona traits & relationship text

5567152

fix: add __init__.py for rvc_python packages

d852989

fix: rvc module

25700a3

fix: force reload rvc modules to bypass site-packages

2b8930c

fix: rvc module reload and absolute paths

40c1c93

fix: unignore internal lib folder and force add rvc files

448b0a5

fix: Refactor DB imports, add RVC fix and DB migration

bf7ca4f

fix: Extend users table with mood and voice columns

6522f79

feat: add ChatActionSender and voice/mood DB support

c0923aa

fix: Format context & use ContextManager in voice handler

c4e582a

feat: switch Japanese lang key to 'jp' and prefer RVC

4d91f7e

fix: Remove RVC and local voice model settings

7ad7713

Remove test_vp_init.py

89afc0c

fix: Language-aware persona prompts and handler refactor

24d5fe0

feat: Add animated loading UI and edit responses

e858c7d

feat: centralize loading animations in helpers

102aac8

Merge pull request #43 from Afdaan/feat/ui

84b752e

feat: add animated loading placeholder for AI response generation

Merge pull request #44 from Afdaan/feat/voice

2282272

feat: integrate TTS voice responses and improve response loading UX

feat: add voice_helpers and integrate !tts handling

cd21244

fix: remove send_voice_reply call from VoiceHandler

299c764

fix: handle replied voice notes & group TTS triggers

f190821

fix: remove 'voicelang' from bot commands

6eb5854

fix: add timeout to loading animation and reply fix

ac40e48

Afdaan and others added 11 commits March 19, 2026 03:21

feat: improve loading animation error handling and timing

0f65eff

fix: tweak loading punctuation and animation interval

becadab

fix: tweak loading animation intervals

7a55b37

feat: improve ChatActionSender error handling

7018440

fix: simplify loading messages and TTS dispatch

f12e6a4

fix: ignore 'not found' errors in loading animation

9cdd818

fix: bump numpy requirement to >=1.26,<2.0

d679be2

fix: update Python installation steps and adjust numpy version constr…

c0c3eb6

…aint

fix: remove unnecessary Python installation steps from deployment wor…

174cc70

…kflows

Merge pull request #45 from Afdaan/feat/voice

bc06f81

feat: integrate TTS voice handling, improve loading UX, and stabilize runtime dependencies

feat(admin): support mentions and replies for voice and admin commands

3bdecb4

Afdaan merged commit 4175866 into master Mar 19, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

release: stabilize TTS microservice integration, voice system, and UI improvements#46

release: stabilize TTS microservice integration, voice system, and UI improvements#46
Afdaan merged 41 commits into
masterfrom
development

Afdaan commented Mar 19, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Afdaan commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Key Changes

Voice System & TTS Architecture

Admin & Command Improvements

RVC & Voice Model Refactor (Deprecated Path Cleanup)

UI & Interaction Improvements

System & Backend Improvements

Database & Structure Updates

Dependency & Environment Stabilization

Cleanup

Notes

Impact

Migration Notes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Afdaan commented Mar 19, 2026 •

edited

Loading