Professional AI Voice Cloning Platform
Ethical voice cloning with consent. Local-first processing. Multi-platform support.
Detailed documentation is available in the docs/ directory:
- Project Overview
- Backend Guide (Architecture, Audio Engine)
- API Reference (Endpoints, JSON formats)
- Frontend Guide (Studio UI, State Management)
- Setup & Deployment (Local Dev, Docker)
- Voice Cloning - Clone voices with explicit consent
- Emotional TTS - Control speed, pitch, and energy
- 100% Local - No data uploads, complete privacy
- Multi-Language - Support for 100+ languages
- Multi-Platform - Web, Desktop, Mobile
- Modern Stack - FastAPI + Next.js + Tauri
For detailed setup instructions, see Setup Guide.
docker-compose up -d --build

Access:
- Frontend: http://localhost:3000
- API Docs: http://localhost:8000/docs
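Once the containers are running, a quick way to confirm that both addresses respond is a small script like the one below. This is a minimal sketch using only the Python standard library; the `SERVICES` mapping, the `is_up` helper, and the 3-second timeout are illustrative choices and not part of the VoxLabs codebase.

```python
import urllib.error
import urllib.request

# URLs taken from the access list above; names are illustrative only.
SERVICES = {
    "Frontend": "http://localhost:3000",
    "API Docs": "http://localhost:8000/docs",
}


def is_up(url: str, timeout: float = 3.0) -> bool:
    """Return True if the URL answers with any HTTP response."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    for name, url in SERVICES.items():
        status = "up" if is_up(url) else "not reachable"
        print(f"{name}: {status} ({url})")
```

If a service shows as not reachable right after startup, the containers may still be building; rerunning the check a little later is usually enough.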
VoxLabs/
├── api/                 # FastAPI Backend
│   ├── main.py          # Entry point
│   ├── engine/          # Audio processing logic
│   └── static/          # Generated audio files
├── web/                 # Next.js Frontend
│   ├── app/             # App Router pages
│   └── lib/             # API client & types
├── desktop/             # Tauri Desktop App config
├── docs/                # Project Documentation
├── docker-compose.yml   # Orchestration
└── README.md            # This file
- FastAPI - Python web framework
- Librosa - DSP & Audio analysis
- SoundFile - Audio I/O
- Pydantic - Data validation
- Next.js 16 - React framework
- TypeScript - Type safety
- Tailwind CSS - Styling
- Shadcn UI - Components
- Framer Motion / GSAP - Animations
- Docker - Containerization
- Tauri - Desktop application bundle
VoxLabs is built with ethical AI practices at its core:
- Consent Required - Explicit consent for all voice cloning operations
- Local Storage - No data uploads to external servers
- AI Labels - All generated audio labeled as AI-generated
- Easy Deletion - Simple voice data revocation
- Transparent - Open source and fully auditable
- No Impersonation - Designed to prevent malicious use
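To illustrate how the consent and AI-label principles could be enforced in code, here is a minimal sketch. It is not the VoxLabs implementation: `CloneRequest`, `ConsentError`, `authorize_clone`, and the field names are hypothetical, and it uses a standard-library dataclass rather than the project's actual request models.

```python
from dataclasses import dataclass


class ConsentError(Exception):
    """Raised when a cloning request does not carry explicit consent."""


@dataclass(frozen=True)
class CloneRequest:
    # Hypothetical fields, not the VoxLabs API schema.
    speaker_name: str
    consent_given: bool = False  # defaults to "no consent"


def authorize_clone(request: CloneRequest) -> dict:
    """Reject requests without consent; label accepted ones as AI-generated."""
    if not request.consent_given:
        raise ConsentError(
            f"No explicit consent recorded for {request.speaker_name!r}"
        )
    return {"speaker": request.speaker_name, "ai_generated": True}


if __name__ == "__main__":
    print(authorize_clone(CloneRequest("alice", consent_given=True)))
```

Defaulting `consent_given` to `False` means a caller has to opt in explicitly, so a forgotten field fails closed instead of silently allowing a clone.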
- Accessibility - Text-to-speech for visually impaired users
- Content Creation - Voiceovers for videos and podcasts
- Language Learning - Practice pronunciation with native voices
- Personal Assistants - Custom voice for smart home devices
- Game Development - Character voices for indie games
We welcome contributions! Please see CONTRIBUTING.md for:
- Code of conduct
- Development setup
- Coding standards
- Pull request process
- Testing guidelines
This project is licensed under the MIT License - see the LICENSE file for details.
Important Disclaimer: Users are responsible for obtaining proper consent before cloning any voice and must comply with all applicable laws and regulations.
- FastAPI backend with voice cloning
- Next.js frontend with modern UI
- Docker deployment setup
- Emotional TTS controls
- Multi-language support
- Desktop app (Tauri)
- Mobile app (Tauri Mobile)
- npm package (@voxlabs/client)
- PyPI package (voxlabs)
- Cloud deployment guides
- Author: nishanth-kj
- GitHub: @nishanth-kj
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- FastAPI for the excellent web framework
- Next.js team for the React framework
- Open source community for amazing tools
- Contributors and supporters
Made with ❤️ for ethical AI voice technology
Report Bug • Request Feature • Documentation
⭐ Star this repo if you find it useful! ⭐