📚 VoxLibri

VoxLibri is a powerful, all-in-one solution for converting eBooks into high-quality audiobooks using state-of-the-art Text-to-Speech (TTS) technology. Whether you have a collection of EPUBs, PDFs, or even scanned images, VoxLibri can transform them into immersive audio experiences with natural-sounding voices and advanced customization.

✨ Key Features

🚀 Multiple TTS Engines:
- Coqui XTTSv2: Industry-leading voice cloning with emotional range and high fidelity.
- Suno Bark: Generative audio model for diverse and expressive speech.
🎙️ Professional Voice Cloning: Clone any voice using just a short audio sample (supports various formats like MP3, WAV, M4A).
📖 Broad Format Support:
- eBooks: EPUB, PDF, MOBI, AZW3, FB2, TXT, DOCX, and more.
- Images & Scans: Integrated OCR (Tesseract) support for PNG, JPG, TIFF, etc.
🌍 Multilingual: Support for dozens of languages including English, Tamil, Spanish, French, German, and many others.
🎨 Modern Dashboards:
- Streamlit Web UI: A sleek, dark-themed interface for easy management.
- Headless CLI: Powerful command-line interface for automation and batch processing.
🛠️ Advanced Control:
- SSML-like Tags: Use [pause], [break], and [voice:path] to fine-tune the narration.
- Audio Processing: Noise removal and music separation using Demucs.
- Format Conversion: Output to M4B (with chapters), MP3, AAC, and more.
🖥️ Hardware Optimized: Automatic setup for CPU, NVIDIA (CUDA), Apple Silicon (MPS), and Intel (XPU).

🛠️ Installation

VoxLibri is designed to be self-contained. The build.py script manages the entire environment for you.

Prerequisites (Linux)

Ensure you have the basic build tools. Most other dependencies are handled automatically.

sudo apt-get update
sudo apt-get install python3-pip python3-venv git

Quick Start

Clone the repository:

git clone https://github.com/Vasanth2005kk/VoxLibri.git
cd VoxLibri

Run the installer and launcher:
```
python3 build.py
```
This will create a virtual environment, install system dependencies (Calibre, FFmpeg, Tesseract), and launch the application.

🚀 How to Use

1. Web Interface (Recommended)

Launch the app using build.py. Once running, open your browser to: http://localhost:7860

Upload: Drop your eBook file.
Configure: Select the language and TTS engine.
Clone: Upload a voice sample to narrate the book in that specific voice.
Convert: Click "Start" and wait for your audiobook to appear in the audiobooks/ directory.

2. Headless Mode (CLI)

For power users or automation, use the headless flags:

./build.py --headless --ebook "/path/to/book.epub" --language eng --voice "/path/to/voice.wav"

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
audiobooks		audiobooks
components		components
ebooks		ebooks
ext		ext
lib		lib
tools		tools
voices		voices
.gitignore		.gitignore
README.md		README.md
app.py		app.py
build.py		build.py
chacking.txt		chacking.txt
requirements.txt		requirements.txt
tamil.txt		tamil.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📚 VoxLibri

✨ Key Features

🛠️ Installation

Prerequisites (Linux)

Quick Start

🚀 How to Use

1. Web Interface (Recommended)

2. Headless Mode (CLI)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📚 VoxLibri

✨ Key Features

🛠️ Installation

Prerequisites (Linux)

Quick Start

🚀 How to Use

1. Web Interface (Recommended)

2. Headless Mode (CLI)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages