VoiGent is a professional-grade, voice-enabled AI assistant designed to handle To-Do lists and remember important user details across sessions. It features a high-fidelity voice interface with automatic language detection between English and Hindi.
frontend : https://voigent.netlify.app
backend : https://voigent.onrender.com
- High-Fidelity Transcription: Uses Groq Whisper-v3 for near-instant and highly accurate speech-to-text.
- Perfect Language Autodetection: Automatically identifies if you are speaking English, Hindi, or "Hinglish" without any manual toggles.
- Hybrid TTS Engine:
- English (Fast Path): Uses the browser's native synthesis for zero-latency response.
- Hindi (Quality Path): Routes through the backend using gTTS for authentic pronunciation.
- Long-term Memory: Remembers your name, preferences, and important facts using an integrated SQLite database.
- Task Management: Fully functional To-Do list (Add, List, Update, Delete) via LLM function calling.
- Premium Glassmorphic UI: A modern, responsive interface with vibrant gradients and smooth animations.
- Mindful Usage System: Built-in persistent message counter and popup to encourage responsible use of free-tier AI resources.
graph TD
User((User)) <--> UI[Frontend: Vanilla JS/CSS3]
UI <--> BE[Backend: FastAPI]
BE <--> Groq[Groq AI: Llama 3.1 & Whisper]
BE <--> DB[(SQLite: agent_data.db)]
BE <--> gTTS[Google TTS: Hindi Voice]
voice-agent/
├── backend/
│ ├── voice/
│ │ ├── detect.py # Language detection (EN/HI)
│ │ └── tts.py # gTTS Hindi synthesis
│ ├── agent.py # LLM & Function calling logic
│ ├── database.py # SQLite persistence
│ ├── main.py # FastAPI entry point
│ └── requirements.txt # Python dependencies
├── frontend/
│ ├── api.js # Backend API wrappers
│ ├── app.js # UI & Voice logic
│ ├── index.html # Application structure
│ └── style.css # Premium styling
└── README.md
- Python 3.8+
- A Groq API Key
- From the project root (
voice-agent/), create a virtual environment:python3 -m venv venv
- Activate the virtual environment:
- Mac/Linux:
source venv/bin/activate - Windows:
venv\Scripts\activate
- Mac/Linux:
- Navigate to the
backendfolder:cd backend - Create a
.envfile and add your key:GROQ_API_KEY=your_key_here
- Install dependencies:
pip install -r requirements.txt
- Start the server:
uvicorn main:app --reload
- Navigate to the
frontendfolder.cd frontend - Start a local server:
python3 -m http.server 3000
- Open
http://localhost:3000in your browser.