β¨ You touch grass. I'll touch your glass. β¨
An open-source AI agent that sees, speaks, and controls your Android phone through voice commands
DeVA (Device Virtual Assistant) is a revolutionary AI-powered Android assistant that doesn't just respond to your voiceβit actually operates your phone for you. Think of it as having a personal assistant sitting next to you, tapping, swiping, and navigating through apps on your behalf.
π£οΈ "Hey DeVA, send a good morning text to Mom"
π± *DeVA opens Messages, finds Mom, types & sends the message*
β
"Done! Message sent to Mom"
| Traditional Assistants | DeVA |
|---|---|
| β Limited to specific app integrations | β Works with ANY app on your phone |
| β Can only answer questions | β Actually performs actions |
| β Requires API support from each app | β Uses visual UI automation |
| β Closed source, privacy concerns | β 100% Open Source |
| Task | Voice Command |
|---|---|
| π§ Send Messages | "Send a text to John saying I'll be late" |
| π΅ Play Music | "Open Spotify and play my liked songs" |
| β° Set Alarms | "Set an alarm for 7:30 AM tomorrow" |
| πΈ Take Photos | "Open camera and take a selfie" |
| π€οΈ Check Weather | "What's the weather like today?" |
| π± Open Apps | "Open Instagram and go to my messages" |
| π Search | "Search for nearby restaurants on Google Maps" |
| π¬ LinkedIn | "Send welcome message to all new connections" |
|
DeVA sees your screen through accessibility services, understands the context of UI elements, and performs actions like a human wouldβtapping, swiping, and typing. High-quality voice recognition and speech synthesis powered by Google's advanced AI. Just speak naturally, and DeVA understands. DeVA can analyze what's on your screen and provide context-aware responses. Ask "What's on my screen?" and get intelligent summaries. |
Say "Hey DeVA" to wake up your assistant anytime, anywhere. No need to open the app first. 100% open source. Your voice data stays on your device. No shady data collection. Unlike traditional assistants limited to specific integrations, DeVA can interact with any app on your phone through UI automation. |
DeVA is built on a sophisticated multi-agent system that separates responsibilities for reliable reasoning:
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β π§ THE BRAIN (LLM) β
β Gemini-powered reasoning & planning β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β β
β ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ β
β β π EARS β β ποΈ EYES β β ποΈ HANDS β β
β β STT/TTS βββββΊβ Accessibility βββββΊβ Actions β β
β β Voice β β Service β β Tap/Swipe β β
β ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ β
β β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β π± YOUR ANDROID DEVICE β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
- ConversationalAgentService - Handles voice interactions and conversation flow
- AgentService - Executes multi-step tasks through UI automation
- Eyes (Accessibility) - Reads screen content and UI hierarchy
- SpeechCoordinator - Manages STT/TTS for natural voice interaction
- GeminiApi - Powers intelligent decision making
- π± Android device with API Level 26+ (Android 8.0+)
- π οΈ Android Studio (latest version recommended)
- π Gemini API keys
-
Clone the repository
git clone https://github.com/devanshupardeshi/DeVA.git cd DeVA -
Configure API Keys
Create
local.propertiesin the project root:# Option 1: Direct Gemini API keys (recommended for testing) GEMINI_API_KEYS=your_api_key_1,your_api_key_2 # Option 2: Custom proxy server GCLOUD_PROXY_URL=your_backend_url GCLOUD_PROXY_URL_KEY=your_password
-
Build & Run
- Open in Android Studio
- Let Gradle sync dependencies
- Run on your device
-
Enable Permissions
- Grant Accessibility Service permission
- Grant Microphone permission
- Enable Overlay permission (for floating UI)
|
Communication
|
Entertainment
|
|
Productivity
|
Navigation
|
| Technology | Purpose |
|---|---|
| Kotlin | Primary development language |
| Gemini AI | Natural language understanding & decision making |
| Android Accessibility API | Screen reading & UI automation |
| Firebase | Analytics & user management |
| Google Cloud TTS | High-quality voice synthesis |
| Porcupine | Wake word detection |
We love contributions! Whether it's:
- π Bug reports
- π‘ Feature suggestions
- π Documentation improvements
- π§ Code contributions
Check out our Contributing Guide to get started.
# Fork the repo
# Clone your fork
git clone https://github.com/devanshupardeshi/DeVA.git
# Create a feature branch
git checkout -b feature/amazing-feature
# Make your changes & commit
git commit -m "Add amazing feature"
# Push & create a PR
git push origin feature/amazing-featureThis project is licensed under a Personal Use License.
| Use Case | Allowed |
|---|---|
| β Personal & Educational | Yes |
| β Research & Learning | Yes |
| Requires separate license |
See LICENSE for details.
If you find DeVA useful, please consider giving it a β!
- Built with β€οΈ for the open-source community
- Powered by Google's Gemini AI
- Inspired by the need for truly accessible technology
Made with π by developers who believe AI should work FOR you
