Real-Time Language Translator with AR Visualization

Overview

This project combines Neural Machine Translation (NMT) with Augmented Reality (AR). Spoken English is transcribed, translated into Tamil, and overlaid as text on the physical world using marker-based tracking.

The system captures audio input, transcribes it, translates it with a Transformer model, and renders the output as a dynamic 3D overlay on specific image targets. The pipeline brings together Speech Recognition, Natural Language Processing (NLP), and Computer Vision.
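The stages described above can be sketched as a chain of three callables. This is a minimal illustration, not the project's actual code: `run_pipeline`, `PipelineResult`, and the three stage signatures are hypothetical names, and the stages in the usage section are stand-ins so the sketch runs without DeepSpeech, PyTorch, or Unity installed. Timing each stage separately is one way to see where latency comes from.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict

# Hypothetical stage signatures. The real project would plug in DeepSpeech,
# the Transformer model, and the bridge to the Unity/Vuforia scene here.
Transcriber = Callable[[bytes], str]
Translator = Callable[[str], str]
Renderer = Callable[[str], None]


@dataclass
class PipelineResult:
    transcript: str
    translation: str
    timings_ms: Dict[str, float] = field(default_factory=dict)


def run_pipeline(audio: bytes, transcribe: Transcriber,
                 translate: Translator, render: Renderer) -> PipelineResult:
    """Run audio -> transcript -> translation -> overlay, timing each stage."""
    timings: Dict[str, float] = {}

    start = time.perf_counter()
    transcript = transcribe(audio)
    timings["asr"] = (time.perf_counter() - start) * 1000

    start = time.perf_counter()
    translation = translate(transcript)
    timings["nmt"] = (time.perf_counter() - start) * 1000

    start = time.perf_counter()
    render(translation)
    timings["render"] = (time.perf_counter() - start) * 1000

    return PipelineResult(transcript, translation, timings)


if __name__ == "__main__":
    # Stand-in stages (assumptions for illustration only).
    shown = []
    result = run_pipeline(
        b"\x00\x00",
        transcribe=lambda _audio: "hello",
        translate=lambda text: {"hello": "வணக்கம்"}.get(text, text),
        render=shown.append,
    )
    print(result.transcript, "->", result.translation, result.timings_ms)
```

Passing the stages in as arguments keeps each one replaceable, so a model can be swapped or mocked without touching the others.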

Key Features

Automatic Speech Recognition (ASR): Uses Mozilla DeepSpeech to capture and transcribe spoken English in real time.
Neural Machine Translation: Uses a custom Transformer-based model to translate English text into Tamil. The architecture relies on self-attention to capture long-range dependencies between words, which helps it produce grammatical output in the target language.
Marker-Based AR: Uses the Vuforia SDK to detect physical image markers. Once a marker is recognized, the translated Tamil text is projected onto it, aligning the digital text with the real-world environment.
End-to-End Pipeline: A single framework that processes raw audio, runs text inference, and updates the AR display, with the aim of keeping latency low.
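To illustrate the self-attention mechanism mentioned in the translation feature, here is a minimal sketch of scaled dot-product attention in plain Python. It is not the project's PyTorch model: the function names are hypothetical, and the learned query, key, and value projections of a real Transformer layer are left out, so each token vector serves as its own query, key, and value.

```python
import math
from typing import List

Matrix = List[List[float]]


def softmax(row: List[float]) -> List[float]:
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x: Matrix) -> Matrix:
    """Scaled dot-product self-attention with Q = K = V = x.

    A real Transformer layer first applies learned linear projections to
    obtain Q, K and V. They are omitted here to keep the sketch short.
    """
    d_k = len(x[0])
    scale = math.sqrt(d_k)
    output = []
    for query in x:
        # Score this token against every token in the sequence, near or far.
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in x]
        weights = softmax(scores)
        # Each output vector is a weighted sum of all value vectors.
        output.append([
            sum(w * value[i] for w, value in zip(weights, x))
            for i in range(d_k)
        ])
    return output


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    for row in self_attention(tokens):
        print([round(v, 3) for v in row])
```

Because every token is scored against every other token in one step, the distance between two words in the sentence does not limit how strongly they can influence each other.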

Technical Stack

Speech Engine: Mozilla DeepSpeech
Translation Model: Transformer (PyTorch)
AR Platform: Unity, Vuforia SDK
Language: Python
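
For the speech engine listed above, a practical detail is the input format: the released English DeepSpeech models expect 16 kHz, mono, 16-bit PCM audio. The sketch below checks a WAV file against that format using only the standard library. `load_pcm16_mono` is a hypothetical helper for illustration, not part of this repository or of DeepSpeech.

```python
import sys
import wave
from array import array

# Sample rate of the released English DeepSpeech models.
EXPECTED_RATE_HZ = 16000


def load_pcm16_mono(path: str) -> array:
    """Read a WAV file and return its samples as signed 16-bit integers.

    Raises ValueError if the file is not 16 kHz, mono, 16-bit PCM.
    """
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            raise ValueError(
                f"expected {EXPECTED_RATE_HZ} Hz, got {wav.getframerate()} Hz"
            )
        if wav.getnchannels() != 1:
            raise ValueError("expected mono audio")
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples")
        frames = wav.readframes(wav.getnframes())

    samples = array("h")  # "h" = signed 16-bit integers
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples
```

With the `deepspeech` package installed, the returned samples can be converted to a NumPy `int16` array and passed to the model's `stt` method. Audio in another format would need resampling or downmixing first.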

Credits

Transformer model implementation adapted from ajhalthor.
Speech recognition provided by Mozilla DeepSpeech.

