🧠 Sign Language Video Calling Web App

🚀 Real-time AI-powered video calling platform that translates sign language into text for seamless communication.

📸 Screenshots

Video Call	Gesture Detection	Output

🌟 Features

🎥 Real-time video calling (WebRTC)
✋ Sign language gesture recognition (A–Z)
💬 Live subtitle output
🧠 AI-based hand tracking (MediaPipe)
🔗 Room-based connection system
👥 Multi-user support
⚡ Smooth real-time performance (optimized)
🎯 Low-latency communication

🧩 How It Works

Camera → Frame Capture → MediaPipe → Hand Landmarks → ML Model → Prediction → Subtitle Output

👉 The system enables communication between deaf and normal users using AI-powered gesture recognition.

🏗️ Architecture

Frontend (React + WebRTC)
        ↓
WebSocket / Socket.IO
        ↓
Backend (FastAPI + AI Model)
        ↓
Prediction → Frontend Overlay

🛠️ Tech Stack

🔹 Frontend
React.js
WebRTC
Socket.IO
🔹 Backend
FastAPI
TensorFlow / Keras
MediaPipe
🔹 Other
OpenCV
NumPy
WebSockets

📁 Project Structure

Sign-Language-Video-Calling/
│
├── frontend/        # React app
├── backend/         # FastAPI + AI
├── socket/          # Signaling server
├── model/           # Trained model (.h5)
├── dataset/         # Training data
└── README.md

⚙️ Installation

1️⃣ Clone Repo
git clone https://github.com/Shivam09xc/Sign-Language-Video-Calling.git
cd Sign-Language-Video-Calling
2️⃣ Backend
cd backend
pip install -r requirements.txt
uvicorn main:app --reload
3️⃣ Frontend
cd frontend
npm install
npm start
4️⃣ Socket Server
cd socket
npm install
node server.js

🚀 Usage

Open app in browser
Create / Join Room
Start video call
Show gestures ✋
Get real-time text output 💬

⚡ Performance Optimizations

Frame throttling (process every 2–3 frames)
Landmark smoothing (moving average)
Prediction buffering (majority voting)
Confidence filtering
Async processing

📊 Limitations

Needs good lighting
Limited gestures (A–Z)
Performance depends on device

🔮 Future Scope

Sentence-level recognition (LSTM / Transformers)
Voice output (Text-to-Speech)
Mobile app (React Native)
Multi-language support
Face & pose detection

🤝 Contributing

Pull requests are welcome!
For major changes, open an issue first.

📜 License

MIT License

👨‍💻 Author

Shivam Soni
🎓 BTech CSE
🚀 AI + Full Stack Developer

⭐ Support

If you like this project, give it a ⭐ on GitHub and share it!


---

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
backend		backend
frontend		frontend
signaling		signaling
.gitignore		.gitignore
README.md		README.md
render.yaml		render.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Sign Language Video Calling Web App

📸 Screenshots

🌟 Features

🧩 How It Works

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 Sign Language Video Calling Web App

📸 Screenshots

🌟 Features

🧩 How It Works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages