A real-time sign language recognition system that uses computer vision and machine learning to recognize Nederlandse Gebarentaal (Dutch Sign Language) fingerspelling alphabet gestures.
This project implements an automated sign language recognition system capable of interpreting hand gestures for all 26 letters of the alphabet in real time. The system uses MediaPipe for hand detection and landmark extraction, combined with a Random Forest classifier for gesture classification.
- Real-time Recognition: Live webcam-based hand gesture detection and classification
- Complete Alphabet Support: Recognizes all 26 letters (A-Z) of the NGT fingerspelling alphabet
- Robust Hand Tracking: Uses MediaPipe for hand landmark detection
- Intelligent Zone Detection: Defines optimal signing area relative to face position
- Interactive Interface: Real-time visual feedback with confidence scores and status indicators
- Hand and Face Detection (`hand_face_detection.py`) - Coordinate normalization and feature preparation
- Model Training (`train_model.ipynb`) - Train and evaluate the Random Forest classifier
- Data Collection (`data_collection.py`) - Collect the hand-gesture data needed for training
- Real-time Prediction (no UI) (`real_time_prediction.py`) - Live webcam gesture recognition
- Real-time Prediction (with UI) (`real_time_prediction.py`) - Same as above, but with a user interface
- Algorithm: Random Forest Classifier
- Features: 66-dimensional feature vector (63 normalized landmarks + 3 absolute wrist coordinates)
- Accuracy: >95% on the test dataset, though noticeably lower during live real-time detection
- Training Data: Individually labeled frames (per-frame classification approach)
- Optimization: Hyperparameter tuning with Optuna (5-fold cross-validation)
- Python 3.10 or higher (recommended)
- Webcam for real-time recognition
- Clone the repository:

  ```bash
  git clone https://github.com/TimurKambarov/NGT-Sign-Language-Recognition.git
  cd NGT-Sign-Language-Recognition
  ```

- Create a virtual environment:

  ```bash
  python -m venv venv
  # Windows
  venv\Scripts\activate
  # macOS/Linux
  source venv/bin/activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Prepare training data:

  ```bash
  python data_collection.py
  ```

- Train the model:

  ```bash
  jupyter notebook train_model.ipynb
  ```
Run the real-time recognition system:

```bash
python real_time_prediction.py
```

- Q: Quit application
- M: Toggle mirror mode
- Z: Toggle signing zone visibility
```
NGT-Sign-Language-Recognition-/
├── data/
│   ├── NGT_gestures/               # Hand gesture pictures
│   └── samples.csv                 # Training dataset
├── models/
│   ├── random_forest_model.joblib  # Trained RF model
│   └── label_encoder_rf.pkl        # Label encoder
├── hand_face_detection.py          # MediaPipe detection utilities
├── real_time_prediction.py         # Main recognition application
├── train_model.ipynb               # Model training notebook
├── streamlit_app.py                # Data collection tool
├── requirements.txt                # Python dependencies
└── README.md                       # This file
```
The system achieves the following performance characteristics:
- Accuracy: >95% on the test dataset, but typically only ~40% confidence during real-time prediction
- Recognition Latency: <100ms per prediction
- Supported Gestures: All 26 letters (A-Z)
The system uses a 66-dimensional feature vector consisting of:
- 63 normalized landmarks: Hand keypoints relative to wrist position (21 points × 3 coordinates)
- 3 absolute coordinates: Wrist position for spatial context
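As an illustration, here is a minimal sketch of how such a vector could be assembled from MediaPipe hand landmarks; the helper name and exact code are illustrative, not taken from `hand_face_detection.py`:

```python
import numpy as np

def build_feature_vector(hand_landmarks):
    """Build a 66-dimensional feature vector from 21 MediaPipe hand landmarks."""
    # 21 landmarks as (x, y, z) rows -> shape (21, 3)
    points = np.array([[lm.x, lm.y, lm.z] for lm in hand_landmarks.landmark])
    wrist = points[0]              # landmark 0 is the wrist
    relative = points - wrist      # keypoints relative to the wrist (63 values)
    # 63 wrist-relative values + 3 absolute wrist coordinates = 66 features
    return np.concatenate([relative.flatten(), wrist])
```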
- Algorithm: Random Forest with optimized hyperparameters
- Training Strategy: Per-frame classification approach
- Validation: 5-fold cross-validation during hyperparameter tuning
- Optimization: Optuna-based parameter search
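A rough sketch of what this Optuna search with 5-fold cross-validation could look like; the CSV layout (a `label` column plus 66 feature columns), hyperparameter ranges, and trial count are assumptions rather than the notebook's exact setup:

```python
import optuna
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import LabelEncoder

# Assumed layout: one row per frame, 66 feature columns plus a "label" column.
df = pd.read_csv("data/samples.csv")
X = df.drop(columns=["label"]).values
y = LabelEncoder().fit_transform(df["label"])

def objective(trial):
    # Search space is illustrative; the notebook may tune different ranges.
    clf = RandomForestClassifier(
        n_estimators=trial.suggest_int("n_estimators", 100, 500),
        max_depth=trial.suggest_int("max_depth", 5, 30),
        min_samples_split=trial.suggest_int("min_samples_split", 2, 10),
        random_state=42,
    )
    # 5-fold cross-validated accuracy on the per-frame feature vectors.
    return cross_val_score(clf, X, y, cv=5, scoring="accuracy").mean()

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=50)
best_model = RandomForestClassifier(**study.best_params, random_state=42).fit(X, y)
```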
- Frame Capture: Webcam video input
- Face Detection: Establish reference coordinate system
- Hand Detection: Extract 21 hand landmarks
- Feature Extraction: Normalize and prepare feature vector
- Classification: Random Forest prediction with confidence score
- Post-processing: Confidence filtering and temporal smoothing
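A condensed sketch of this loop, reusing the hypothetical `build_feature_vector` helper shown earlier; the model paths follow the project structure, the 55% cutoff mirrors the display threshold mentioned below, and the face-based zone check and temporal smoothing are omitted for brevity:

```python
import cv2
import joblib
import mediapipe as mp
import numpy as np

model = joblib.load("models/random_forest_model.joblib")
encoder = joblib.load("models/label_encoder_rf.pkl")
hands = mp.solutions.hands.Hands(max_num_hands=1, min_detection_confidence=0.7)

cap = cv2.VideoCapture(0)
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if results.multi_hand_landmarks:
        features = build_feature_vector(results.multi_hand_landmarks[0]).reshape(1, -1)
        probs = model.predict_proba(features)[0]
        best = int(np.argmax(probs))
        letter = encoder.inverse_transform([model.classes_[best]])[0]
        if probs[best] >= 0.55:  # confidence filter before display
            cv2.putText(frame, f"{letter} ({probs[best]:.0%})", (10, 40),
                        cv2.FONT_HERSHEY_SIMPLEX, 1.2, (0, 255, 0), 2)
    cv2.imshow("NGT recognition", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```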
This implementation serves as a proof-of-concept and has several important limitations:
- Limited Training Data: The current model was trained on a relatively small dataset, which significantly impacts recognition accuracy
- Low Recognition Confidence: Due to insufficient training samples, the model typically achieves confidence scores around 40% or lower
- Reduced Dynamic Gesture Performance: Dynamic gestures (letters requiring movement) show even lower confidence scores compared to static hand positions
- Single User Bias: Training data may be biased toward specific hand shapes, sizes, or signing styles
- Environmental Sensitivity: Performance may vary significantly under different lighting conditions or camera angles
To achieve production-ready performance, the following enhancements are needed:
- Collect substantially more training data (1000+ samples per letter from multiple users)
- Include diverse hand sizes and signing styles in training data
- Implement temporal smoothing and sequence-based recognition for dynamic gestures (see the sketch after this list)
- Add data augmentation techniques to improve model robustness
- Consider deep learning approaches for better feature extraction
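One simple way to add the temporal smoothing mentioned above is a sliding majority vote over recent per-frame predictions. The class below is an illustrative sketch, not part of the current codebase:

```python
from collections import Counter, deque

class PredictionSmoother:
    """Majority vote over the last N frame-level predictions to stabilize output."""

    def __init__(self, window_size=15, min_votes=8):
        self.window = deque(maxlen=window_size)
        self.min_votes = min_votes

    def update(self, letter):
        self.window.append(letter)
        top_letter, votes = Counter(self.window).most_common(1)[0]
        # Only emit a letter once it dominates the recent window.
        return top_letter if votes >= self.min_votes else None
```

In the real-time loop, `smoother.update(letter)` would then replace the raw per-frame prediction before display, suppressing single-frame misclassifications.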
Note: The current confidence threshold is set to 55% for display purposes, but actual predictions often fall below this threshold due to the limited training data.
- Fork the repository
- Create a feature branch (`git checkout -b feature/new-feature`)
- Commit your changes (`git commit -am 'Add new feature'`)
- Push to the branch (`git push origin feature/new-feature`)
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Support for word-level recognition
- Web application development
- Extended vocabulary beyond fingerspelling
- Multi-hand gesture recognition