esp-sign-detection

A low-latency, real-time sign language recognition system powered by TensorFlow Lite Micro on the ESP32. This project uses a distributed architecture where a laptop performs hand detection (MediaPipe) and streams processed data to an ESP32 for high-speed edge inference.

Overview

The system recognizes static hand signs (currently optimized for classes A, C, Q, and T) with high precision by offloading computer vision tasks to a host while performing the core AI classification on a microcontroller. Servo actions mirror each class for instant haptic feedback: A → 0°, C → 90°, Q → 180°, and T performs a 0° → 180° sweep.

System Architecture

Laptop (Python/MediaPipe): Captures webcam video, detects hand landmarks, crops the hand ROI, and normalizes the image to a $28 \times 28$ grayscale patch.
Communication: Streams raw 784-byte packets via UDP over Wi-Fi.
ESP32 (TFLite Micro): Receives the packet, performs INT8 Quantized Inference, and outputs probabilities using a specialized C++ bridge.

Hardware & Software

Hardware

Microcontroller: ESP32 (WROOM/DevKit)
Host: Laptop with Webcam

Software Stack

Framework: ESP-IDF (v5.x)
ML Engine: TensorFlow Lite Micro (optimized for ESP32 via esp-tflite-micro)
Build Tool: PlatformIO
Training: PyTorch (ResNet-style Architecture)
Inference (Host): MediaPipe, OpenCV, TensorFlow Lite

Model Specifications

The project uses a custom TinyRes28_ESP32 architecture designed for the strict SRAM limits of the ESP32.

Input Shape: $28 \times 28 \times 1$ (Grayscale)
Quantization: Full INT8 (Per-Channel Symmetric)
Layers: Depthwise Separable Convolutions + Residual (Skip) Connections.
Memory Footprint: ~53KB TFLite model, ~120KB Tensor Arena.

Installation & Setup

Python Client

Install dependencies:

cd python_cam_client
pip install -r requirements.txt

Set your ESP32's IP address in the inference script.
Run the host-side pipeline:
```
python img_test.py
```

ESP Pinout

Servo : GPIO 18
Potentiometer : GPIO 34 (ADC1_6)
Wi-Fi: Standard ESP32 Wi-Fi (2.4GHz)

Note: You will see if the esp is connected to the wifi or not in the serial monitor. If you are not using the monitor, the led (GPIO 2) will blink twice after the init blink to indicate successful Wi-Fi connection else just once to indicate failure.

v0.2.0 Highlights

Accuracy bump: Improved training pipeline now yields ~80% accuracy under consistent indoor lighting once quantized to INT8.
On-device tuning: Added a board-mounted potentiometer that shifts the post-softmax confidence threshold, enabling in-field sensitivity adjustments without reflashing.
Servo-linked classes: Expanded inference targets to A, C, Q, T, each mapped to deterministic servo positions for easier debugging.

Performance Features

Manual 16-byte Alignment: Prevents memory allocation crashes in TFLite Micro's SingleArenaBufferAllocator.
Softmax Temperature Scaling: Match laptop confidence levels using temperature-scaled probabilities ($T=0.6364$).
UDP Reliability: Lightweight header-based packet verification (0xAABB).
Flash Optimization: Model weights stored strictly in Flash (static const) to preserve SRAM for the Wi-Fi stack.

Future

Currently the model classifies A, C, Q and T. We can train for more classes and even try increasing the model complexity to classifiy more classes.
We can port the Image Capturing + Preprocessing code from python to an SoC but would require another ESP32-Cam board (Out of scope for this project).

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
include		include
lib		lib
ml		ml
python_cam_client		python_cam_client
send_pipeline		send_pipeline
src		src
test		test
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENCE		LICENCE
README.md		README.md
dependencies.lock		dependencies.lock
partitions.csv		partitions.csv
platformio.ini		platformio.ini
sdkconfig.upesy_wroom		sdkconfig.upesy_wroom

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

esp-sign-detection

Overview

System Architecture

Hardware & Software

Hardware

Software Stack

Model Specifications

Installation & Setup

Python Client

ESP Pinout

v0.2.0 Highlights

Performance Features

Future

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

esp-sign-detection

Overview

System Architecture

Hardware & Software

Hardware

Software Stack

Model Specifications

Installation & Setup

Python Client

ESP Pinout

v0.2.0 Highlights

Performance Features

Future

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages