The data-preprocessing code is in data_process.ipynb, which:
- gets keypoints with MediaPipe for each image
- flips each image and gets the keypoints from the flipped image
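As a rough sketch of these two steps: the helper names below and the MediaPipe Hands settings are my assumptions, not necessarily what data_process.ipynb does.

```python
import numpy as np

def landmarks_to_array(hand_landmarks):
    """Convert one MediaPipe hand (21 landmarks) to a (21, 3) array of x, y, z."""
    return np.array([[lm.x, lm.y, lm.z] for lm in hand_landmarks.landmark],
                    dtype=np.float32)

def extract_keypoints(image_bgr):
    """Run MediaPipe Hands on a BGR image; return (21, 3) keypoints or None."""
    # Imports kept local so the pure-numpy helper above works without
    # MediaPipe/OpenCV installed.
    import cv2
    import mediapipe as mp

    with mp.solutions.hands.Hands(static_image_mode=True,
                                  max_num_hands=1) as hands:
        results = hands.process(cv2.cvtColor(image_bgr, cv2.COLOR_BGR2RGB))
    if not results.multi_hand_landmarks:
        return None  # no hand detected in this image
    return landmarks_to_array(results.multi_hand_landmarks[0])

def preprocess(image_bgr):
    """Keypoints for the original image and for its horizontal flip."""
    import cv2
    flipped = cv2.flip(image_bgr, 1)  # 1 = flip around the vertical axis
    return extract_keypoints(image_bgr), extract_keypoints(flipped)
```

Both keypoint sets can then be saved per image, so no MediaPipe call is needed at training time.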
The Dataset class is in dataset.py, where the data augmentation is applied:
- random flip
  - with a certain probability, use the keypoints from the flipped image
  - if the flipped image has no keypoints, fall back to the original ones
- keypoint rotation
  - apply a random $[-\frac{\pi}{6}, \frac{\pi}{6}]$ rotation about each of the x, y, and z axes of the keypoints
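The flip-selection and rotation augmentations could be sketched like this; it is a minimal illustration with hypothetical helper names, not the exact code in dataset.py (in particular, rotating about the keypoint centroid is my assumption).

```python
import numpy as np

def choose_keypoints(orig_kp, flipped_kp, p_flip=0.5, rng=None):
    """With probability p_flip use the flipped-image keypoints,
    falling back to the original ones if the flip has no detection."""
    if rng is None:
        rng = np.random.default_rng()
    if flipped_kp is not None and rng.random() < p_flip:
        return flipped_kp
    return orig_kp

def random_rotation_matrix(max_angle=np.pi / 6, rng=None):
    """Compose random rotations about the x, y, and z axes,
    each angle drawn uniformly from [-max_angle, max_angle]."""
    if rng is None:
        rng = np.random.default_rng()
    ax, ay, az = rng.uniform(-max_angle, max_angle, size=3)
    rx = np.array([[1, 0, 0],
                   [0, np.cos(ax), -np.sin(ax)],
                   [0, np.sin(ax), np.cos(ax)]])
    ry = np.array([[np.cos(ay), 0, np.sin(ay)],
                   [0, 1, 0],
                   [-np.sin(ay), 0, np.cos(ay)]])
    rz = np.array([[np.cos(az), -np.sin(az), 0],
                   [np.sin(az), np.cos(az), 0],
                   [0, 0, 1]])
    return rz @ ry @ rx

def augment_keypoints(keypoints, max_angle=np.pi / 6, rng=None):
    """Rotate (N, 3) keypoints about their centroid."""
    center = keypoints.mean(axis=0)
    r = random_rotation_matrix(max_angle, rng)
    return (keypoints - center) @ r.T + center
```

Rotating about the centroid (rather than the origin) keeps the hand in place while changing its orientation.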
The training script for the models is in train_val_cnn_aug.ipynb.
To run inference, use the model weights cnn_model_new.pth and run:

```shell
python runMediaPipe_CNN.py
```
Compared to the previous version, I have added the new Dataset class that applies the random transformations to the keypoints and to the flipped keypoints of each image.
Some things I have tried:
- running MediaPipe during training to obtain keypoints, so that image transformations can be applied first; this works but is very slow
- for the random rotation I originally used the range $[-\pi, \pi]$, but then the model confused letters like O and E

The current hyperparameters are the ones that do best so far. I have tried:
- removing the second CNN layer
- using 16 channels in the second CNN layer
- 2 and 3 hidden layers
- batch size 32
- hidden sizes 64 and 256
- epochs: 50, 100, 200
- lr: 0.01 and 0.005, along with the same weight decay
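Purely for illustration, here is a hypothetical model in the spirit described above (two 1-D CNN layers over the 21 keypoints, the second with more channels, followed by a hidden layer). The real architecture is in train_val_cnn_aug.ipynb; the channel counts, kernel size, and num_classes=26 below are my assumptions.

```python
import torch
import torch.nn as nn

class KeypointCNN(nn.Module):
    """Illustrative 1-D CNN over 21 hand keypoints with 3 coordinates each."""

    def __init__(self, num_classes=26, hidden_size=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(3, 32, kernel_size=3, padding=1),   # first CNN layer
            nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=3, padding=1),  # second CNN layer
            nn.ReLU(),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 21, hidden_size),  # hidden layer
            nn.ReLU(),
            nn.Linear(hidden_size, num_classes),
        )

    def forward(self, x):
        # x: (batch, 3, 21) — coordinates as channels, keypoints as positions
        return self.classifier(self.features(x))
```

With this layout, the keypoint array from preprocessing would be transposed to (3, 21) before being fed to the model.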