iLSU-T: an Open Dataset for Uruguayan Sign Language Translation (FG2025)

1. Introduction

This is the code and repository for the article (link to the full paper):

Stassi, A., Boria, Y., Di Martino, M., & Randall, G. (2025). iLSU-T: an Open Dataset for Uruguayan Sign Language Translation. In Proceedings of the 19th IEEE International Conference on Automatic Face and Gesture Recoginition (pp. 1-10).

The main contributions of this work are:

iLSU-T, the first dataset with multimodal video, audio, and text for LSU translation. iLSU-T comprises more than 185 hours of curated video from TV broadcasting in Uruguay.
A preprocessing pipeline to derive the iLSU-T dataset.
A theoretical discussion from the linguistic perspective about the problem of aligning and annotating interpreted sign language videos with text.
The first recorded evaluation and benchmarking of state-of-the-art available methods for sign language translation (SLT) in the LSU context.

1.1. SLT datasets context

Dataset	Source language	Target language	#signers	#hours	#samples	Vocabulary	Video quality	Annotations	Source
Phoenix2014T	DGS	German	9	10.5	8257	2k9	210x260@25,fps	text, gloss	TV
LSA-T	LSA	Spanish	103	21.78	14880	14k2	1920x1080@30,fps	text (SD)	Web
CSL-Daily	CSL	Chinese	10	23	20654	2k5	1920x1080@30,fps	text, gloss	Lab
KETI	KSL	Korean	14	28	14672	419	1920x1080@30,fps	text	Lab
AUSLAN-Daily	Auslan	English	67	45	25106	13k9	1280x720/1920x1080@25-30,fps	text	TV
SIGNUM	DGS	German	25	55.3	33210	N/A	776x578@30,fps	text	Lab
How2Sign	English	ASL	11	79	35k2	16k	1280x720@30,fps	text	Lab
OpenASL	ASL	English	220	288	98417	33k5	variable	text	Web
BOBSL	English	BSL	37	1467	1M2	78k	444x444@25,fps	text	TV
iLSU-T (ours)	Spanish	LSU	18	201.5	86k5	37k9	variable, 343x364@25-30,fps	text (SD)	TV

2. Repository structure

This repository is organized is several folders, one per each process. In the following, it is presented a list of the folders with a brief content description for each one:

preprocessing: preprocessing methods to obtain iLSU-T episodes frow raw data, including text files.
data: csv file with all the iLSU-T episodes and metadata. Please see section 3 for access the dataset. FYI, you might have to adjust paths to data (episodes and whisperx files) in .csv.
video_clipping_and_visual_feats: a Jupyter notebook for exploring the iLSU-T episodes and generate video-clips. You will find the instructions to compute I3D visual features from video-clips.
split_and_package_datasets: the scripts for splitting data into train, val and test sets for the whole dataset, and the three considered subsets in the paper.
slt_config_files: config files for the three SOTA methods used in the paper.

3. iLSU-T dataset download

Please visit this website for download the iLSU-T dataset after accepting the License of Restricted Use.

Available data in the website:

iLSU-T episodes,
WhisperX transcriptions, and
20 hours of manual aligned WhisperX transcriptions (work in progress...)

4. Tested methods on iLSU-T

Acknowledgements

iLSU-T was partially supported by a CAP--UdelaR scholarship, Uruguay. Some of the experiments were carried out using ClusterUY. We acknowledge DiNaTel Uruguay for providing us with the raw data, the NICA--UdelaR team for fruitful interdisciplinary discussions, and G. Gómez and F. Lecumberry for their website assistance.

If you use this code and/or data for your work, please do not forget to cite us:

@inproceedings{stassi2025ilsut,
  title={iLSU-T: an Open Dataset for Uruguayan Sign Language Translation},
  author={Stassi, Ariel E. and Boria, Yanina and Di Martino, J Mat{\'\i}as and Randall, Gregory},
  booktitle={2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition (FG)},
  pages={1--10},
  year={2025},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
0_preprocessing		0_preprocessing
1_data		1_data
2_video_clipping_and_visual_feats		2_video_clipping_and_visual_feats
3_split_and_package_datasets		3_split_and_package_datasets
4_slt_config_files		4_slt_config_files
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

iLSU-T: an Open Dataset for Uruguayan Sign Language Translation (FG2025)

1. Introduction

1.1. SLT datasets context

2. Repository structure

3. iLSU-T dataset download

4. Tested methods on iLSU-T

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

iLSU-T: an Open Dataset for Uruguayan Sign Language Translation (FG2025)

1. Introduction

1.1. SLT datasets context

2. Repository structure

3. iLSU-T dataset download

4. Tested methods on iLSU-T

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages