From 7665422c491c6cc0420690b224c8ed3a9341802a Mon Sep 17 00:00:00 2001 From: Amit Moryossef Date: Tue, 28 Apr 2026 09:15:52 +0000 Subject: [PATCH] =?UTF-8?q?Add=20B=C3=B6rstell=20(2024)=20on=20lexical=20v?= =?UTF-8?q?ariation=20in=20sign=20language=20corpora?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Cite borstell-2024-approach in the Glosses section, summarizing the paper's overview of approaches (raw counts, relative frequencies, weighted log odds, signer coverage) for measuring lexical variation in gloss-annotated sign language corpora. Co-Authored-By: Claude Opus 4.7 (1M context) --- src/index.md | 2 ++ src/references.bib | 18 ++++++++++++++++++ 2 files changed, 20 insertions(+) diff --git a/src/index.md b/src/index.md index aa3ec617..cda220e7 100644 --- a/src/index.md +++ b/src/index.md @@ -218,6 +218,8 @@ Do not compare to scores produced with a different or unknown evaluation procedu - Given that glossing is corpus-specific, process glosses in a corpus-specific way, informed by transcription conventions. - Optimize gloss translation baselines with methods shown to be effective for low-resource MT. +@borstell-2024-approach surveyed methods for measuring lexical frequency and variation in gloss-annotated sign language corpora, illustrating with Swedish Sign Language (STS) Corpus data how raw counts, relative frequencies, weighted log odds, and signer coverage each reveal different aspects of sociolinguistic variation while being shaped by Zipfian skew, elicitation topics, and annotation conventions. + The following table additionally exemplifies the various representations for more isolated signs. For this example, we use SignWriting as the notation system. diff --git a/src/references.bib b/src/references.bib index 54d87642..c1f97ead 100644 --- a/src/references.bib +++ b/src/references.bib @@ -4207,3 +4207,21 @@ @inproceedings{borstell-2024-evaluating url = {https://aclanthology.org/2024.signlang-1.4}, year = {2024} } + +@inproceedings{borstell-2024-approach, + title = "How to Approach Lexical Variation in Sign Language Corpora", + author = {B{\"o}rstell, Carl}, + editor = "Efthimiou, Eleni and + Fotinea, Stavroula-Evita and + Hanke, Thomas and + Hochgesang, Julie A. and + Mesch, Johanna and + Schulder, Marc", + booktitle = "Proceedings of the LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources", + month = may, + year = "2024", + address = "Torino, Italia", + publisher = "ELRA and ICCL", + url = "https://aclanthology.org/2024.signlang-1.5/", + pages = "46--53" +}