From 84f56cb334b6eff10453f04443f6935ee6725d86 Mon Sep 17 00:00:00 2001
From: Amit Moryossef
Date: Tue, 28 Apr 2026 09:22:19 +0000
Subject: [PATCH] Add DiffSLVA (Xia et al., 2024) to Pose-to-Video / SLVA discussion

Cites the SignLang 2024 paper "Diffusion Models for Sign Language Video
Anonymization" alongside the existing AnonySign discussion, noting its
use of text-guided latent diffusion + ControlNet (HED) and a facial
expression enhancement module.

Co-Authored-By: Claude Opus 4.7 (1M context)
---
 src/index.md       |  1 +
 src/references.bib | 13 +++++++++++++
 2 files changed, 14 insertions(+)

diff --git a/src/index.md b/src/index.md
index ff6d846c..5165791d 100644
--- a/src/index.md
+++ b/src/index.md
@@ -438,6 +438,7 @@ retaining the original sign language content.
 Using a conditional variational autoencoder framework, they first extracted pose information from the source video to remove the original signer appearance,
 then generated a photo-realistic sign language video of a novel appearance from the pose sequence.
 The authors proposed a novel style loss that ensures style consistency in the anonymized sign language videos.
+Extending this line of work, @xia-etal-2024-diffusion proposed DiffSLVA, which leverages pre-trained large-scale text-guided latent diffusion models with ControlNet conditioned on Holistically-Nested Edge (HED) maps to circumvent the need for accurate pose estimation, and adds a dedicated facial expression enhancement module to preserve linguistically essential non-manual features.

 ##### Sign Language Avatars
diff --git a/src/references.bib b/src/references.bib
index 7c432b7c..d70cea9c 100644
--- a/src/references.bib
+++ b/src/references.bib
@@ -4354,6 +4354,15 @@ @inproceedings{uchida-etal-2024-hamnosys
     author = "Uchida, Tsubasa and
       Miyazaki, Taro and
       Kaneko, Hiroyuki",
+}
+
+@inproceedings{xia-etal-2024-diffusion,
+    title = "Diffusion Models for Sign Language Video Anonymization",
+    author = "Xia, Zhaoyang and
+      Zhou, Yang and
+      Han, Ligong and
+      Neidle, Carol and
+      Metaxas, Dimitris N.",
     editor = "Efthimiou, Eleni and
       Fotinea, Stavroula-Evita and
       Hanke, Thomas and
@@ -4368,3 +4377,7 @@ @inproceedings{uchida-etal-2024-hamnosys
     url = "https://aclanthology.org/2024.signlang-1.42/",
     pages = "376--385"
 }
+
+    url = "https://aclanthology.org/2024.signlang-1.44/",
+    pages = "395--407"
+}