TrainingarXiv cs.AI — 16 d ago

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models

The article presents a novel target-side paraphrase augmentation strategy for sign language translation (SLT) using GPT-4o to generate paraphrased variants of spoken-language sentences while keeping sign inputs constant. A Signformer-style Transformer is trained in a two-stage process involving pre-training on the augmented dataset and fine-tuning on original references, leading to a BLEU-4 score improvement on the PHOENIX14T dataset from 9.56 to 10.33. This research highlights the potential of LLM-generated paraphrases to enhance decoder generalization in SLT, emphasizing the need for semantic evaluations alongside traditional lexical metrics.

sign-languagellmparaphrase-augmentationrelevance 0.00 · engagement 0.00

Read at source ↗← all news