ResearcharXiv cs.AI — 21 h ago

Linguistically Augmented Audio Speech Data (LinguAS)

Linguistically Augmented Audio Speech Data (LinguAS) has been released, featuring over 800 annotated audio samples of genuine and deepfaked speech, with five Expert-Defined Linguistic Features (EDLFs) included to enhance model training. Models trained on this dataset demonstrated significant performance improvements over ASVspoof 2021 baselines and self-supervised learning models like HuBert and XLSR. This dataset provides critical linguistic and metadata context, enabling researchers to develop more effective detection models for audio deepfakes by leveraging natural human speech characteristics.

speechdatasetdeepfakerelevance 0.00 · engagement 0.00

Read at source ↗← all news