ai-digest.dev
last updated 4 h ago
MultimodalarXiv cs.AI 16 d ago

QC-GAN: A Parameter-Efficient Quaternion Conformer GAN for High-Fidelity Speech Enhancement

The QC-GAN framework introduces a parameter-efficient approach for speech enhancement by leveraging a Quaternion Conformer generator and MetricGAN-based training, achieving high fidelity with only 0.89 million parameters. It attained a PESQ score of 3.48 on the VoiceBank+DEMAND dataset, demonstrating performance comparable to state-of-the-art models at a fraction of the parameter count. Additionally, a smaller variant with 35K parameters achieved a PESQ score of 3.23, indicating significant efficiency improvements for practitioners focused on resource-constrained applications in speech processing.

speech enhancementGANaudiorelevance 0.00 · engagement 0.00
Read at source ↗← all news
QC-GAN: A Parameter-Efficient Quaternion Conformer GAN for High-Fidelity Speech Enhancement — AI News Digest