ai-digest.dev
last updated 4 h ago
AgentsReddit r/LocalLLaMA 5 d ago

Voice-to-voice chatbot update

A voice-to-voice chatbot has been developed utilizing the Qwen3.5-397B model, Whisper-small for speech-to-text, and Orpheus Q4_K_XL for text-to-speech, with a custom SNAC decoder implemented on ONNX. The system achieves near real-time performance with interruptibility while maintaining conversational context, operating efficiently on a 24 GB GPU with a VRAM usage of 21.3 GB or less, and supporting up to 131,072 tokens for extended dialogue. This development is significant for practitioners as it enhances local processing capabilities for voice interactions in AI applications, ensuring privacy and reducing latency.

chatbotvoiceaireal-timerelevance 0.00 · engagement 0.00
Read at source ↗← all news
Voice-to-voice chatbot update — AI News Digest