Dubai-based Camb.AI focuses on speech synthesis and translation for media dubbing. Palabra, backed by Reddit co-founder ...
Google’s Gemini 3.1 Flash TTS adds audio tags, 70-plus languages, and SynthID watermarking for more controllable AI-generated ...
Silicon Valley startup Sabi is emerging from stealth with that goal. The company is developing a brain wearable that decodes ...
DeepL, a translation company best known for its text tools, released a voice-to-voice translation suite today that covers use ...
Google launched a free offline AI dictation app on iOS, highlighting a shift toward private, on-device speech-to-text tools.
Abstract: In this paper, the authors conduct an experimental work on neural networks Text-to-Speech (TTS), aiming to facilitate an appropriate understanding of current research and future tendencies ...
Abstract: This paper introduces a novel deep learning-based system for real-time American Sign Language (ASL) interpretation and translation into speech, aimed at improving communication for ...
Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.