Google has launched Gemini 3.1 Flash TTS in preview, giving developers prompt-based control over AI speech, multi-speaker ...
DeepL, a translation company best known for its text tools, released a voice-to-voice translation suite today that covers use ...
Abstract: This work demonstrates an experimental implementation of a helper bot using IBM Watson. It is primarily aimed at people who know English as a second language. With the help of IBM Watson ...
French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets ...
wget https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-sense-voice-zh-en-ja-ko-yue-2024-07-17.tar.bz2 tar xvf sherpa-onnx-sense-voice-zh ...
Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.