Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
Abstract: Automatic Speech Recognition (ASR) systems are widely used for speech censoring. Speech Adversarial Examples (AEs) offer a novel approach to protecting speech privacy by forcing ASR to ...