The VAD toolkit in this project was used in the paper: J. Kim and M. Hahn, "Voice Activity Detection Using an Adaptive Context Attention Model," in IEEE Signal Processing ...
Abstract: In this study, we explore the use of Vector Quantized Variational Autoencoders (VQ-VAE) for real-time audio spectrogram inpainting, with a focus on minimizing environmental impact. We ...
Intuitively, a Time-Aliased-Hann window is a sound sample that starts playing in the middle and slowly fades to zero, while at the same time we start playing samples from the beginning and slowly fade in ...
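
One possible reading of this intuition, given here only as a minimal sketch: take a frame of even length 2N, multiply it by a periodic Hann window, and fold (time-alias) it onto N samples by adding the second half to the first half. The second half then starts at full gain and fades out, while the first half fades in underneath it. The function names `hann_periodic` and `time_aliased_hann`, and the choice of a periodic Hann window, are assumptions made for illustration and are not taken from the original text.

```python
import math


def hann_periodic(length):
    """Periodic Hann window: w[n] = 0.5 - 0.5 * cos(2*pi*n / length)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / length)
            for n in range(length)]


def time_aliased_hann(frame):
    """Fold a Hann-windowed frame of even length 2N onto N samples.

    Assumed interpretation: output[n] is the sum of
      - the sample N positions ahead, which starts "in the middle"
        at full gain and fades to zero, and
      - the sample at position n, which starts at the beginning
        at zero gain and fades in.
    """
    if len(frame) % 2 != 0:
        raise ValueError("frame length must be even")
    half = len(frame) // 2
    w = hann_periodic(len(frame))
    return [
        w[n + half] * frame[n + half]  # mid-frame part, fading out
        + w[n] * frame[n]              # start-of-frame part, fading in
        for n in range(half)
    ]


if __name__ == "__main__":
    # A constant input shows that the two gains are complementary.
    print(time_aliased_hann([1.0] * 8))
```

With the periodic Hann window the fade-out and fade-in gains sum to one at every position, so under this reading a constant input passes through the fold unchanged (up to floating-point rounding).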