The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon. Kokoro Fast, ...
Abstract: The attention mechanism improves underwater acoustic target recognition (UATR) by suppressing irrelevant features. However, due to the uncertainty and scarcity of underwater acoustic target ...
Abstract: The information loss or distortion caused by single-channel speech enhancement (SE) harms the performance of automatic speech recognition (ASR). Observation addition (OA) is an effective ...
PEAK is a desktop-style voice assistant with a web UI, built with Python and JavaScript. It uses speech recognition, text-to-speech, and an AI chatbot backend to handle natural-language commands, open ...