The pipeline provides a fully open and modular approach, with a focus on leveraging models available through the Transformers library on the Hugging Face hub. The code is designed for easy ...
Creating audio content for your business doesn’t mean you have to invest in expensive production tools or hire voice actors. For businesses with an occasional need for audio, free text-to-speech ...
This is the official repository 👑 for the WenetSpeech-Yue dataset and the source code for WenetSpeech-Pipe speech data preprocessing pipeline. To address the unique linguistic characteristics of ...
Abstract: In this paper, we introduce a speech-conditioned Large Language Model (LLM) integrated with a Mixture of Experts (MoE) based connector to address the challenge of Code-Switching (CS) ...