![Alibaba_SpeechAI Profile](https://pbs.twimg.com/profile_images/1839489881011281920/O3Ro8MJm_x96.jpg)
Alibaba_SpeechAI
@TONGYI_SpeechAI
Followers
2K
Following
241
Statuses
190
As part of Alibaba's Tongyi Lab, we focus on multimodal speech and language models like FunAudioLLM, FunASR, and CosyVoice. Explore our 200+ open-source models!
Joined April 2024
The Tongyi Speech Team has open-sourced two foundational speech models: SenseVoice and CosyVoice. 😄SenseVoice, a multilingual audio understanding model: Its multilingual speech recognition outperforms Whisper by 50% in Chinese and Cantonese, with inference speed 15 times faster, and it supports state-of-the-art emotion recognition and audio event detection. 😄CosyVoice, a multilingual audio generation model: Trained on over 170,000 hours of multilingual audio data, it supports multilingual speech generation, timbre and emotion control. CosyVoice excels in multilingual speech generation, zero-shot speech generation, cross-lingual voice synthesis, and instruction execution. We look forward to developers experiencing and using our models, and we appreciate your valuable feedback. If you like our team's open-source projects, please don't hesitate to give us a star on GitHub. GitHub: Demo Samples: Techinical Report:
17
68
219
@BobbyApocalypse Haven't tried using ComfyUI yet, but support will be added gradually in the future.😊
0
0
1
@dotey 嗯嗯,同样的期待!主要是我们开源项目中缺乏web相关技术栈背景的同学,算法同学以python与c++为主,希望我们这些技术项目早点上百炼平台,届时有更多方式的调用方法;也热盼社区开发者们能参与进来支持其他语言的开发呀🤗
0
0
1