-
Notifications
You must be signed in to change notification settings - Fork 1k
Open
Labels
new modelRequest a new modelRequest a new model
Description
Model description
Chatterbox is a multilingual, zero-shot Text-to-Speech (TTS) model designed for flexible voice synthesis across a wide range of languages without requiring task-specific fine-tuning. It is built on a 0.5B parameter Llama-based architecture.
Find some more information on deepwiki
Prerequisites
- The model is supported in Transformers (i.e., listed here)
- The model can be exported to ONNX with Optimum (i.e., listed here)
Additional information
The main repo along with some examples can be found here.
Some attention to it has already been made in the onnx-community
, providing onnx
exports for the pre-trained model here
Your contribution
With appropriate guidance I can contribute directly to the implementation of this feature.
Jerboas86
Metadata
Metadata
Assignees
Labels
new modelRequest a new modelRequest a new model