ChatTTS is a voice generation model on GitHub at 2noise/chattts,Chat TTS is specifically designed for conversational scenarios. It is ideal for applications such as dialogue tasks for large language model assistants, as well as conversational audio and video introductions. The model supports both Chinese and English, demonstrating high quality and naturalness in speech synthesis. This level of performance is achieved through training on approximately 100,000 hours of Chinese and English data. Additionally, the project team plans to open-source a basic model trained with 40,000 hours of data, which will aid the academic and developer communities in further research and development.
各参数如上:
① text: 要合成语音的文字
② Refine text:是否自动优化文本
③ 温度:随机性参数,越大,随机性越高。可能更好,可能更差
④ voice: 默认 2222, 决定音色的数字, 2222 | 7869 | 6653 | 4099 | 5099,可选其一,或者任意传入将随机使用音色
⑤custom_voice:用于定制声音音调生成的种子值,必须是正整数。如果设置了这个值,将优先使用,并忽略 voice
参数。
⑥ prompt:设置笑声、停顿等。例如,[oral_2][laugh_0][break_6]