ChatTTS is a voice generation model on GitHub at 2noise/chattts, specifically designed for conversational scenarios. It is ideal for applications such as dialogue tasks for large language model assistants, as well as conversational audio and video introductions. The model supports both Chinese and English, demonstrating high quality and naturalness in speech synthesis. ChatTTS is trained on approximately 100,000 hours of Chinese and English data.