Explore the detailed documentation and usage guides of all open source projects in the AI Audio & Speech category.
A deep learning text-to-speech library supporting multiple languages and voice styles.
A Chinese voice cloning tool that can generate new voices from just 5 seconds of audio samples.
A text-to-audio generation model that can generate highly realistic multi-language speech and sound effects.