AI Audio & Speech

Explore the detailed documentation and usage guides of all open source projects in the AI Audio & Speech category.

A deep learning text-to-speech library supporting multiple languages and voice styles.

A Chinese voice cloning tool that can generate new voices from just 5 seconds of audio samples.

A text-to-audio generation model that can generate highly realistic multi-language speech and sound effects.