Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.
The nodes were originally made for use in the Comfyroll Template Workflows.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results