How to use Hifi-Codec as speech encoder in LLM based TTS? #53

JohnHerry · 2024-08-22T10:24:15Z

Hi, I want to know how to use HiFi-Codec to extract speech tokens for use in LLM training. Say if my HiFi-Codec takes 4 codebooks and codebook_size=256, then there will be 256^4 different speech tokens used in LLM training? it seems terrible. Is there any suggestions?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use Hifi-Codec as speech encoder in LLM based TTS? #53

How to use Hifi-Codec as speech encoder in LLM based TTS? #53

JohnHerry commented Aug 22, 2024

How to use Hifi-Codec as speech encoder in LLM based TTS? #53

How to use Hifi-Codec as speech encoder in LLM based TTS? #53

Comments

JohnHerry commented Aug 22, 2024