You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[A bug happened!]
the .onnx version has unclear pronunciation of some words such as "质量", but the .pt version has no problem.
text: '图的质量总体来说比之前好,会有些图的质量非常高,但不稳定,并且画风会变来变去。'
phonemes: 'tʰu↗ tɤ ꭧɨ↘lja↘ŋ ʦʊ↓ŋtʰi↓ lai↗ʂwo→ pi↓ ꭧɨ→ʨʰjɛ↗n xau↓, xwei↘ jou↓ɕje→ tʰu↗ tɤ ꭧɨ↘lja↘ŋ fei→ꭧʰa↗ŋ kau→, ta↘n pu↘ wə↓nti↘ŋ. pi↘ŋʨʰje↓ xwa↘fə→ŋ xwei↘ pjɛ↘nlai↗pjɛ↘nʨʰy↘.'
import soundfile as sf
from kokoro import KPipeline
pipeline = KPipeline(lang_code='z', device='cpu')
text = '图的质量总体来说比之前好,会有些图的质量非常高,但不稳定,并且画风会变来变去。'
zh_voices = ['zf_xiaobei', 'zf_xiaoni', 'zf_xiaoxiao', 'zf_xiaoyi',
'zm_yunjian', 'zm_yunxi', 'zm_yunxia', 'zm_yunyang']
for voice in zh_voices:
for graphemes, phonemes, audio in pipeline(text, voice=voice):
samples = audio.shape[0] if audio is not None else 0
assert samples > 0, "No audio generated"
print(f'{voice} speak {text} {phonemes}')
sf.write(f'output/pt_version_{voice}.wav', audio, 24000)
example audio files are attatched above
### What OS are you seeing the problem on?
Window
### Package version
0.4.2
### Relevant log output
```shell
The text was updated successfully, but these errors were encountered:
The English has never been a problem, but there are issues with the Chinese in the example above. I modified it based on language.py, which uses an English example. The bug I reported is the Chinese example.
What happened?
[A bug happened!]
the .onnx version has unclear pronunciation of some words such as "质量", but the .pt version has no problem.
text: '图的质量总体来说比之前好,会有些图的质量非常高,但不稳定,并且画风会变来变去。'
phonemes: 'tʰu↗ tɤ ꭧɨ↘lja↘ŋ ʦʊ↓ŋtʰi↓ lai↗ʂwo→ pi↓ ꭧɨ→ʨʰjɛ↗n xau↓, xwei↘ jou↓ɕje→ tʰu↗ tɤ ꭧɨ↘lja↘ŋ fei→ꭧʰa↗ŋ kau→, ta↘n pu↘ wə↓nti↘ŋ. pi↘ŋʨʰje↓ xwa↘fə→ŋ xwei↘ pjɛ↘nlai↗pjɛ↘nʨʰy↘.'
example audios:
.onnx
.pt
Steps to reproduce
The latest onnx version kokoro-onnx 0.4.2
native kokoro==0.7.12 misaki[zh]==0.7.12 (https://github.com/hexgrad/kokoro) has no problem:
The text was updated successfully, but these errors were encountered: