Hey @yl4579 thank you for your great work on this (and StyleTTS).
I was wondering if there was a reason for using " " as the blank token in the CTCLoss, instead of something distinct from anything G2p can return, as is suggested here? I was thinking of using something like id 80, appended onto the vocab defined here.
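To make the idea concrete, here is a rough sketch of what I have in mind. The symbol list is a shortened, made-up stand-in for the real vocab, and the `<blank>` name and the `encode` helper are hypothetical, not taken from the repo.

```python
# Hypothetical, shortened symbol list; the real vocab is larger.
symbols = ["<pad>", " ", "a", "b", "c"]

# Existing mapping: " " keeps its own id as a real word-boundary symbol.
symbol_to_id = {s: i for i, s in enumerate(symbols)}

# Append a dedicated CTC blank so it cannot collide with anything G2p emits.
BLANK = "<blank>"  # hypothetical name, not from the repo
symbol_to_id[BLANK] = len(symbol_to_id)
blank_id = symbol_to_id[BLANK]


def encode(phonemes):
    """Map G2p-style output to ids; the blank symbol is never produced here."""
    return [symbol_to_id[p] for p in phonemes]


targets = encode(["a", " ", "b"])

# The blank id never appears in the targets, while " " still does.
assert blank_id not in targets
assert symbol_to_id[" "] in targets

# The loss would then be built with this index, e.g.
# torch.nn.CTCLoss(blank=blank_id), and the aligner's output layer
# would need len(symbol_to_id) classes to cover the extra id.
```

With the real vocab, `blank_id` would simply be the next free id after the existing symbols.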
I was also wondering whether this would affect the downstream training of StyleTTS much, or whether the aligner just has to be a "good enough" starting point?
Thanks!