Long pauses between sentences #9

tarsobcaldas · 2024-03-28T15:20:30Z

I'm not sure if it's a speech dispatcher problem or if it's something with the configuration, but when I run the following command, I get fairly short pauses between sentences, but when I run the phrase with spd-say, the pauses become very long, disrupting the flow of the reading.

echo "Then I realized that far from being lost, the details of these beers \
had been carefully stored in archives and brewery store rooms across Britain. \
Discovering the secrets of these lost beers was a possibility. All that was \ 
required was a bit of effort and determination."  | \
piper -m /home/noaxp/.var/app/com.mikeasoft.pied/data/pied/models/en_US-lessac-high.onnx \
--output_raw  | aplay -r 22050 -f S16_LE -t raw -

I've tried changing the configuration file piper.conf to include the flag --sentence_silence, but it doesn't seem to have any effect whatsoever, not to make it shorter or longer.

Still haven't checked how it's working with other output modules.

The text was updated successfully, but these errors were encountered:

KAGEYAM4 · 2024-05-01T08:55:56Z

@tarsobcaldas did you found any solution, i am having the same problem.

tarsobcaldas · 2024-05-01T20:31:34Z

Not yet, unfortunately

KAGEYAM4 · 2024-05-12T11:29:28Z

@tarsobcaldas found solution, it works for me.

source -> ken107/read-aloud#375 (comment)

This is my config, for reference ->

piper.conf

DefaultVoice "en/en_GB/alan/medium/en_GB-alan-medium.onnx"

# Specifying a rarely used symbol & big limit so that speech-dispatcher doesn't cut text into chunks:
GenericDelimiters "˨"
GenericMaxChunkLength 1000000

# These lines are important to specify for every language you'll use, otherwise some characters will not work:
GenericLanguage "en" "en-us" "utf-8"
#GenericLanguage "en" "en-gb" "utf-8"
#GenericLanguage "ru" "ru" "utf-8"

GenericCmdDependency "sox"
GenericCmdDependency "aplay"

GenericExecuteSynth \
"echo '$DATA' | /usr/bin/piper-tts --model '/usr/share/piper-voices/$VOICE' --output_raw | sox -r 22050 -c 1 -b 16 -e signed-integer -t raw - -t wav - tempo $RATE pitch $PITCH norm | aplay -r 22050 -f S16_LE -t raw -"

GenericRateAdd 1
GenericPitchAdd 1
GenericVolumeAdd 1
GenericRateMultiply 1
GenericPitchMultiply 1000

# Adding all voices we want:
#AddVoice "en" "FEMALE1" "en/en_GB/jenny_dioco/medium/en_GB-jenny_dioco-medium.onnx"
#AddVoice "en" "MALE1" "en/en_GB/alan/medium/en_GB-alan-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_GB/semaine/medium/en_GB-semaine-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_US/hfc_female/medium/en_US-hfc_female-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_GB/alba/medium/en_GB-alba-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_US/amy/medium/en_US-amy-medium.onnx"
#AddVoice "ru" "MALE1" "ru/ru_RU/dmitri/medium/ru_RU-dmitri-medium.onnx"

AddVoice "en" "MALE1" "en/en_US/ryan/high/en_US-ryan-high.onnx"

speechd.conf

AddModule "piper" "sd_generic" "piper.conf"
DefaultModule piper
LanguageDefaultModule "en" "piper"

tarsobcaldas · 2024-05-23T17:02:56Z

Yes, it seems that adding these lines solves the problem:

GenericDelimiters "˨"
GenericMaxChunkLength 1000000

KAGEYAM4 · 2024-05-24T11:42:04Z

even with the above, i was still facing delay when new paragraph starts. i switched to this - brailcom/speechd#866 (comment) --- make sure you are using medium model for this.

mak448a · 2024-06-27T20:07:07Z

@tarsobcaldas Could you reopen this issue? The solution was only a workaround. The file in there says "GENERATED BY PIED," which means that it can probably be fixed on pied's side.

rizzini · 2024-12-18T00:04:49Z

@tarsobcaldas found solution, it works for me.

source -> ken107/read-aloud#375 (comment)

This is my config, for reference ->

piper.conf

DefaultVoice "en/en_GB/alan/medium/en_GB-alan-medium.onnx"

# Specifying a rarely used symbol & big limit so that speech-dispatcher doesn't cut text into chunks:
GenericDelimiters "˨"
GenericMaxChunkLength 1000000

# These lines are important to specify for every language you'll use, otherwise some characters will not work:
GenericLanguage "en" "en-us" "utf-8"
#GenericLanguage "en" "en-gb" "utf-8"
#GenericLanguage "ru" "ru" "utf-8"

GenericCmdDependency "sox"
GenericCmdDependency "aplay"

GenericExecuteSynth \
"echo '$DATA' | /usr/bin/piper-tts --model '/usr/share/piper-voices/$VOICE' --output_raw | sox -r 22050 -c 1 -b 16 -e signed-integer -t raw - -t wav - tempo $RATE pitch $PITCH norm | aplay -r 22050 -f S16_LE -t raw -"

GenericRateAdd 1
GenericPitchAdd 1
GenericVolumeAdd 1
GenericRateMultiply 1
GenericPitchMultiply 1000

# Adding all voices we want:
#AddVoice "en" "FEMALE1" "en/en_GB/jenny_dioco/medium/en_GB-jenny_dioco-medium.onnx"
#AddVoice "en" "MALE1" "en/en_GB/alan/medium/en_GB-alan-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_GB/semaine/medium/en_GB-semaine-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_US/hfc_female/medium/en_US-hfc_female-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_GB/alba/medium/en_GB-alba-medium.onnx"
#AddVoice "en" "FEMALE1" "en/en_US/amy/medium/en_US-amy-medium.onnx"
#AddVoice "ru" "MALE1" "ru/ru_RU/dmitri/medium/ru_RU-dmitri-medium.onnx"

AddVoice "en" "MALE1" "en/en_US/ryan/high/en_US-ryan-high.onnx"

speechd.conf

AddModule "piper" "sd_generic" "piper.conf"
DefaultModule piper
LanguageDefaultModule "en" "piper"

This worked for me as well. Thank you.

tarsobcaldas closed this as completed May 23, 2024

tarsobcaldas reopened this Jun 28, 2024

outusuke mentioned this issue Nov 24, 2024

Getting an error when selecting the voice #24

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Long pauses between sentences #9

Long pauses between sentences #9

tarsobcaldas commented Mar 28, 2024

KAGEYAM4 commented May 1, 2024

tarsobcaldas commented May 1, 2024

KAGEYAM4 commented May 12, 2024

tarsobcaldas commented May 23, 2024

KAGEYAM4 commented May 24, 2024

mak448a commented Jun 27, 2024

rizzini commented Dec 18, 2024

Long pauses between sentences #9

Long pauses between sentences #9

Comments

tarsobcaldas commented Mar 28, 2024

KAGEYAM4 commented May 1, 2024

tarsobcaldas commented May 1, 2024

KAGEYAM4 commented May 12, 2024

tarsobcaldas commented May 23, 2024

KAGEYAM4 commented May 24, 2024

mak448a commented Jun 27, 2024

rizzini commented Dec 18, 2024