Skip to content

Latest commit

 

History

History
57 lines (36 loc) · 1.48 KB

ReadMe.md

File metadata and controls

57 lines (36 loc) · 1.48 KB

Simulation set synthesis

1. The simulation rirs

Parameter and path in configuration tencent_challenge_rirgenerator.py file

python ./tencent_challenge_rirgenerator.py

If you use the supplied RIR, this step is not required.

2. Prepare speech and noise sets

2.1 Download the data

speech data:

aishell-1

aishell-3

librispeech-360

VCTK

noise data:

MUSAN

Audioset

2.2 Generate the list file and partition the training and checksum sets

Configure the path to several datasets in ./prepare.sh

bash ./prepare.sh

3. Generate the data

Configure using the supplied parameters

Attention to the data/[dev | train]_[linear|circle]_simu_mix.config . In the config file path should be replaced with the corresponding path.
# dev set of linear mic array 
    python mix_wav.py --mix_config_path=./data/dev_linear_simu_mix.config --save_dir=./data/wavs/dev/simu_linear/ --chunk_len=6 --generate_config=False 

Use the new parameter configuration

# dev set of linear mic array 
    python mix_wav.py --clean_wav_list=./data/dev_clean.lst --noise_wav_list=./data/dev_noise.lst --rir_wav_list=./data/dev_linear_rir.lst --mix_config_path=./data/dev_linear_simu_mix.config --save_dir=./data/wavs/dev/simu_linear --chunk_len=6 --generate_config=True