Skip to content

Latest commit

 

History

History
32 lines (25 loc) · 1.96 KB

citations.md

File metadata and controls

32 lines (25 loc) · 1.96 KB

Citations

  1. Prawda, Karolina, Schlecht, Sebastian J., & Välimäki, Vesa. Dataset of impulse responses from variable acoustics room Arni at Aalto Acoustic Labs [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6985104, 2022.

  2. Richter, Julius, de Oliveira, Danilo, & Gerkmann, Timo.
    Investigating Training Objectives for Generative Speech Enhancement.
    arXiv preprint, arXiv:2409.10753, 2024.

  3. Richter, Julius, Welker, Simon, Lemercier, Jean-Marie, Lay, Bunlong, & Gerkmann, Timo.
    Speech Enhancement and Dereverberation with Diffusion-based Generative Models.
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 2351–2364.
    https://doi.org/10.1109/TASLP.2023.3285241, 2023.

  4. Richter, Julius, Wu, Yi-Chiao, Krenn, Steven, Welker, Simon, Lay, Bunlong, Watanabe, Shinji, Richard, Alexander, & Gerkmann, Timo. EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation. In ISCA Interspeech, 2024.

  5. Schroeder, M. R. New Method of Measuring Reverberation Time. Journal of the Acoustical Society of America March 1968, 37(3), 409–412. https://doi.org/10.1121/1.1938260, March 1968.

  6. Welker, Simon, Richter, Julius, & Gerkmann, Timo.
    Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain.
    In Proceedings of Interspeech 2022, 2928–2932.
    https://doi.org/10.21437/Interspeech.2022-10653, 2022.

  7. Wichern, Gordon, Antognini, Joe, Flynn, Michael, Zhu, Licheng Richard, McQuinn, Emmett, Crow, Dwight, Manilow, Ethan, & Le Roux, Jonathan. WHAM!: Extending Speech Separation to Noisy Environments. In Proceedings of Interspeech, September 2019.