A 64x64 pre-trained diffusion model is all you need for 1-step high-resolution SOTA generation
NeurIPS24
Unified framework enables diverse samplers and 1-step generation SOTAs
ICLR24
Applications:
[SoundGen]

Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data
DAFx24

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
ICASSP23

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
NeurIPS23
### Contact