Deep Learning for Audio

PyTorch implementations of state-of-the-art methods for audio event classification, sound event detection (SED), and more.


1. Audio Classification

CNN-1D Solution:

  • Build a Conv1d model that operates directly on raw waveforms loaded from audio files
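
As a rough illustration of the raw-waveform approach, here is a minimal sketch of a 1D CNN classifier. The class name `RawWaveCNN1D`, the layer sizes, the 10-class output, and the 16 kHz sample rate are all assumptions made for this example and are not taken from the repository's code.

```python
import torch
import torch.nn as nn


class RawWaveCNN1D(nn.Module):
    """Hypothetical 1D CNN over raw waveforms (illustrative, not the repo's model)."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            # A wide, strided first kernel behaves like a learned filterbank.
            nn.Conv1d(1, 16, kernel_size=64, stride=4, padding=32),
            nn.BatchNorm1d(16),
            nn.ReLU(),
            nn.MaxPool1d(4),
            nn.Conv1d(16, 32, kernel_size=9, padding=4),
            nn.BatchNorm1d(32),
            nn.ReLU(),
            nn.MaxPool1d(4),
            nn.Conv1d(32, 64, kernel_size=9, padding=4),
            nn.BatchNorm1d(64),
            nn.ReLU(),
            # Global average pooling removes the dependence on clip length.
            nn.AdaptiveAvgPool1d(1),
        )
        self.classifier = nn.Linear(64, num_classes)

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        # waveform: (batch, 1, num_samples), i.e. mono audio
        x = self.features(waveform).squeeze(-1)  # (batch, 64)
        return self.classifier(x)  # (batch, num_classes) logits


model = RawWaveCNN1D(num_classes=10)
dummy = torch.randn(2, 1, 16000)  # two 1-second clips at an assumed 16 kHz
logits = model(dummy)
print(logits.shape)
```

Because of the global pooling layer, the same model accepts clips of different lengths, which is convenient when audio files are not trimmed to a fixed duration.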

CNN-2D Solution:

  • Use log-mel spectrograms as input features
  • Use a 2D CNN as the core model, treating each spectrogram like an image in an image classification task
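
The two steps above can be sketched as follows: compute a log-mel spectrogram, then feed it to a small 2D CNN as a one-channel image. This is a sketch under stated assumptions, not the repository's implementation. The helper names (`mel_filterbank`, `log_mel_spectrogram`, `LogMelCNN2D`), the STFT settings, the 64 mel bands, and the 16 kHz sample rate are all hypothetical choices. The filterbank is built by hand with plain PyTorch to keep the example self-contained; in practice a library transform would likely be used instead.

```python
import math

import torch
import torch.nn as nn


def mel_filterbank(sample_rate: int, n_fft: int, n_mels: int) -> torch.Tensor:
    """Triangular mel filters of shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = torch.linspace(0.0, sample_rate / 2, n_fft // 2 + 1)
    # Band edges are evenly spaced on the mel scale, then mapped back to Hz.
    mel_max = 2595.0 * math.log10(1.0 + (sample_rate / 2) / 700.0)
    mel_pts = torch.linspace(0.0, mel_max, n_mels + 2)
    hz_pts = 700.0 * (10.0 ** (mel_pts / 2595.0) - 1.0)
    lower = hz_pts[:-2].unsqueeze(1)
    center = hz_pts[1:-1].unsqueeze(1)
    upper = hz_pts[2:].unsqueeze(1)
    freqs = fft_freqs.unsqueeze(0)
    rising = (freqs - lower) / (center - lower)
    falling = (upper - freqs) / (upper - center)
    return torch.clamp(torch.minimum(rising, falling), min=0.0)


def log_mel_spectrogram(waveform, sample_rate=16000, n_fft=1024,
                        hop_length=256, n_mels=64):
    """waveform: (batch, num_samples) -> log-mel: (batch, n_mels, frames)."""
    stft = torch.stft(waveform, n_fft=n_fft, hop_length=hop_length,
                      window=torch.hann_window(n_fft), return_complex=True)
    power = stft.abs() ** 2  # (batch, n_freqs, frames)
    mel = mel_filterbank(sample_rate, n_fft, n_mels) @ power
    return torch.log(mel + 1e-6)  # small offset avoids log(0)


class LogMelCNN2D(nn.Module):
    """Hypothetical 2D CNN that treats a log-mel spectrogram as an image."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.BatchNorm2d(16), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32), nn.ReLU(), nn.MaxPool2d(2),
            nn.AdaptiveAvgPool2d(1),  # pool over both frequency and time
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, log_mel: torch.Tensor) -> torch.Tensor:
        x = log_mel.unsqueeze(1)  # add channel dim: (batch, 1, n_mels, frames)
        x = self.features(x).flatten(1)  # (batch, 32)
        return self.classifier(x)  # (batch, num_classes) logits


waves = torch.randn(2, 16000)  # two 1-second clips at an assumed 16 kHz
feats = log_mel_spectrogram(waves)
logits2d = LogMelCNN2D(num_classes=10)(feats)
print(feats.shape, logits2d.shape)
```

Treating the spectrogram as a single-channel image is what lets standard image classification architectures be reused here with only the input layer changed.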

Benchmark Dataset

The benchmark audio dataset was obtained from Kaggle


2. Sound Event Detection

TODO


References