clovaai / ClovaCall

ClovaCall dataset and PyTorch LAS baseline code (Interspeech 2020)

speech-recognition speech-corpus korean-speech call-based-speech-corpus goal-oriented-dialog interspeech2020

Updated Apr 5, 2022
Python

yc9701 / pansori

Tools for ASR Corpus Generation from Online Video

corpus speech-recognition dataset-generation data-pipeline online-video speech-corpus

Updated Feb 10, 2019
Python

kan-bayashi / LibriTTSLabel

Alignment files of LibriTTS.

speech-synthesis speech-corpus

Updated Mar 16, 2020

lennes / spect

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

annotation analysis speech transcript corpus-linguistics transcription spoken-language praat corpus-tools speech-analysis conversational-speech speech-corpus spect

Updated Aug 11, 2023
HTML

khiajohnson / SpiCE-Corpus

An open-access corpus of conversational bilingual speech in Cantonese and English

corpus english-language cantonese-language bilingual-corpora speech-corpus spice-corpus

Updated Apr 28, 2022
JavaScript

ruslan-corpus / ruslan-corpus.github.io

text-to-speech tts russian speech-dataset speech-corpus

Updated Aug 29, 2019
HTML

dcavar / ELAN2split

Split ELAN annotation files and corresponding speech files into a corpus format for common ASR systems and forced aligners

xml sox cpp11 speech-recognition elan forced-alignment xerxes speech-corpus

Updated Oct 15, 2018
C++

AsoSoft / AsoSoft-Speech-Corpus

AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition, gender identification, and phonetic analysis.

speech-corpus central-kurdish

Updated Mar 8, 2022

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

kevobt / speech-to-text-voxforge

Downloader for the VoxForge corpus

downloader generator voxforge speech-corpus

Updated May 10, 2018
Python

ubaleht / SiberianIngrianFinnish

This project is devoted to the Siberian Ingrian Finnish language. Siberian Ingrian Finnish is a language (dialect) used by the descendants of settlers who spoke Lower Luga Ingrian Finnish varieties and Lower Luga Ingrian (Izhorian), and who have been living in Omsk oblast (previously they also lived in other regions of Siberia) for more tha…

finnish speech-corpus finnish-language ingrian-finnish izhorian

Updated Apr 11, 2024
C#

ina-foss / InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

joneavila / DRAL

Code for Dialogs Re-enacted Across Languages (DRAL)

prosody speech-corpus

Updated Nov 4, 2023
Python

vectominist / Switchboard-WSJ-Utils

Utilities for preprocessing the Switchboard and WSJ corpora in Python 3

python switchboard torchaudio speech-corpus wsj wtimit

Updated Jul 31, 2020
Python

ubaleht / SiberianTatar

This project is devoted to the dialects of the Siberian Tatars. Around 100,000 people speak these dialects. The language of the Siberian Tatars consists of three dialects: Tobolo-Irtysh, Tom and Baraba.

tatar speech-corpus

Updated Oct 6, 2024

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

A freely licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection manatts

Updated Sep 22, 2024

mborsdorf / TargetLanguageExtraction

audio multilingual python deep-learning matlab pytorch speech-processing audio-processing source-separation speech-separation speech-dataset auditory-attention speech-corpus speaker-extraction speech-database

Updated Feb 8, 2022

mllpresearch / Europarl-ASR

A 1300-hour English speech and text corpus of parliamentary debates for streaming ASR training and benchmarking, speech data filtering, and speech data verbatimization.

automatic-speech-recognition speech-corpus streaming-asr speech-data-filtering speech-data-verbatimization

Updated Mar 30, 2024

mbar0075 / Speech-Technology

Deliverables relating to the Speech Technology university unit (notes courtesy of Dr. Andrea De Marco)

deep-learning keras mfcc cnn-architecture cnn-classification speaker-identification mel-spectrogram speech-corpus speech-technology

Updated Jan 3, 2024
Jupyter Notebook

mllpresearch / ESO-dataset

ESO speech dataset: an English-language speech corpus in the oncology domain for ASR training and benchmarking and for MT benchmarking.

machine-translation automatic-speech-recognition oncology domain-adaptation speech-corpus speech-translation large-language-models llm

Updated Apr 15, 2024

speech-corpus

20 public repositories match this topic (listed above).
