Our project for the "Algorithms for speech and natural language processing (MVA 2021)" class
This project is based on the wav2vec 2.0 model, which has been pretrained on a massive unsupervised dataset. We use this pretrained model and perform transfer learning on languages with little labeled data. We study the importance of the pretraining language, the impact of the size of the transfer dataset, and the impact of language similarity between pretraining and fine-tuning.
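As a minimal sketch of this transfer-learning setup (not the project's actual training script), the snippet below loads a pretrained wav2vec 2.0 checkpoint with the Hugging Face `transformers` library, adds a CTC head for the target language, and freezes the convolutional feature extractor so only the transformer layers and the head are fine-tuned; the checkpoint name and configuration values are illustrative placeholders.

```python
# Hedged sketch: fine-tuning a pretrained wav2vec 2.0 checkpoint on a
# low-resource language. Checkpoint name and settings are illustrative.
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

# Processor bundles the feature extractor (raw audio -> features) and the
# tokenizer (characters of the target language <-> label ids).
processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base")

# Load the pretrained acoustic model with a randomly initialized CTC head
# sized for the target-language vocabulary.
model = Wav2Vec2ForCTC.from_pretrained(
    "facebook/wav2vec2-base",
    ctc_loss_reduction="mean",
    pad_token_id=processor.tokenizer.pad_token_id,
    vocab_size=len(processor.tokenizer),
)

# Freeze the CNN feature extractor: during transfer learning only the
# transformer encoder and the CTC head receive gradient updates.
model.freeze_feature_extractor()
```

With this setup, fine-tuning proceeds as usual (e.g. with the `transformers` `Trainer`), and the size of the labeled transfer dataset is the main variable we vary in the experiments.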
The following figure shows the Wav2Vec2 architecture: