I worked on a project titled Audio Visual Synthesis. The problem statement was: "For a given speech signal, generate the corresponding lip movements." To achieve this, I used a phonetically rich audio-visual database containing over 9000 sentences spoken by 4 subjects. In this work I chose an LSTM-RNN model for predicting the lip shape, as LSTM networks are capable of learning long-term dependencies. Based on the speech input, lip shapes were predicted, and a short video of a talking head was generated.
Pre-processing of the video and audio was done in MATLAB. Further, a bidirectional LSTM-RNN model was implemented using the Keras library.
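The sketch below illustrates the kind of bidirectional LSTM-RNN described above, built with Keras. It is a minimal example, not the original project code: the feature sizes (N_AUDIO_FEATURES, N_LIP_PARAMS), layer width, and dummy training data are illustrative assumptions; the actual model maps per-frame speech features to lip-shape parameters in the same spirit.

# Minimal sketch (assumed dimensions, not the original code): a bidirectional
# LSTM-RNN in Keras that maps per-frame audio features to lip-shape parameters.
import numpy as np
from keras.models import Sequential
from keras.layers import Bidirectional, LSTM, TimeDistributed, Dense

N_AUDIO_FEATURES = 39   # assumed: e.g. MFCCs plus deltas per audio frame
N_LIP_PARAMS = 20       # assumed: dimensionality of the lip-shape representation

model = Sequential([
    # The bidirectional LSTM reads the utterance forwards and backwards,
    # so each frame's prediction can use both past and future acoustic context.
    Bidirectional(LSTM(128, return_sequences=True),
                  input_shape=(None, N_AUDIO_FEATURES)),
    # One lip-shape vector is predicted for every audio frame.
    TimeDistributed(Dense(N_LIP_PARAMS)),
])
model.compile(optimizer="adam", loss="mse")

# Usage with dummy data: 8 utterances, 100 frames each.
X = np.random.rand(8, 100, N_AUDIO_FEATURES)
y = np.random.rand(8, 100, N_LIP_PARAMS)
model.fit(X, y, epochs=1, batch_size=4)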