# wav2lip384_train

I'm actually training a version of this myself. The input data for `0.py` lives in `data_train/` (a set of `.mp4` files); `1.py` trains SyncNet, and `2.py` trains the Wav2Lip network. `跑通代码.png` is a screenshot showing the code running end-to-end: the Wav2Lip training ran, and `1.py` also ran successfully. After training, inference is still done with `inference.py`, used the same way as the original Wav2Lip. Compute is limited at the moment, so training is not finished yet; if you have finished training a model, please open an issue so we can compare notes.

## **Wav2Lip** - a modified Wav2Lip 384 version

Lip-syncing videos using the pre-trained models (Inference)
-------

You can lip-sync any video to any audio:

```bash
python inference.py --checkpoint_path <path_to_checkpoint> --face <path_to_video> --audio <path_to_audio>
```

The result is saved (by default) in `results/result_voice.mp4`. You can specify the output path as an argument, similar to several other available options. The audio source can be any file supported by `FFMPEG` containing audio data: `*.wav`, `*.mp3`, or even a video file, from which the code will automatically extract the audio. A concrete example invocation is sketched at the end of this README.

Train!
----------

There are two major steps: (i) train the expert lip-sync discriminator, and (ii) train the Wav2Lip model(s).

##### Training the expert discriminator

You can use your own data (at 384x384 resolution):

```bash
python parallel_syncnet_tanh.py
```

##### Training the Wav2Lip models

You can train the model either without the additional visual quality discriminator (< 1 day of training) or with the discriminator (~2 days). For the former, run:

```bash
python parallel_wav2lip_margin.py
```
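The numbered scripts are only described briefly above, so the end-to-end order below is an assumption pieced together from that description (`0.py` for preprocessing, `1.py` for SyncNet, `2.py` for Wav2Lip), not a documented interface - check each script's arguments before running.

```bash
# Assumed training pipeline - a sketch based on the notes above, not a documented interface.
# 0.py reads the raw .mp4 clips placed in data_train/,
# 1.py trains the expert SyncNet discriminator, 2.py trains the Wav2Lip generator.
python 0.py   # preprocess the .mp4 files in data_train/
python 1.py   # train the expert lip-sync (SyncNet) discriminator
python 2.py   # train the Wav2Lip network
```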
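For reference, a concrete inference call might look like the sketch below. The file paths are hypothetical placeholders, and the flags are assumed to match upstream Wav2Lip, since the notes above state the usage is the same.

```bash
# Hypothetical paths - substitute your own checkpoint, face video, and audio file.
python inference.py \
    --checkpoint_path checkpoints/wav2lip384.pth \
    --face input/speaker.mp4 \
    --audio input/speech.wav \
    --outfile results/result_voice.mp4
```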
