SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis
It is shown in demo/demo.mp4
Tested on Ubuntu 20.04, CUDA 12.1, PyTorch 2.4.1
cd SDTalkconda env create --file environment.yml
conda activate sdgtalkgit clone --recurse-submodules git@github.com:xg-chu/diff-gaussian-rasterization.git
pip install ./diff-gaussian-rasterization
rm -rf ./diff-gaussian-rasterizationbash prepare.shpython inference.py --image_dir demo/raw/video/source_imgs --pose_dir demo/track_res --audio_dir demo/raw/audio/raw_audio --resume_path assets/SDGTalk.ptThe result is in the render_results.
Partial codes are from GAGAvatar, TalkingGaussian. Face Parsing is from face-parsing. Thanks for these great projects!