SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis

Demo

It is shown in demo/demo.mp4

Installation

Tested on Ubuntu 20.04, CUDA 12.1, PyTorch 2.4.1

cd SDTalk

Environment

conda env create --file environment.yml
conda activate sdgtalk

Install the 3DGS renderer

git clone --recurse-submodules git@github.com:xg-chu/diff-gaussian-rasterization.git
pip install ./diff-gaussian-rasterization
rm -rf ./diff-gaussian-rasterization

Preparation

bash prepare.sh

Inference

python inference.py --image_dir demo/raw/video/source_imgs --pose_dir demo/track_res --audio_dir demo/raw/audio/raw_audio --resume_path assets/SDGTalk.pt

The result is in the render_results.

Acknowledgement

Partial codes are from GAGAvatar, TalkingGaussian. Face Parsing is from face-parsing. Thanks for these great projects!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
data		data
demo		demo
gridencoder		gridencoder
models		models
utils		utils
README.md		README.md
environment.yml		environment.yml
inference.py		inference.py
pos_encoding.py		pos_encoding.py
prepare.sh		prepare.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis

Demo

Installation

Environment

Install the 3DGS renderer

Preparation

Inference

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis

Demo

Installation

Environment

Install the 3DGS renderer

Preparation

Inference

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages