3D Hand Pose Estimation from RGB Images

Deep learning approach for estimating 3D hand joint positions from single RGB images. Compares a custom CNN architecture against ResNet50 with transfer learning.

Results

Model	MPJPE (mm)	PCK@20mm
Custom CNN	40.83 ± 8.89	19.0%
ResNet50 (ImageNet)	12.92 ± 0.10	82.8%

Transfer learning reduces error by 68.4% and provides significantly more stable training (±0.10mm vs ±8.89mm variance across folds).

Method

Architecture: ResNet50 backbone (pretrained on ImageNet) with custom regression head:

Global average pooling → 2048-dim features
FC layers: 2048 → 512 → 256 → 63 (21 joints × 3 coordinates)
BatchNorm, ReLU, Dropout regularization

Training:

Loss: MSE on root-relative normalized coordinates
Optimizer: AdamW (lr=3e-4, weight_decay=1e-5)
Scheduler: CosineAnnealingLR
Mixed-precision training (FP16)
5-fold cross-validation

Quick Start

git clone https://github.com/Shayank1996/hand-pose-estimation.git
cd hand-pose-estimation
pip install -r requirements.txt

Download FreiHAND dataset and extract to Data/FreiHAND_pub_v2/.

# Train and evaluate
python cross_validation.py    # 5-fold CV on both models
python evaluate.py            # Test set evaluation

Files

File	Description
`baseline_cnn.py`	Custom 4-block CNN architecture
`ablation_study.py`	Side-by-side comparison of CNN vs ResNet50
`hyperparameter_search.py`	Grid search over learning rates and weight decay
`cross_validation.py`	5-fold cross-validation with both models
`evaluate.py`	Final evaluation on held-out test set

Dataset

FreiHAND (Zimmermann et al., ICCV 2019)

32,560 training images (224×224 RGB)
3,960 evaluation images
21 hand keypoints with 3D annotations

Citation

@inproceedings{zimmermann2019freihand,
  title={FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape},
  author={Zimmermann, Christian and Ceylan, Duygu and Yang, Jimei and Russell, Bryan and Argus, Max and Brox, Thomas},
  booktitle={ICCV},
  year={2019}
}

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

3D Hand Pose Estimation from RGB Images

Results

Method

Quick Start

Files

Dataset

Citation

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ablation_study.py		ablation_study.py
baseline_cnn.py		baseline_cnn.py
cross_validation.py		cross_validation.py
evaluate.py		evaluate.py
hyperparameter_search.py		hyperparameter_search.py
requirements.txt		requirements.txt

License

ShayanK1996/hand-pose-estimation

Folders and files

Latest commit

History

Repository files navigation

3D Hand Pose Estimation from RGB Images

Results

Method

Quick Start

Files

Dataset

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages