This repo contains the training code for a model that can generate discriminative image embeddings for the human re-identification task on the Market-1501 dataset.
| Retrieval position | Gallery image | Confidence score (0-1) |
|---|---|---|
| 1st | ![]() | 0.9410 |
| 5th | ![]() | 0.8607 |
| 10th | ![]() | 0.8130 |
| 15th | ![]() | 0.7678 |
The results above were produced with the pre-trained model located at `models/masked-hog-908/best.pth`. This is the default model used for inference and is the one evaluated in Results.
This project has only been tested on Python 3.12.4, so it's the recommended version.
There are three options for setting up your development environment:
- Pull Docker image (quickest way to play with the model)
- Virtual environment (best for training new models)
- Build Docker image
Pull the pre-built image from Docker Hub

```shell
docker pull texashookem/human_reid_mi:inference
```

Feel free to restrict compute resources to the container, but for demonstration purposes the container should be launched in privileged mode to access the needed compute. This will launch the container in interactive mode and initialize a bash shell at startup.

```shell
docker run --privileged -it texashookem/human_reid_mi:inference
```

Perform inference on the default query image

```shell
python inference.py --load_index
```

Perform inference on a list of query images

```shell
python inference.py --load_index --query_paths <list of image paths> --topk 10
```

- Ensure Python 3.12 is installed
- Create a Python virtual environment by running

  ```shell
  python -m venv venv
  ```

- Activate the virtual environment

  ```shell
  source venv/bin/activate
  ```

- Install project dependencies

  ```shell
  pip install -r requirements.txt
  ```

- Download the Market-1501 dataset and rearrange the image directories by running

  ```shell
  python data_prep.py
  ```
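The rearrangement can be illustrated with a short sketch. Market-1501 filenames encode the person identity as the first underscore-separated field (e.g. `0003_c1s6_015971_00.jpg` belongs to identity `0003`), and the grouped layout places each image under a per-identity directory. The helper names below are illustrative, not the actual contents of `data_prep.py`:

```python
import shutil
from pathlib import Path

def identity_of(filename: str) -> str:
    """Extract the person ID from a Market-1501 filename,
    e.g. '0003_c1s6_015971_00.jpg' -> '0003'."""
    return filename.split("_")[0]

def group_by_identity(src_dir: Path, dst_dir: Path) -> None:
    """Copy each image into a per-identity subdirectory of dst_dir,
    producing a layout like dst_dir/0003/0003_c1s6_015971_00.jpg."""
    for img in src_dir.glob("*.jpg"):
        target = dst_dir / identity_of(img.name)
        target.mkdir(parents=True, exist_ok=True)
        shutil.copy2(img, target / img.name)
```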
- Build the Docker image

  ```shell
  docker build -t reid .
  ```

  This will create an image with the target Python version, project dependencies installed, and the dataset already present.

- Run the container in interactive mode

  ```shell
  docker run -it -p 6006:6006 -p 8080:8080 reid
  ```

- Test the default model

  ```shell
  python inference.py --load_index
  ```

This section describes the project's experimentation workflow and is meant for those interested in training additional models or reproducing the results presented. If you wish to use the provided model weights, skip to Inference.
The project uses TensorBoard for tracking experiment metrics and MLflow for tracking experiment configuration.
These tools are recommended when training new models, as they centralize experiment information and aid in experiment analysis. MLflow assigns each experiment a name, which is used as an ID to create and access files within the `/models` and `/index` directories.
To start the TensorBoard and MLflow dashboards, run the following script on the command line.

```shell
./dashboards.sh
```

This script manages the visualization dashboards and will automatically stop all launched services when it is terminated. Once the script is running, TensorBoard is accessible at http://localhost:6006 and MLflow at http://localhost:8080. At this point, you're in a good spot to begin training new models or evaluating the provided models. A recommended experimentation workflow is outlined below.
- Train a new model by running

  ```shell
  python train.py
  ```

  See the script source for available CLI parameters.

- Evaluate the trained model on the gallery set by running

  ```shell
  python eval.py --exp_name <name> --save_index
  ```

- View top-5 retrieval results for a query image by running

  ```shell
  python inference.py --exp_name <name> --query_paths <img path> --load_index
  ```
Using a trained model with the `eval.py` and `inference.py` scripts requires building an index over the gallery to perform image retrieval. This is computationally expensive, so it's recommended to serialize the index for a given model by passing the `--save_index` argument to either script. The index data is written to disk and reused whenever the `--load_index` argument is provided to the above-mentioned scripts.
To perform inference on an arbitrary set of query images, pass the query image paths along with an experiment name as arguments to `inference.py`. The script returns the most similar gallery images to each query (top 5 by default) along with their cosine similarity scores.
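As a rough illustration of what the serialized index holds, retrieval reduces to a cosine-similarity search over L2-normalized gallery embeddings. The sketch below uses NumPy with invented helper names; the project's actual index format and code may differ.

```python
import numpy as np

def build_index(gallery_embeddings: np.ndarray) -> np.ndarray:
    """L2-normalize gallery embeddings so that a plain dot product
    with a normalized query equals cosine similarity."""
    norms = np.linalg.norm(gallery_embeddings, axis=1, keepdims=True)
    return gallery_embeddings / norms

def topk_similar(index: np.ndarray, query: np.ndarray, k: int = 5):
    """Return indices and cosine scores of the k gallery embeddings
    most similar to the query, best first."""
    q = query / np.linalg.norm(query)
    scores = index @ q                 # cosine similarities
    order = np.argsort(-scores)[:k]    # highest scores first
    return order, scores[order]

# Serializing the index mirrors the --save_index / --load_index
# behavior (illustrative file path only):
# np.save("index/<exp_name>.npy", index)
# index = np.load("index/<exp_name>.npy")
```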
Some example use cases are provided below.
To retrieve the top 5 most similar images with the default model for a single image

```shell
python inference.py --query_paths ./data/market-1501-grouped/query/0003/0003_c1s6_015971_00.jpg --load_index
```

To retrieve the top 5 most similar images with the default model for all images of identity 0003

```shell
python inference.py --query_paths ./data/market-1501-grouped/query/0003/* --load_index
```

To retrieve the top 5 most similar images with the default model for all query images

```shell
python inference.py --query_paths ./data/market-1501-grouped/query/*/* --load_index
```

To retrieve the top 20 most similar images with the default model on the default query image

```shell
python inference.py --load_index --topk 20
```

To retrieve the top 5 most similar images with a new model on the default image and save the internal index for evaluation

```shell
python inference.py --exp_name <mlflow exp name> --save_index
```

The following snippets will help you evaluate the model on the Market-1501 dataset by computing these evaluation metrics: R-1, R-5, R-10, mAP.
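For reference, the rank-k and mAP metrics can be sketched as follows, assuming each query yields a ranked list of gallery indices plus a set of ground-truth match indices. The function names are illustrative, not the project's API:

```python
def rank_k(ranked: list[list[int]], matches: list[set[int]], k: int) -> float:
    """Fraction of queries with at least one correct match in the top k
    (R-1, R-5, R-10 correspond to k = 1, 5, 10)."""
    hits = sum(any(g in m for g in r[:k]) for r, m in zip(ranked, matches))
    return hits / len(ranked)

def mean_ap(ranked: list[list[int]], matches: list[set[int]]) -> float:
    """Mean average precision: average, over queries, of the mean
    precision at each position where a correct match appears."""
    aps = []
    for r, m in zip(ranked, matches):
        hits, precisions = 0, []
        for pos, g in enumerate(r, start=1):
            if g in m:
                hits += 1
                precisions.append(hits / pos)
        aps.append(sum(precisions) / max(len(m), 1))
    return sum(aps) / len(aps)
```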
To compute evaluation metrics for the default model

```shell
python eval.py --load_index
```

To compute evaluation metrics for a new model and save the index for inference

```shell
python eval.py --exp_name <exp name in /runs> --save_index
```

I fine-tuned a pre-trained DenseNet model with a separate classification layer, without any data augmentation or learning rate scheduling. I used SGD to optimize the model parameters and trained for a total of 20 epochs on an NVIDIA GeForce RTX 4060 Ti. The table below compares my model's performance (`/models/masked-hog-908/best.pth`) to competitive ReID models.
| Model | Learning rate | Epochs | Batch size | Rank-1 Accuracy | Rank-5 Accuracy | Rank-10 Accuracy | mAP |
|---|---|---|---|---|---|---|---|
| Mine | 0.02 | 20 | 16 | 84.68% | 66.89% | 47.77% | 65.34% |
| stReID | 0.10 | 60 | 32 | 97.2% | 99.3% | 99.5% | 86.7% |
| PDF | 0.01 | n/a | 16 | 84.14% | n/a | n/a | 64.41% |
| TransReID | 0.008 | n/a | 64 | 95.2% | n/a | n/a | 89.5% |
Although my model is not on par with newer models like stReID or TransReID, it achieved decent retrieval performance within 10 epochs of training and became competitive with PDF after 20 epochs.




