Visual Analytics and Transformers for Autonomous Generalized Evaluation
VANTAGE-X is a vehicle damage analysis project that combines fine-tuned YOLOv11-seg detection with optional Qwen2.5-VL reasoning on cropped damage regions. The goal is to provide a practical workflow for detecting visible vehicle damage, segmenting affected regions, and generating concise structured assessments.
VANTAGE-X provides:
- YOLOv11-seg inference for damage detection and instance masks
- Optional Qwen2.5-VL crop-level reasoning for severity, location, and short descriptions
- A CLI for single-image runs, batch processing, evaluation, and the Gradio app
- Data conversion utilities for VIA and COCO workflows
- Training and evaluation scripts for the VehiDE dataset
The current runtime pipeline is:
- Load an image.
- Run YOLOv11-seg to detect damage regions and generate masks.
- Optionally run Qwen2.5-VL on each cropped region.
- Save annotated images plus text, JSON, and Markdown reports.
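The control flow above can be sketched as a small orchestrator. The `Detection` dataclass and the `detect`/`describe` callables below are illustrative stand-ins for the real YOLOv11-seg and Qwen2.5-VL wrappers, not the project's actual API:

```python
from dataclasses import dataclass

@dataclass
class Detection:
    label: str
    box: tuple            # (x1, y1, x2, y2) crop region
    description: str = ""

def run_pipeline(image, detect, describe=None):
    """Detect damage regions, then optionally describe each cropped region.

    `detect` and `describe` are stand-ins for the YOLO and VLM wrappers."""
    detections = detect(image)
    if describe is not None:
        for det in detections:
            det.description = describe(image, det.box)
    return detections

# Stub detector and VLM just to show the control flow:
fake_detect = lambda img: [Detection("scratch", (10, 10, 50, 50))]
fake_describe = lambda img, box: "minor scratch on panel"

results = run_pipeline("car.jpg", fake_detect, fake_describe)
print(results[0].description)  # minor scratch on panel
```

Skipping the `describe` argument mirrors the `--no-vlm` path: detections come back with empty descriptions.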
```
.
├── app/                # Gradio UI
├── configs/            # Runtime configuration
├── data/
│   ├── data/           # Dataset loader and conversion scripts
│   └── yolo/           # YOLO dataset config template
├── evaluate/           # Evaluation utilities
├── models/             # Detector, VLM, and shared datatypes
├── pipeline/           # End-to-end damage pipeline
├── train/              # Training and COCO->YOLO conversion scripts
├── utils/              # Reporting and visualization helpers
├── main.py             # Main CLI entry point
├── requirements.txt
└── README.md
```
Large local assets are intentionally ignored so the repository stays publishable:
- virtual environments
- raw dataset image folders
- COCO annotation JSON exports
- converted YOLO training data
- model weights and checkpoints
- generated training and evaluation outputs
If you need to reproduce training or inference, prepare the dataset locally, generate the YOLO dataset files, and place model weights on your machine.
```
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

Recommended:
- Python 3.10+
- CUDA-capable GPU for faster inference and training
- extra VRAM if you enable Qwen2.5-VL together with YOLO
Main settings live in configs/config.yaml.
Important values:
- yolo.weights: path to the trained YOLO checkpoint
- yolo.conf_threshold: default inference confidence threshold
- yolo.iou_threshold: detector NMS IoU threshold
- yolo.imgsz: inference image size
- qwen_vlm.run_vlm: enable or disable crop-level VLM analysis
- pipeline.min_mask_fraction: discard tiny masks
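For reference, a config fragment with these keys might look like the following; the threshold and fraction values here are illustrative, not the repository's actual defaults:

```yaml
# Illustrative configs/config.yaml fragment; exact layout and values may differ.
yolo:
  weights: results/training/.../best.pt
  conf_threshold: 0.25      # example value
  iou_threshold: 0.45       # example value
  imgsz: 640
qwen_vlm:
  run_vlm: true
pipeline:
  min_mask_fraction: 0.001  # example value
```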
The default config points to a trained checkpoint under results/training/.../best.pt, but that artifact is ignored by Git. On a fresh clone, you will need to train a model or supply weights locally.
```
python main.py run --image path/to/car.jpg --output results/
python main.py batch --folder path/to/images --output results/batch/
python main.py run --image path/to/car.jpg --no-vlm
python main.py app --port 7860
python main.py evaluate --data-root data/data --split test
```

```python
from pipeline.damage_pipeline import DamagePipeline

pipeline = DamagePipeline(run_vlm=True)
result = pipeline.run("path/to/car.jpg")
print(result.to_report())
result.save_visualisation("output/car_result.jpg")
```

A successful run can generate:
- *_annotated.jpg
- *_report.txt
- *_report.json
- *_report.md
- batch_summary.json for folder runs
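To show the general shape of report generation, here is a minimal sketch of assembling a per-image Markdown report; the field names (label, severity, location) are hypothetical and not necessarily the project's actual schema:

```python
def to_markdown_report(image_name, detections):
    """Assemble a simple Markdown damage report.

    `detections` is a list of dicts with hypothetical keys
    label/severity/location; the real report fields may differ."""
    lines = [f"# Damage report: {image_name}", ""]
    for i, det in enumerate(detections, 1):
        lines.append(f"{i}. **{det['label']}** ({det['severity']}) at {det['location']}")
    return "\n".join(lines)

report = to_markdown_report("car.jpg", [
    {"label": "dent", "severity": "moderate", "location": "front-left door"},
])
print(report.splitlines()[2])  # 1. **dent** (moderate) at front-left door
```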
```
python data/data/convert_via_to_coco.py \
    --via-json data/data/Train_annotations.json \
    --images-dir data/data/train \
    --output data/data/Train_annotations_coco.json
```

If you run the script without arguments, it converts both the default train and test VIA files in data/data/.
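The core of a VIA-to-COCO conversion is mapping each polygon region to a COCO annotation dict. A simplified sketch, assuming VIA's standard polygon shape attributes (all_points_x/all_points_y) and approximating area by the bounding box for brevity:

```python
def via_region_to_coco(region, ann_id, image_id, category_id):
    """Convert one VIA polygon region into a COCO-style annotation dict.

    Simplified: area is approximated by the bounding-box area, whereas a
    full converter would compute the true polygon area."""
    xs = region["shape_attributes"]["all_points_x"]
    ys = region["shape_attributes"]["all_points_y"]
    seg = [c for xy in zip(xs, ys) for c in xy]  # interleave to x1,y1,x2,y2,...
    x, y = min(xs), min(ys)
    w, h = max(xs) - x, max(ys) - y
    return {
        "id": ann_id,
        "image_id": image_id,
        "category_id": category_id,
        "segmentation": [seg],
        "bbox": [x, y, w, h],
        "area": w * h,  # bounding-box area as a stand-in
        "iscrowd": 0,
    }

region = {"shape_attributes": {"name": "polygon",
                               "all_points_x": [10, 60, 60, 10],
                               "all_points_y": [20, 20, 50, 50]}}
ann = via_region_to_coco(region, ann_id=1, image_id=7, category_id=2)
print(ann["bbox"])  # [10, 20, 50, 30]
```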
```
python train/convert_coco_to_yolo.py --data-root data/data --out-dir data/yolo
```

This generates YOLO labels and updates data/yolo/dataset.yaml.
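The per-annotation step of such a conversion is normalizing a COCO polygon into a YOLO-seg label line (class id followed by coordinates scaled to [0, 1]). A minimal sketch, assuming a flat [x1, y1, x2, y2, ...] polygon:

```python
def coco_poly_to_yolo_seg(poly, img_w, img_h, class_id):
    """Turn a flat COCO polygon [x1, y1, x2, y2, ...] into a YOLO-seg
    label line: class id followed by coordinates normalized to [0, 1]."""
    coords = []
    for i in range(0, len(poly), 2):
        coords.append(poly[i] / img_w)       # x normalized by image width
        coords.append(poly[i + 1] / img_h)   # y normalized by image height
    return " ".join([str(class_id)] + [f"{c:.6f}" for c in coords])

line = coco_poly_to_yolo_seg([100, 50, 300, 50, 300, 200],
                             img_w=400, img_h=400, class_id=0)
print(line)  # 0 0.250000 0.125000 0.750000 0.125000 0.750000 0.500000
```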
```
python train/train_yolo.py --epochs 100 --batch 16 --imgsz 640
```

Useful options:
- --resume
- --device cuda:0
- --project results/training
- --name yolo11seg_damage
The evaluation script reports:
- mAP
- per_class_ap
- mean_mask_iou
- detection_accuracy
- num_evaluated
- num_masks_evaluated
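mean_mask_iou averages the per-mask intersection-over-union between predicted and ground-truth masks. A dependency-free sketch of the per-mask computation, with masks simplified to nested lists of 0/1 (the real implementation would operate on array masks):

```python
def mask_iou(pred, gt):
    """IoU between two same-shape binary masks given as nested lists of 0/1."""
    inter = union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    return inter / union if union else 0.0

pred = [[1, 1, 0, 0],
        [1, 1, 0, 0],
        [0, 0, 0, 0],
        [0, 0, 0, 0]]   # 4 predicted pixels
gt   = [[0, 0, 0, 0],
        [1, 1, 0, 0],
        [1, 1, 0, 0],
        [0, 0, 0, 0]]   # 4 ground-truth pixels, 2 overlapping
print(round(mask_iou(pred, gt), 3))  # 0.333 (2 shared / 6 total pixels)
```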
The detector covers seven damage classes:
- dent
- scratch
- broken glass
- lost parts
- punctured
- torn
- broken lights