This project fine-tunes a BLIP model to generate better inpainting captions.
The preprint of the paper is available at this link.
Install dependencies (preferably in a virtual environment):
pip install torch torchvision transformers pandas tqdm pillow
Optional (for GPU support):
pip install torch --index-url https://download.pytorch.org/whl/cu118

Model architecture:
Base: BLIP, StableDiffusion-Inpaint
Head: MLP regressor that outputs 3 values: [SSIM, PSNR, CLIP Score]
Loss: Weighted MSE / custom weighted difference loss
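The head and loss above could look like this minimal PyTorch sketch; the embedding size, hidden width, and per-metric loss weights are illustrative assumptions, not the project's actual values:

```python
import torch
import torch.nn as nn

class MetricsHead(nn.Module):
    """Sketch of the MLP regressor head: maps a BLIP embedding to three
    predicted metrics [SSIM, PSNR, CLIP Score]. Dimensions are assumptions."""
    def __init__(self, embed_dim=768, hidden_dim=256):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(embed_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 3),  # [SSIM, PSNR, CLIP Score]
        )

    def forward(self, x):
        return self.mlp(x)

def weighted_mse(pred, target, weights=(1.0, 0.1, 1.0)):
    """Weighted MSE over the three metrics; the weights here are illustrative
    (e.g., down-weighting PSNR, which lives on a larger numeric scale)."""
    w = torch.tensor(weights, device=pred.device)
    return ((pred - target) ** 2 * w).mean()
```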
- Place your data: put images in a directory like test2014/
Download a pretrained BLIP model (e.g., from Hugging Face) into a blip/ folder
- Run main
python main.py
The script generates a CSV file of all the losses; this file is then used to train the MLP head and fine-tune BLIP.
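As a hedged illustration of how that CSV might be consumed for training, the snippet below wraps it in a PyTorch `Dataset`; the column names (`image`, `ssim`, `psnr`, `clip_score`) are assumptions about the schema, not the script's actual output:

```python
import pandas as pd
import torch
from torch.utils.data import Dataset

class MetricsCSVDataset(Dataset):
    """Sketch: exposes the CSV produced by main.py as (image name, metrics)
    pairs so the MLP head can be trained against the measured metrics."""
    def __init__(self, csv_path):
        self.df = pd.read_csv(csv_path)

    def __len__(self):
        return len(self.df)

    def __getitem__(self, idx):
        row = self.df.iloc[idx]
        target = torch.tensor(
            [row["ssim"], row["psnr"], row["clip_score"]],
            dtype=torch.float32,
        )
        return row["image"], target
```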
- Run training
python finetune_blip.py
The script will:
Train the MLP head for epochs_mlp epochs
Fine-tune select layers of BLIP for epochs_blip epochs
Save the final model to:
blip-v2/fine_tuned_blip_with_metrics.pth
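The two-phase procedure above can be sketched as follows; which BLIP layers get unfrozen (here, anything named "decoder"), the optimizers, and the learning rates are assumptions, not the script's actual settings:

```python
import torch
import torch.nn as nn

def finetune(blip_model, head, loader, epochs_mlp=2, epochs_blip=1,
             out_path="blip-v2/fine_tuned_blip_with_metrics.pth"):
    """Sketch of finetune_blip.py: train the MLP head first, then
    jointly fine-tune a subset of BLIP layers, then save one checkpoint."""
    # Phase 1: freeze BLIP entirely and train only the MLP head.
    for p in blip_model.parameters():
        p.requires_grad = False
    opt = torch.optim.Adam(head.parameters(), lr=1e-3)
    for _ in range(epochs_mlp):
        for feats, targets in loader:
            opt.zero_grad()
            loss = nn.functional.mse_loss(head(blip_model(feats)), targets)
            loss.backward()
            opt.step()

    # Phase 2: unfreeze select BLIP layers (name filter is an assumption)
    # and train them jointly with the head at a lower learning rate.
    for name, p in blip_model.named_parameters():
        if "decoder" in name:
            p.requires_grad = True
    params = [p for p in list(blip_model.parameters()) + list(head.parameters())
              if p.requires_grad]
    opt = torch.optim.Adam(params, lr=1e-5)
    for _ in range(epochs_blip):
        for feats, targets in loader:
            opt.zero_grad()
            loss = nn.functional.mse_loss(head(blip_model(feats)), targets)
            loss.backward()
            opt.step()

    # Save both components in a single checkpoint.
    torch.save({"blip": blip_model.state_dict(), "head": head.state_dict()},
               out_path)
```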
- Run main.py again with the updated BLIP model
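For the second run, the saved weights need to be loaded back first; a minimal sketch, assuming the checkpoint stores "blip" and "head" state dicts (that layout is an assumption):

```python
import torch

def load_finetuned(blip_model, head,
                   path="blip-v2/fine_tuned_blip_with_metrics.pth"):
    """Sketch: restore the fine-tuned weights before rerunning main.py.
    The checkpoint keys ("blip", "head") are assumptions about its layout."""
    ckpt = torch.load(path, map_location="cpu")
    blip_model.load_state_dict(ckpt["blip"])
    head.load_state_dict(ckpt["head"])
    blip_model.eval()  # inference mode for caption generation
    head.eval()
    return blip_model, head
```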