Visual-Prompting (MAE+VQVAE reconstruction)
The required datasets come in three parts:
- CVF datasets
```
${CVF}/
    train/
    val/
```
- ImageNet datasets
```
${ImageNet}/
    train/
    val/
```
- Supervised vision datasets
```
$Painter_ROOT/datasets/
    nyu_depth_v2/
        sync/
        official_splits/
        nyu_depth_v2_labeled.mat
    datasets/nyu_depth_v2/
        nyuv2_sync_image_depth.json             # generated
        nyuv2_test_image_depth.json             # generated
    ade20k/
        images/
        annotations/
        annotations_detectron2/                 # generated
        annotations_with_color/                 # generated
        ade20k_training_image_semantic.json     # generated
        ade20k_validation_image_semantic.json   # generated
    ADEChallengeData2016/                       # sym-link to $Painter_ROOT/datasets/ade20k
    coco/
        train2017/
        val2017/
        annotations/
            instances_train2017.json
            instances_val2017.json
            person_keypoints_val2017.json
            panoptic_train2017.json
            panoptic_val2017.json
        panoptic_train2017/
        panoptic_val2017/
        panoptic_semseg_val2017/                # generated
    panoptic_val2017/                           # sym-link to $Painter_ROOT/datasets/coco/annotations/panoptic_val2017
    pano_sem_seg/                               # generated
        panoptic_segm_train2017_with_color
        panoptic_segm_val2017_with_color
        coco_train2017_image_panoptic_sem_seg.json
        coco_val2017_image_panoptic_sem_seg.json
    pano_ca_inst/                               # generated
        train_aug0/
        train_aug1/
        ...
        train_aug29/
        train_org/
        train_flip/
        val_org/
        coco_train_image_panoptic_inst.json
        coco_val_image_panoptic_inst.json
    coco_pose/
        person_detection_results/
            COCO_val2017_detections_AP_H_56_person.json
        data_pair/                              # generated
            train_256x192_aug0/
            train_256x192_aug1/
            ...
            train_256x192_aug19/
            val_256x192/
            test_256x192/
            test_256x192_flip/
        coco_pose_256x192_train.json            # generated
        coco_pose_256x192_val.json              # generated
    derain/
        train/
            input/
            target/
        test/
            Rain100H/
            Rain100L/
            Test100/
            Test1200/
            Test2800/
        derain_train.json
        derain_test_rain100h.json
    denoise/
        SIDD_Medium_Srgb/
            train/
            val/
        denoise_ssid_train.json                 # generated
        denoise_ssid_val.json                   # generated
    light_enhance/
        our485/
            low/
            high/
        eval15/
            low/
            high/
        enhance_lol_train.json                  # generated
        enhance_lol_val.json                    # generated
```
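The two sym-links in the tree above can be created with a short script. A minimal sketch in Python, assuming a PAINTER_ROOT environment variable points at your checkout (the link targets are taken from the comments in the tree):

```python
import os

# Assumed environment variable pointing at $Painter_ROOT; adjust as needed.
root = os.environ.get("PAINTER_ROOT", ".")
ds = os.path.join(root, "datasets")

# ADEChallengeData2016 -> $Painter_ROOT/datasets/ade20k
os.symlink(os.path.join(ds, "ade20k"),
           os.path.join(ds, "ADEChallengeData2016"))

# panoptic_val2017 -> $Painter_ROOT/datasets/coco/annotations/panoptic_val2017
os.symlink(os.path.join(ds, "coco", "annotations", "panoptic_val2017"),
           os.path.join(ds, "panoptic_val2017"))
```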
Please modify yamls/base.yaml to add or remove datasets:
```yaml
datasets:
  cvf:
    image_path: '/mnt1/msranlpintern/wuxun/SemDeDup/cili/scratch/wuxun/yutong/datatsets/CVF_debug'
  imageNet:
    image_path: '/mnt1/msranlpintern/wuxun/SemDeDup/cili/scratch/wuxun/yutong/datatsets/ImageNet'
  append_supervised:
    root_path: '/mnt1/msranlpintern/wuxun/SemDeDup/cili/scratch/wuxun/yutong/Painter/Painter/datasets'
    json_path:
      deraining:
        - datasets: 'MRNet'
          train_json: derain/derain_train.json
          val_json: derain/derain_test_rain100h.json
      colorization:
        - datasets: 'ImageNet'
          train_json: colorization/colorization_ImageNet_train.json
          val_json: colorization/colorization_ImageNet_val.json
      light_enhance:
        - datasets: 'LOL'
          train_json: light_enhance/enhance_lol_train.json
          val_json: light_enhance/enhance_lol_val.json
      depth_estimation:
        - datasets: 'nyu_depth_v2'
          train_json: nyu_depth_v2/nyuv2_sync_image_depth.json
          val_json: nyu_depth_v2/nyuv2_test_image_depth.json
```
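For reference, the config above can be read with PyYAML. A minimal sketch that lists the supervised tasks and their JSON files (this mirrors the structure above; it is not the project's actual loader):

```python
import os
import yaml  # PyYAML

with open("yamls/base.yaml") as f:
    cfg = yaml.safe_load(f)

supervised = cfg["datasets"]["append_supervised"]
root = supervised["root_path"]
for task, entries in supervised["json_path"].items():
    for entry in entries:
        print(f"{task} ({entry['datasets']}):")
        print("  train:", os.path.join(root, entry["train_json"]))
        print("  val:  ", os.path.join(root, entry["val_json"]))
```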
Training and evaluation:
```
bash scripts/train.sh
bash scripts/evaluate_*.sh
```
- jax_to_torch_for_mae.py: converts the JAX MAE checkpoint to a PyTorch one (see the sketch after this list).
- jax_to_torch_for_vqvae.py: converts the JAX VQVAE checkpoint to a PyTorch one.
- torch_vqvae_model.py: torch-version VQVAE inference model.
- mae_vqvae_recon_visualize.ipynb: torch-version VQVAE and MAE+VQVAE image reconstruction visualization.
- ./mae/: modified copy of the official torch-version MAE model.
- ./figs/: figures used for reconstruction.
- ./torch_ckpts/: put the converted torch-version MAE model 0613_ckpt_torch.pth (Google Drive link) and VQVAE model xh_ckpt.pth (Google Drive link) in this folder. Specifically, 0613_ckpt_torch.pth and xh_ckpt.pth are converted from the JAX-version ckpt_xh.npy and checkpoint.zip, respectively.
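The conversion scripts follow the usual JAX-to-PyTorch recipe: load the JAX parameters as NumPy arrays, rename and transpose them to match the PyTorch module, and save a state dict. A minimal sketch under that assumption (the checkpoint layout and key mapping here are illustrative, not the scripts' actual ones):

```python
import numpy as np
import torch

# Assumption: ckpt_xh.npy stores a flat {name: array} parameter dict.
jax_params = np.load("ckpt_xh.npy", allow_pickle=True).item()

state_dict = {}
for name, value in jax_params.items():
    tensor = torch.from_numpy(np.asarray(value))
    # JAX/Flax dense kernels are (in, out); torch.nn.Linear stores (out, in).
    if name.endswith("kernel") and tensor.ndim == 2:
        tensor = tensor.t()
    # Hypothetical renaming: 'encoder/layer_0/kernel' -> 'encoder.layer_0.weight'.
    state_dict[name.replace("/", ".").replace("kernel", "weight")] = tensor

torch.save(state_dict, "torch_ckpts/0613_ckpt_torch.pth")
```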