pickinsta

pickinsta turns a folder of event photos into ranked, Instagram-ready portrait selections. It deduplicates burst shots, scores technical quality, adds a vision pass with CLIP, Claude, or Ollama, then generates smart crops, reports, and an HTML gallery.

What It Does

Resizes source images into a work area so later stages are faster and consistent.
Collapses near-duplicate burst sequences to the best representative image.
Scores technical quality with OpenCV-based metrics such as sharpness, lighting, composition, and clutter.
Applies a vision scorer:
- clip: local, free, zero API cost
- claude: API-based, strongest quality/ranking
- ollama: self-hosted vision scoring
Creates three ranked output variants per selected image:
- NN_cropped_<name>.jpg
- NN_hd_<name>.jpg
- NN_full_<name>.<ext>
Writes selection_report.json, selection_report.md, and index.html.

Quick Start

python3 -m venv .venv
source .venv/bin/activate
make install-dev

mkdir -p ./input ./selected
pickinsta ./input --output ./selected --top 10 --scorer clip

For the default full local setup, make install-dev installs dev tooling and all scorer extras.

Installation

Minimal package install

python -m pip install -e .

This gives the core package plus technical scoring and image processing dependencies.

Install only the scorer extras you need

# CLIP scorer
python -m pip install -e ".[clip]"

# Claude scorer
python -m pip install -e ".[claude]"

# YOLO support for smart crop and richer scorer context
python -m pip install -e ".[yolo]"

# Full runtime without dev tools
python -m pip install -e ".[clip,claude,yolo]"

Optional dependency groups from pyproject.toml:

dev: pytest, ruff, pre-commit
clip: transformers, torch
claude: anthropic, tqdm
yolo: ultralytics

If ultralytics is missing, smart crop falls back to non-YOLO heuristics.

Configuration

Copy the example env file:

cp .env.example .env

Common variables:

# Required for --scorer claude
ANTHROPIC_API_KEY=your_key_here

# Optional Claude model override
ANTHROPIC_MODEL=claude-sonnet-4-6

# Optional CLIP / Hugging Face token
HF_TOKEN=hf_xxx_your_token

# Optional prompt/account context
PICKINSTA_ACCOUNT_CONTEXT="motorcycle enthusiast account"

# Optional Ollama endpoint and model
PICKINSTA_OLLAMA_BASE_URL=http://127.0.0.1:11434
PICKINSTA_OLLAMA_MODEL=qwen2.5vl:7b
PICKINSTA_OLLAMA_TIMEOUT_SEC=300
PICKINSTA_OLLAMA_MAX_IMAGE_EDGE=1024
PICKINSTA_OLLAMA_JPEG_QUALITY=80
PICKINSTA_OLLAMA_KEEP_ALIVE=10m
PICKINSTA_OLLAMA_USE_YOLO_CONTEXT=false
PICKINSTA_OLLAMA_CONCURRENCY=2
PICKINSTA_OLLAMA_MAX_RETRIES=2
PICKINSTA_OLLAMA_RETRY_BACKOFF_SEC=0.75
PICKINSTA_OLLAMA_CIRCUIT_BREAKER_ERRORS=6

# Optional custom YOLO weights
PICKINSTA_YOLO_MODEL=/absolute/path/to/model.pt

Environment resolution behavior:

Claude API key and model settings are resolved from environment, then cwd/.env, then <input>/.env.
HF_TOKEN follows the same search order.
Ollama settings follow the same search order and default to http://127.0.0.1:11434 with model qwen2.5vl:7b.
CLAUDE_MODEL is accepted as a fallback alias for ANTHROPIC_MODEL.

Usage

Common commands

# Local/free scoring
pickinsta ./input --output ./selected --top 10 --scorer clip

# Claude scoring for all technically qualified images
pickinsta ./input --output ./selected --scorer claude --all

# Claude scoring on pre-cropped 4:5 candidates
pickinsta ./input --output ./selected --scorer claude --all --claude-crop-first

# Ollama scoring against a local or remote server
pickinsta ./input --output ./selected --scorer ollama --all

# Use a separate work folder
pickinsta ./input --output ./selected --work ./work --scorer clip

# Re-run vision scoring without using cached scorer results
pickinsta ./input --output ./selected --scorer claude --rescore

# Deduplicate only; no ranking or vision scoring
pickinsta ./input --output ./deduped --dedup-only

Help

pickinsta -h

CLI Surface

Current flags implemented in src/pickinsta/ig_image_selector.py:

input: source folder of event photos
--output, -o: output folder, default selected
--work, -w: intermediate work folder, default <input>_work
--top, -n: number of ranked outputs, default 10
--scorer, -s: clip, claude, or ollama
--vision-pct: fraction of technically scored images passed to vision scoring, default 0.5
--all: score all Stage 2 candidates
--claude-model: override Claude model
--claude-crop-first: pre-crop to 1080x1440 before Claude scoring
--rescore: ignore cached vision results
--dedup-only: emit unique-image outputs after dedup without ranking

Pipeline Overview

High-level stages:

Stage 0: resize inputs into a work folder with EXIF-safe handling.
Stage 1: deduplicate bursts with perceptual hash, histogram checks, temporal grouping, and feature verification.
Stage 2: compute technical quality metrics.
Stage 3: run the selected vision scorer on the top technical candidates or all candidates.
Stage 4: generate 1080x1440 smart crops, plus HD and full variants.
Finalize ranked reports and an HTML gallery.

Score blend:

final_score = 0.3 * technical_composite + 0.7 * vision_normalized

Outputs

For each selected image, pickinsta writes:

NN_cropped_<stem>.jpg: ranked 1080x1440 portrait output
NN_hd_<stem>.jpg: resized work-copy version
NN_full_<stem>.<ext>: original source file copy

Per-run artifacts:

selection_report.json: machine-readable summary of selected outputs
selection_report.md: human-readable report including analyzed image scores
index.html: browsable local gallery

Crop uncertainty is tracked in the reports. When the crop pipeline falls back or detects a risky crop, those reasons are preserved in report metadata and surfaced in the gallery.

Caching

Claude vision responses are cached beside the original input image as <filename>.pickinsta.json.
Technical scoring is cached in the work folder as <filename>.<ext>.techscore.json.
--rescore bypasses cached vision results.
Changing prompt context or model selection can invalidate or bypass prior cache reuse, depending on scorer path and settings.

Scorer Notes

CLIP

Runs locally.
Requires first-run model downloads from Hugging Face.
HF_TOKEN is optional but helps avoid rate limits and warnings.

Claude

Requires ANTHROPIC_API_KEY.
Default model is claude-haiku-4-5-20251001 unless overridden.
--claude-crop-first is useful when final 4:5 crop quality should affect ranking more strongly.

Ollama

Requires a reachable Ollama server and a pulled vision model.
Defaults are tuned for remote inference rather than maximum local parallelism.
See docs/ollama-server-setup.md for setup and tuning guidance.

Benchmarks

Manual benchmark scripts live in tests/benchmarks/.

Benchmark multiple Ollama models:

.venv/bin/python tests/benchmarks/benchmark_ollama_models.py \
  --input ./input \
  --all \
  --runs 3 \
  --models qwen3-vl:8b blaifa/InternVL3_5:8b blaifa/InternVL3_5:4B openbmb/minicpm-v4.5:8b \
  --report docs/ollama-model-speed-benchmark-report.md

Benchmark Ollama with and without YOLO context:

.venv/bin/python tests/benchmarks/benchmark_ollama_yolo.py \
  --input ./input \
  --runs 2 \
  --all \
  --report docs/ollama-yolo-benchmark-report.md

Development

make lint
make test
make check
make pre-commit-install

See tests/README.md for test coverage notes.

Documentation

Primary docs live under docs/:

docs/README.md: documentation index
docs/composition-rules.md: scoring and crop rubric
docs/troubleshooting.md: install/runtime troubleshooting
docs/ollama-server-setup.md: self-hosted Ollama setup and tuning

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.claude		.claude
.github		.github
debug		debug
docs		docs
input_work_bench		input_work_bench
scripts		scripts
src/pickinsta		src/pickinsta
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pickinsta

What It Does

Quick Start

Installation

Recommended

Minimal package install

Install only the scorer extras you need

Configuration

Usage

Common commands

Help

CLI Surface

Pipeline Overview

Outputs

Caching

Scorer Notes

CLIP

Claude

Ollama

Benchmarks

Development

Documentation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pickinsta

What It Does

Quick Start

Installation

Recommended

Minimal package install

Install only the scorer extras you need

Configuration

Usage

Common commands

Help

CLI Surface

Pipeline Overview

Outputs

Caching

Scorer Notes

CLIP

Claude

Ollama

Benchmarks

Development

Documentation

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages