Skip to content

Add Craigslist mirror#5

Draft
FlowRays wants to merge 1 commit into
aiming-lab:mainfrom
FlowRays:feat/craigslist
Draft

Add Craigslist mirror#5
FlowRays wants to merge 1 commit into
aiming-lab:mainfrom
FlowRays:feat/craigslist

Conversation

@FlowRays
Copy link
Copy Markdown

@FlowRays FlowRays commented May 13, 2026

Summary

Adds a Craigslist mirror to WebHarbor on port 40015.

Real site mirrored: https://www.craigslist.org/

The mirror covers Craigslist-style category browsing, search, listing detail pages, posting, replies, login, account edits, saved listings, saved searches, hidden listings, and messages. It includes 20 benchmark tasks across for-sale, housing, jobs, services, community pages, and account workflows.

The search UI was checked against live Craigslist for list, thumb, and gallery views, including icon font usage, card sizing, and image/no-image states.

Seed Data

categories: 34
listings: 179
users: 4
messages: 2
saved_listings: 3
saved_searches: 2
tasks: 20

Listings by group:

community: 35
for_sale: 48
housing: 29
jobs: 34
services: 33

Assets

HF asset PR: https://huggingface.co/datasets/ChilleD/WebHarbor/discussions/5

It uploads craigslist.tar.gz with the seed DB and image assets.

craigslist.tar.gz: 638961 bytes
md5: b3209befab877f7b749d58669821055c

This GitHub PR is still draft while the HF asset PR is pending. After that PR is merged, .assets-revision should be pinned to the HF merge commit and this PR can be marked ready.

Verification

python3 -m py_compile sites/craigslist/app.py sites/craigslist/seed_data.py
./scripts/build.sh webharbor:dev

GET /health on a clean container: ok=true, all 16 sites alive
Site roots: 41000:200 ... 41015:200

POST /reset/craigslist
{"pid":141,"ready":true,"site":"craigslist"}

edcfc946b3e0a082ded90df615396e63  /opt/WebSyn/craigslist/instance/craigslist.db
edcfc946b3e0a082ded90df615396e63  /opt/WebSyn/craigslist/instance_seed/craigslist.db

Also checked in Playwright at 2048x1090: header, query bar, toolbar, view selector, gallery cards, and image areas line up with the live Craigslist measurements; icomoon.css loads and there were no broken images.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant