Add Craigslist mirror #5
Draft
FlowRays wants to merge 1 commit into
Summary
Adds a Craigslist mirror to WebHarbor on port 40015.
Real site mirrored: https://www.craigslist.org/
The mirror covers Craigslist-style category browsing, search, listing detail pages, posting, replies, login, account edits, saved listings, saved searches, hidden listings, and messages. It includes 20 benchmark tasks across for-sale, housing, jobs, services, community pages, and account workflows.
The search UI was checked against live Craigslist for list, thumb, and gallery views, including icon font usage, card sizing, and image/no-image states.
Seed Data
Listings by group:
Assets
HF asset PR: https://huggingface.co/datasets/ChilleD/WebHarbor/discussions/5
It uploads craigslist.tar.gz with the seed DB and image assets.
This GitHub PR is still draft while the HF asset PR is pending. After that PR is merged, .assets-revision should be pinned to the HF merge commit and this PR can be marked ready.
Verification
Also checked in Playwright at 2048x1090: header, query bar, toolbar, view selector, gallery cards, and image areas line up with the live Craigslist measurements; icomoon.css loads and there were no broken images.
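For reviewers who want to repeat the last two checks, here is a minimal sketch of how the icomoon.css and broken-image checks might be scripted with Playwright for Python. It is not the script used for this PR. The host and path in MIRROR_URL are assumptions (only the port comes from the summary), and the helper names find_broken_images, stylesheet_loaded, and check_page are hypothetical.

```python
from __future__ import annotations

from typing import Iterable

# Assumed values: the port is from the PR summary; host and path are guesses.
MIRROR_URL = "http://localhost:40015/"
VIEWPORT = {"width": 2048, "height": 1090}


def find_broken_images(images: Iterable[dict]) -> list[str]:
    """Return the src of every image that finished loading with zero width."""
    return [
        img["src"]
        for img in images
        if img.get("complete") and img.get("naturalWidth", 0) == 0
    ]


def stylesheet_loaded(
    responses: Iterable[tuple[str, bool]], name: str = "icomoon.css"
) -> bool:
    """True if a successful response URL (query string ignored) ends with name."""
    return any(ok and url.split("?")[0].endswith(name) for url, ok in responses)


def check_page(url: str = MIRROR_URL) -> dict:
    """Load the page in Chromium at the PR's viewport and collect both signals."""
    # Imported here so the helpers above stay usable without Playwright installed.
    from playwright.sync_api import sync_playwright

    responses: list[tuple[str, bool]] = []
    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page(viewport=VIEWPORT)
        # Record every response so the stylesheet request can be checked later.
        page.on("response", lambda r: responses.append((r.url, r.ok)))
        page.goto(url, wait_until="networkidle")
        images = page.eval_on_selector_all(
            "img",
            "els => els.map(e => ({src: e.currentSrc || e.src,"
            " complete: e.complete, naturalWidth: e.naturalWidth}))",
        )
        browser.close()
    return {
        "icomoon_loaded": stylesheet_loaded(responses),
        "broken_images": find_broken_images(images),
    }
```

Calling check_page() against a running mirror should return a dict with icomoon_loaded set to True and an empty broken_images list if the page matches the description above. Images that have not finished loading are not counted as broken, so lazy-loaded cards may need a scroll before the check.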