
JUMP Pilot Image Data Downloader and Loader #1

Merged
MattsonCam merged 15 commits into main from pilot_download_images on Apr 29, 2026
Conversation

@MattsonCam
Member

This PR adds support for downloading image data and loading image metadata from the JUMP pilot dataset. Other datasets may be added to this repo in the future. The package will likely be published to PyPI later, so release tagging may be added as well.


Comment thread README.md
Comment thread .github/workflows/test.yml Outdated
Comment thread src/jump_image_datasets/jump_pilot/__init__.py
Comment thread src/jump_image_datasets/jump_pilot/image_metadata.py
Comment thread pyproject.toml
Comment thread pyproject.toml Outdated
Comment thread pyproject.toml Outdated
Comment thread src/jump_image_datasets/__init__.py
@@ -0,0 +1,5 @@
"""JUMP pilot dataset metadata and download utilities."""

from . import image_downloader, image_metadata
Member

Instead of placing these imports at a lower level, consider adding them to the top-level package and using that as a guide to what the package (and any sub-packages like this one) makes available.

Member Author

Good idea, I've implemented this.

Cameron Mattson added 5 commits April 29, 2026 09:10
Use `--frozen` for both `uv sync` and `uv run` in GitHub Actions.
This ensures CI strictly uses the committed lock state and fails fast
when lockfile or dependency metadata drift occurs.

…arquet

Document why `2020_11_04_CPJUMP1_all_plates.parquet` is included in the package
and how it is generated from JUMP Cell Painting Gallery data via the linked
JUMP-single-cell notebook. Summarize the transformation workflow and add an
update policy so future regenerations remain reproducible and traceable.

Update the project dependency from pandas>=1.5 to pandas>=2.2,<4
to align with current ecosystem standards while allowing 3.x.
This tightens compatibility expectations and avoids claiming
support for older, untested pandas releases.
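
To make the new bound concrete, here is a rough sketch of which pandas releases fall inside `pandas>=2.2,<4`. The helper names `parse_release` and `in_supported_range` are made up for illustration and only handle plain numeric versions; a real check would more likely rely on the `packaging` library, which understands pre-releases and the rest of the version grammar.

```python
def parse_release(version: str) -> tuple[int, ...]:
    """Turn a plain release string such as '2.2.3' into (2, 2, 3)."""
    return tuple(int(part) for part in version.split("."))


def in_supported_range(version: str) -> bool:
    """Approximate the range pandas>=2.2,<4 for plain numeric releases."""
    release = parse_release(version)
    # Tuple comparison is element-wise, so (2, 2) <= (2, 2, 3) < (4,).
    return (2, 2) <= release < (4,)


for candidate in ["1.5.3", "2.1.4", "2.2.0", "3.0.1", "4.0.0"]:
    status = "allowed" if in_supported_range(candidate) else "rejected"
    print(f"pandas {candidate}: {status}")
```

Under these assumptions, the 1.x and 2.1 releases are rejected, 2.2 through 3.x are allowed, and 4.0 is rejected.
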

Re-export jump_pilot downloader/metadata symbols from jump_image_datasets
to make the public API explicit at the package boundary.
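
As an illustration of what a top-level re-export can look like, the sketch below builds a throwaway copy of the package layout in a temporary directory and imports it. The package and module names come from this PR; the module bodies are empty stand-ins, and the `__all__` list is an assumption about how the public API might be declared, not a copy of the merged code.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Build a throwaway package tree that mirrors the names used in this PR.
# The module bodies are empty stand-ins, not the real implementations.
root = Path(tempfile.mkdtemp())
pkg = root / "jump_image_datasets"
sub = pkg / "jump_pilot"
sub.mkdir(parents=True)

(sub / "image_downloader.py").write_text('"""Stand-in downloader module."""\n')
(sub / "image_metadata.py").write_text('"""Stand-in metadata module."""\n')
(sub / "__init__.py").write_text(
    '"""JUMP pilot dataset metadata and download utilities."""\n'
)

# Top-level __init__.py: the re-exports live here, so this one file
# documents what the package makes available (assumed layout).
(pkg / "__init__.py").write_text(
    "from .jump_pilot import image_downloader, image_metadata\n"
    '__all__ = ["image_downloader", "image_metadata"]\n'
)

sys.path.insert(0, str(root))
jid = importlib.import_module("jump_image_datasets")

print(jid.__all__)
print(jid.image_metadata.__name__)
```

With this layout, callers can write `from jump_image_datasets import image_metadata` without knowing which sub-package the module lives in.
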

Add an autouse cache-reset fixture in metadata tests to prevent
cross-test cache leakage and ensure deterministic test behavior.
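
A minimal sketch of an autouse cache-reset fixture, assuming the metadata loader is cached with `functools.lru_cache`. The function `load_metadata` and the fixture name `reset_metadata_cache` are hypothetical; the actual caching mechanism in this repo may differ.

```python
from functools import lru_cache

import pytest


@lru_cache(maxsize=None)
def load_metadata(name: str) -> dict:
    """Hypothetical cached loader standing in for the package's metadata reader."""
    return {"name": name}


@pytest.fixture(autouse=True)
def reset_metadata_cache():
    """Clear the cache before and after every test in this module."""
    load_metadata.cache_clear()
    yield
    load_metadata.cache_clear()


def test_first_call_is_a_miss():
    load_metadata("plate_a")
    # Only true if no earlier test left an entry (or statistics) behind.
    assert load_metadata.cache_info().misses == 1


def test_cache_starts_empty():
    # Passes regardless of test order because the fixture runs automatically.
    assert load_metadata.cache_info().currsize == 0
```

Because the fixture is marked `autouse=True`, tests do not need to request it by name, so a newly added test cannot forget to reset the cache.
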
@MattsonCam
Member Author

Thanks @d33bs! Merging now.

@MattsonCam MattsonCam merged commit 5f3bcdd into main Apr 29, 2026
10 checks passed
@MattsonCam MattsonCam deleted the pilot_download_images branch April 29, 2026 16:04
2 participants