Skip to content

[data][ci] removing databuild from data CI#61508

Open
elliot-barn wants to merge 1 commit intomasterfrom
elliot-barn-removing-databuild-image
Open

[data][ci] removing databuild from data CI#61508
elliot-barn wants to merge 1 commit intomasterfrom
elliot-barn-removing-databuild-image

Conversation

@elliot-barn
Copy link
Contributor

databuild and datalbuild(latest) now are using the same set of dependencies.

removing databuild-py* and replacing with datalbuild-py CI images

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
@elliot-barn elliot-barn requested a review from bveeramani March 5, 2026 02:29
@elliot-barn elliot-barn requested a review from a team as a code owner March 5, 2026 02:29
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request aims to remove the databuild CI configuration and replace its usages with datalbuild. The changes are mostly correct, but there is a critical issue where the datalbuild image is still being tagged as databuild. This will cause CI failures because tests will be unable to find the correctly named image. A fix is required in ci/docker/datal.build.wanda.yaml to update the image tag to align with the new build names.

commands:
- bazel run //ci/ray_ci:test_in_docker -- //python/ray/data/... data
--build-name databuild-py3.12 --python-version 3.12
--build-name datalbuild-py3.12 --python-version 3.12
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This change to --build-name datalbuild-py3.12 is correct in principle, but it will cause the CI to fail. The configuration in ci/docker/datal.build.wanda.yaml for this build still specifies the image tag as cr.ray.io/rayproject/databuild-py$PYTHON. This means the test runner will look for an image named datalbuild-py3.12 but won't find it, because the image will be tagged as databuild-py3.12.

To resolve this, the tags field in ci/docker/datal.build.wanda.yaml needs to be updated to cr.ray.io/rayproject/datalbuild-py$PYTHON.

This issue affects all test steps being updated to use datalbuild in this file.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@elliot-barn is this valid?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nope. the tag on the datal.build.wanda.yaml images uses datalbuild-py$PYTHON

Link here: https://github.com/ray-project/ray/blob/master/ci/docker/datal.build.wanda.yaml#L1

Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.

PYTHON: "{{matrix}}"
tags: cibase

- name: databuild-multipy
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed build step still referenced in core CI

High Severity

The databuild-multipy build step was removed from data.rayci.yml, but .buildkite/core.rayci.yml still references it at line ~182 with --build-name databuild-py3.12 and at line ~184 with depends_on: databuild-multipy. This creates a broken dependency — the core CI step depends on a build step that no longer exists, which will cause CI pipeline failures. These references in core.rayci.yml need to be updated to use datalbuild-py3.12 and datalbuild-multipy respectively.

Fix in Cursor Fix in Web

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@elliot-barn , could you double check?

@bveeramani bveeramani self-assigned this Mar 5, 2026
commands:
- bazel run //ci/ray_ci:test_in_docker -- //python/ray/data/... data
--build-name databuild-py3.12 --python-version 3.12
--build-name datalbuild-py3.12 --python-version 3.12
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@elliot-barn is this valid?

@ray-gardener ray-gardener bot added the devprod label Mar 5, 2026
PYTHON: "{{matrix}}"
tags: cibase

- name: databuild-multipy
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@elliot-barn , could you double check?

@aslonnie aslonnie self-requested a review March 6, 2026 19:37
Copy link
Collaborator

@aslonnie aslonnie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ping me for merging when the two issues/coomments are checked.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants