Skip to content

Failed to connect to GCS at address #54

@wowlza

Description

@wowlza

在运行脚本examples/scripts/train_mm_eureka_qwen_7b_single_node.sh时,出现多机分布和 Ray 连接失败,报错如下:

+ sleep 30
+ ray start --address=:6379 --num-gpus 8 --block
Local node IP: 127.0.0.1
[2025-07-23 10:05:49,679 W 51000 51000] gcs_rpc_client.h:151: Failed to connect to GCS at address :6379 within 5 seconds.
[2025-07-23 10:06:19,681 W 51000 51000] gcs_client.cc:183: Failed to get cluster ID from GCS server: TimedOut: Timed out while waiting for GCS to become available.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions