Skip to content

Lack of CPU allreduce implementation hurts CPU algorithm scaling. #72

@RAMitchell

Description

@RAMitchell

The current allreduce implementation for the CPU algorithm uses AllGather as a hack, but this is inefficient.

We need an efficient allreduce implementation in legate, or a way to hack something more efficient together in legateboost.

cpu_scaling_allreduce

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions