Kernel Fusion #72

ejmeitz · 2025-10-02T19:03:34Z

Goals:

Automatically fuse broadcast operations into a single CUDA kernel
Allow users to fuse custom functions which when called invoke a fused kernel

* init broadcasting changes * broadasting almost works * error on user defined functions * support scalar broadcasting * asd * better type promotion * fix floaty unary ops * use optimized square and reciprocal calls given 2 and -1 literals * start tests * add allowscalar and allowdouble * ban promotion to all wider types * working on tests * force NDArray type param for dim to be Int64 * tests up to GEMM pass * up to unary_ops pass * unary reductions pass tests * binops pass except things with NaN * all tests pass * add short hands to copy to and from Julia array * tests for scalars * fix tests i broke * stuff * fix some things * all tests pass * remove if-else * separate out promotion logic * add lcm, gcd, negation tests --------- Co-authored-by: krasow <krasow@u.northwestern.edu>

Co-authored-by: krasow <krasow@u.northwestern.edu>

ejmeitz and others added 11 commits October 2, 2025 14:01

copy changes to fresh branch

dedc837

Update README.md

2d403e0

Update PTX generation (#62)

ac5d40e

@cunumeric macro for scoping intermediate temporary allocations. (#68)

a4237b8

rm compile_wrapper.sh (#69)

c58c6c0

fix .random edge case (#70)

d1a9e72

Co-authored-by: krasow <krasow@u.northwestern.edu>

remove .random and update rand (#71)

4690f22

working on fusion

a6163d5

general outline

5150406

asd

1c78d61

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Kernel Fusion #72

Kernel Fusion #72

Uh oh!

ejmeitz commented Oct 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Kernel Fusion #72

Are you sure you want to change the base?

Kernel Fusion #72

Uh oh!

Conversation

ejmeitz commented Oct 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants