- For mac M2 GPU, if the `Float64` type does not work, switch it to `Float32`. - For the `write your own kernels`, the `grid` in one block cell should be `groups`