-
Notifications
You must be signed in to change notification settings - Fork 19
Gpu pingpong test #556
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Gpu pingpong test #556
Conversation
e329147 to
49b0fd2
Compare
2cdb527 to
4222191
Compare
|
I'm a bit lost with CI here... Another pair of eyes would help. To summarize what I observe:
To conclude, I have no idea why |
1ffaec0 to
db3ad7d
Compare
b9f4741 to
db3ad7d
Compare
a3f8803 to
49f7d56
Compare
| #include "parsec/data_dist/matrix/two_dim_rectangle_cyclic.h" | ||
| #include "parsec/interfaces/dtd/insert_function_internal.h" | ||
| #include "tests/tests_data.h" | ||
| #include "parsec/mca/device/cuda/device_cuda_internal.h" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why do we need this, this is internal and should not spillover into user code.
|
please rebase and reassess the changes to the CI part (not clear they are still needed). |
make a token pass from CPU to each GPU, and back, a few times, to check a possible bug found by @devreal. Part of the DTD interface was not fully ported to HIP Enable (cuda|hip)_pingpong test in CI Add a PTG GPU pingpong test to compare with the behavior in DTD -- Work in progress Tests need to import the appropriate GPU-specific header file, as insert_function_internal.h doesn't do it for them anymore Enable PTG test over CUDA Fix errors in data distribution initialization and some DAG errors in the PTG of the GPU pingpong test Rename files and directories to match the new status of tests (tests/runtime/cuda is renamed tests/runtime/gpu and the pingpong tests are named to specify the API and not a particular device name, since they should work on both GPU types) Only define the pingpong tests if a suitable compiler is found for the kernels Do a ping-pong-pong test instead of ping-pong, to see how dependencies are tracked on GPU-to-GPU task dependency Fix the checks of the pingpong test, and add it in the Testings.cmake PTG ping-pong test: in order to guide the selection of the best device, the advised data needs to flow from a CPU task, not directly from memory. Trying to introduce the gpu_nvidia runner in the CI matrix Add ROCm, create one github_runner-[device].yaml file per device; remove debugging info from CMakeLists.txt Add some infrastructure to make sure CI does the device tests where it should, and issue an error if things cannot be tested (e.g. because the GPUs are down or the compiler/spack is broken) Trying to work around the xml2 issue with mesa. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Integrate the gpu_amd/release in the test suite Add support to rocm-smi in check_nb_devices.sh Conditional CMake command that depends upon the github runner loaded to prepare for testing
e6b4966 to
c644262
Compare
…e if we install binutils+headers?
b7055c4 to
36f7822
Compare
|
Split this PR in two: one for the tester itself and another for the CI/runners |
|
There has not been a lot of progress on this, and I think we can autogenerate (hipify) the hip testers, lets postpone to 4.1 |
This creates a simple test that makes data be updated on CPU and on GPUs alternatively.
As part of the test, I found out that HIP was not ported over DTD, and this ports HIP over DTD. It also provides a first test for HIP.