Let's collect information about existing solutions, their architecture, and wear or strong sides. Hopefully, this will help get an overview about the current state and next steps that have to be taken to improve CUDA experience. Also, we will be able to define crucial components that can be shared between the different approaches.
I'm going to post here an overview of ptx-linker and ptx-builder approach in next days.