Increase Performance with Vectorized Memory Access#226
Increase Performance with Vectorized Memory Access#226ByLamacq wants to merge 11 commits intobrichard19:masterfrom
Conversation
|
Have you finds anything on 2080? How much have you checked? |
I'm not looking so I can't find anything. It's just for programming challenge... |
|
could you help out in add these features, addons if --keyspace is 1:3000, new switch --loop --count 2000 --stride 100 |
|
Hello good morning, I want to know if I put several video cards on the same computer to give you an example 4 video cards, these 4 video cards when running the program would have greater power and speed or not? I await your comments. |
|
Yes - you will have greater power and speed. |
First of all, thank you very much for your answer and other questions and the video cards can be any model, for example gtx 1080ti 11GB, some 2 video cards and adding rx 580 8GB about 3 video cards and adding rtx 3060TI 8GB, I would have no problems or have to be all the same models and nvidia or AMD ?, I await your answer. |
|
Yes - they can be any Nvidia cards. I don't know about AMD cards. |
Again thank you very much for responding and I will see to incorporate more video cards then to achieve greater power and speed, I ask you, what video cards do you use? Have you tried the Nvidia GTX, RTX or QUADDRO? Which ones do you recommend using? |
|
I use 12 Nvidia P106-100 mining cards and 2 Nvidia Tesla K80's. |
were you lucky to use so much power and speed with Bitcarck? |
|
Not yet - I just started. |
are they on 2 separate PCs or 1? as if it were a mining rig? |
|
They are on one PC - an old converted mining rig. |
…e changelog file for more informations.
|
@ByLamacq Is this a patch, which could be also applied to OpenCL? Or is this a CUDA-specific optimization? |
|
It's not really a patch but yes it's can be apply to Opencl. Amd gpu have also specific microcode for vector data load. So i think this change in cl code can increase performance. |
|
@BitCrackEvo |
|
@Uzlopak Actualy, opencl read an array of structure : It's not very good but OpenCl is a high language of programmation so it's depend on the implementation by the compilator... But, you can also try to do that yourself... It's a good training to upgrade your skills. |
|
Hi @BitCrackEvo I started to dig deeper. Very interesting. Can you Help me with this question on stack overflow? |
|
This boosted my Jetson Nano about 20% faster. |
Hello,
I changed global memory access from scalar to vector.
Plateform : Ubuntu 16.04, GTX 1050ti, Cuda 10.1 (Up : Original)

Plateform : Ubuntu 18.04, RTX 2080, Cuda 10.2 (Up : Original)

Best regards,
ByLamacq