Update the OpenCL kernel for 128/256 threads/block based on the equivalent CUDA kernel - see commit f2b9db2 from the main Gromacs master branch: gromacs@f2b9db2
Evaluate the performance of the new kernel for AMD and NVIDIA GPUs and decide on the final version or versions of the OpenCL kernel that will be used.
Update the OpenCL kernel for 128/256 threads/block based on the equivalent CUDA kernel - see commit f2b9db2 from the main Gromacs master branch: gromacs@f2b9db2
Evaluate the performance of the new kernel for AMD and NVIDIA GPUs and decide on the final version or versions of the OpenCL kernel that will be used.