| #
773bf0f6
|
| 05-Mar-2024 |
Hong Zhang <hongzhang@anl.gov> |
Add SELLHIP
- The HIP kernels are converted directly from their CUDA version - AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to faciliate comp
Add SELLHIP
- The HIP kernels are converted directly from their CUDA version - AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to faciliate compile-time code optimization
show more ...
|