style - fix header guards
Whitespace, style, and formatting updates for consistency between CUDA and HIP backendsAdds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-sh
Whitespace, style, and formatting updates for consistency between CUDA and HIP backendsAdds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-shared and hip-shared as well as sycl.
show more ...
gpu - naming consistency fixes
ceed - move GetResourceRoot to backend interface
minor - clean up backend headers for const and argument names
Merge pull request #1197 from sebastiangrimberg/sjg/style-whitespace-fixMinor style consistency updates
hip - fix include order
internal - add CeedCheck macro to reduce repetition
Enforce include order in .clang-format
Fix file endings inconsistency
IWYU fixes (#1182)* iwyu - include fixes * iwyu - silence some iwyu output * minor - clearer macro names * iwyu - fix suggestion of "ceed/ceed.h" externally * iwyu - lighter petsc heade
IWYU fixes (#1182)* iwyu - include fixes * iwyu - silence some iwyu output * minor - clearer macro names * iwyu - fix suggestion of "ceed/ceed.h" externally * iwyu - lighter petsc headers * iwyu - ceed/ceed.h -> ceed.h * iwyu - cuda/hip include fixes
minor - assorted formatting fixes
Switch to clang-format (#1051)* style - switch to clang-format * ci - use newer libxsmm * action - update format action * format - consistent use of {} for multi-line if/for * make - re
Switch to clang-format (#1051)* style - switch to clang-format * ci - use newer libxsmm * action - update format action * format - consistent use of {} for multi-line if/for * make - remove stray newline * make - simpler 'make format' target * ci - use newer libxsmm * doc - minor release note claification * minor - minor fix * minor - minor fix * minor - minor fix * minor - minor fix * make format * format - less aggressive alignment rules * tidy - check for argument name mismatches * fix newline * format - mirror Ratel update to .clang-format * fix merge error * fix merge conflict * fix merge error * drop style in .phony list * Update .clang-format Co-authored-by: Jed Brown <jed@jedbrown.org> * apply updated format Co-authored-by: Jed Brown <jed@jedbrown.org>
Refactor `cuda-gen` and `hip-gen` backends. (#1050)* Add TODO items. * rough, but something like this? * wip - cleaning up some warnings, but more remain * wip - reorganize * wip - miss
Refactor `cuda-gen` and `hip-gen` backends. (#1050)* Add TODO items. * rough, but something like this? * wip - cleaning up some warnings, but more remain * wip - reorganize * wip - missing kernels * wip - replace t1d * fix some kernels * another typo * more * another one * closer * define T_1D * typosgit add .! * WIP: changes to cuda-shared framework for new kernels * fix output writing * buffer fix * buffer sizes * WIP: fixes for 2 and 3D basis kernels * minor * fix weight kernel for 3d * remove debugging output * minor reorg * fix includes * enable collo grad for cuda-shared * move quoted kernels * renaming * missed a rename * small fix * more naming consistency * faster 'useCollograd=false' path in *-gen * more style * one last style fix * clearer collograd condition * Add gen basis kernels to hip-shared * Try some changes to hip-shared basis block sizes for new kernels * cuda - drop extra kernel arg * cuda - fix collograd check logic * update gen comment about parallelization * tidy up fields struct definition * tidy up structs even more * Update hip-gen basis templates use and move other hip-gen device functions to jit-source * Finish hip-gen basis template update; small style updates to match CUDA * missing isStrided * Update block size used in 3D weight for new shared kernels * update release notes Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org> Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>
hip - guard hipblas header include for HIP_VERSION
QF headers for typedefs and macros (#1036)* jit - qf headers for typedefs and macros * jit - smaller list of permitted files * ceed - only include ceed.h in QF source
gpu - fix setting device id
minor - update copyright headers
Add flag to use atomic adds on supported AMD GPU hardware
Slight modifications for hiprtc usage in ROCm 4.5 (#850)
style - consistent nameing and style for gpu backends
gpu - drop unused argument in init
gpu - drop unused [HIP, CUDA]_MAX_PATH constant
gpu - split common cuda/hip data into separate folder
cuda - separate compile functionality into new header
12345