| #
78cbaa95
|
| 30-Nov-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Merge pull request #1300 from CEED/sjg/hcurl-hdiv-basis-cuda-hip
H(div) and H(curl) basis support for `cuda-ref` and `hip-ref` backends
|
| #
d075f50b
|
| 11-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
H(div) and H(curl) basis support for cuda-ref and hip-ref backends
|
| #
506b1a0c
|
| 20-Sep-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Non-square operator full assembly (#1316)
* Consistency formatting for operator tests
* Add (failing) test for non-square operator full assembly
* Add support for CPU-based full assembly of non-square CeedOperators
* Fix full assembly of identity quadrature functions and operators with CEED_BASIS_NONE
Also adds a unit test for full assembly which fails prior to the changes in this commit.
* Minor refactor to improve coverage
|
| #
21f16bf6
|
| 13-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1315 from CEED/jeremy/queep
Backend Consistency Fixes
|
| #
ca735530
|
| 31-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fixes for CUDA backends
|
| #
31c137a9
|
| 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1320 from CEED/jeremy/jit-header-guards
style - fix header guards
|
| #
94b7b29b
|
| 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fix header guards
|
| #
3c708d80
|
| 24-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1299 from CEED/sjg/cuda-hip-backends-consistency
Style fixes for `cuda` and `hip` backend consistency
|
| #
b2165e7a
|
| 11-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Whitespace, style, and formatting updates for consistency between CUDA and HIP backends
Adds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-shared and hip-shared as well as sycl.
|
| #
53f7acb1
|
| 24-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1292 from CEED/sjg/gpu-rstr-unify
Unify `magma` backend restriction with `cuda-ref` and `hip-ref`
|
| #
58549094
|
| 15-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Unify magma backend restriction with cuda/hip-ref, keeping runtime option for deterministic and non-deterministic using magma/det
This also opens the opportunity to make cuda/hip-ref non-deterministic by default and add cuda/hip-ref/det variants.
|
| #
38432684
|
| 03-Aug-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Merge pull request #1265 from CEED/sjg/hcurl-restr-dev
CeedElemRestriction for H(curl)
|
| #
0c73c039
|
| 22-Jun-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Improve element restriction for H(curl) spaces by reorganizing loops, including use of int8_t for tridiagonal matrix
|
| #
0305e208
|
| 06-May-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Update backends for unified ElemRestrictionCreate variants for all restriction types (default, oriented, strided)
|
| #
4b35598d
|
| 20-Jun-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1231 from CEED/jeremy/consistency
Consistency fixes
|
| #
eb7e6caf
|
| 16-Jun-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - naming consistency fixes
|
| #
30d6126f
|
| 25-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1202 from CEED/jeremy/const-fix
Clean up backend headers for const and argument names
|
| #
472941f0
|
| 21-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - fix static vs CEED_INTERN in backend file
|
| #
51475c7c
|
| 20-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - clean up backend headers for const and argument names
|
| #
6e6704a8
|
| 19-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1198 from CEED/jeremy/CeedCheck
Add CeedCheck macro to reduce repetition
|
| #
6574a04f
|
| 18-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
internal - add CeedCheck macro to reduce repetition
|
| #
49aac155
|
| 24-Mar-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
IWYU fixes (#1182)
* iwyu - include fixes
* iwyu - silence some iwyu output
* minor - clearer macro names
* iwyu - fix suggestion of "ceed/ceed.h" externally
* iwyu - lighter petsc headers
* iwyu - ceed/ceed.h -> ceed.h
* iwyu - cuda/hip include fixes
|
| #
2b730f8b
|
| 17-Nov-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Switch to clang-format (#1051)
* style - switch to clang-format
* ci - use newer libxsmm
* action - update format action
* format - consistent use of {} for multi-line if/for
* make - remove stray newline
* make - simpler 'make format' target
* ci - use newer libxsmm
* doc - minor release note clarification
* minor - minor fix
* minor - minor fix
* minor - minor fix
* minor - minor fix
* make format
* format - less aggressive alignment rules
* tidy - check for argument name mismatches
* fix newline
* format - mirror Ratel update to .clang-format
* fix merge error
* fix merge conflict
* fix merge error
* drop style in .phony list
* Update .clang-format
Co-authored-by: Jed Brown <jed@jedbrown.org>
* apply updated format
Co-authored-by: Jed Brown <jed@jedbrown.org>
|
| #
9e201c85
|
| 23-Sep-2022 |
Yohann <dudouit1@llnl.gov> |
Refactor `cuda-gen` and `hip-gen` backends. (#1050)
* Add TODO items.
* rough, but something like this?
* wip - cleaning up some warnings, but more remain
* wip - reorganize
* wip - missing kernels
* wip - replace t1d
* fix some kernels
* another typo
* more
* another one
* closer
* define T_1D
* typosgit add .!
* WIP: changes to cuda-shared framework for new kernels
* fix output writing
* buffer fix
* buffer sizes
* WIP: fixes for 2 and 3D basis kernels
* minor
* fix weight kernel for 3d
* remove debugging output
* minor reorg
* fix includes
* enable collo grad for cuda-shared
* move quoted kernels
* renaming
* missed a rename
* small fix
* more naming consistency
* faster 'useCollograd=false' path in *-gen
* more style
* one last style fix
* clearer collograd condition
* Add gen basis kernels to hip-shared
* Try some changes to hip-shared basis block sizes for new kernels
* cuda - drop extra kernel arg
* cuda - fix collograd check logic
* update gen comment about parallelization
* tidy up fields struct definition
* tidy up structs even more
* Update hip-gen basis templates use and move other hip-gen device functions to jit-source
* Finish hip-gen basis template update; small style updates to match CUDA
* missing isStrided
* Update block size used in 3D weight for new shared kernels
* update release notes
Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>
|
| #
428b7a12
|
| 06-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #977 from CEED/jeremy/fallback-yet-again
Add debugging output to fallback creation
|