| #
94b7b29b
|
| 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fix header guards
|
| #
49aac155
|
| 24-Mar-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
IWYU fixes (#1182)
* iwyu - include fixes
* iwyu - silence some iwyu output
* minor - clearer macro names
* iwyu - fix suggestion of "ceed/ceed.h" externally
* iwyu - lighter petsc headers
* iwyu - ceed/ceed.h -> ceed.h
* iwyu - cuda/hip include fixes
|
| #
2b730f8b
|
| 17-Nov-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Switch to clang-format (#1051)
* style - switch to clang-format
* ci - use newer libxsmm
* action - update format action
* format - consistent use of {} for multi-line if/for
* make - remove stray newline
* make - simpler 'make format' target
* ci - use newer libxsmm
* doc - minor release note claification
* minor - minor fix
* minor - minor fix
* minor - minor fix
* minor - minor fix
* make format
* format - less aggressive alignment rules
* tidy - check for argument name mismatches
* fix newline
* format - mirror Ratel update to .clang-format
* fix merge error
* fix merge conflict
* fix merge error
* drop style in .phony list
* Update .clang-format
Co-authored-by: Jed Brown <jed@jedbrown.org>
* apply updated format
Co-authored-by: Jed Brown <jed@jedbrown.org>
|
| #
9e201c85
|
| 23-Sep-2022 |
Yohann <dudouit1@llnl.gov> |
Refactor `cuda-gen` and `hip-gen` backends. (#1050)
* Add TODO items.
* rough, but something like this?
* wip - cleaning up some warnings, but more remain
* wip - reorganize
* wip - missing kernels
* wip - replace t1d
* fix some kernels
* another typo
* more
* another one
* closer
* define T_1D
* typosgit add .!
* WIP: changes to cuda-shared framework for new kernels
* fix output writing
* buffer fix
* buffer sizes
* WIP: fixes for 2 and 3D basis kernels
* minor
* fix weight kernel for 3d
* remove debugging output
* minor reorg
* fix includes
* enable collo grad for cuda-shared
* move quoted kernels
* renaming
* missed a rename
* small fix
* more naming consistency
* faster 'useCollograd=false' path in *-gen
* more style
* one last style fix
* clearer collograd condition
* Add gen basis kernels to hip-shared
* Try some changes to hip-shared basis block sizes for new kernels
* cuda - drop extra kernel arg
* cuda - fix collograd check logic
* update gen comment about parallelization
* tidy up fields struct definition
* tidy up structs even more
* Update hip-gen basis templates use and move other hip-gen device functions to jit-source
* Finish hip-gen basis template update; small style updates to match CUDA
* missing isStrided
* Update block size used in 3D weight for new shared kernels
* update release notes
Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>
|
| #
428b7a12
|
| 06-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #977 from CEED/jeremy/fallback-yet-again
Add debugging output to fallback creation
|
| #
6aa95790
|
| 06-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
pc - fix fallback for composite assembly
|
| #
ce18bed9
|
| 17-Mar-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #858 from CEED/jeremy/dump-copy-stuff
Strip redundant/outdated license info duplication
|
| #
3d8e8822
|
| 17-Mar-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright headers
|
| #
d92fedf5
|
| 22-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #863 from CEED/jeremy/gpu-jit-code
GPU - separate common code into separate folder
|
| #
7fcac036
|
| 22-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - split common cuda/hip data into separate folder
|
| #
aa616312
|
| 10-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #855 from CEED/jeremy/drop-oper-e-data
Remove magic number 16
|
| #
bf4cb664
|
| 08-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - turn magic number 16 into constant CEED_FIELD_MAX
|
| #
ebc204c0
|
| 15-Apr-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #740 from CEED/natalie/device-id
Update device ID selection for HIP/CUDA/MAGMA backends
|
| #
6dbfb411
|
| 05-Apr-2021 |
nbeams <246972+nbeams@users.noreply.github.com> |
Update device ID selection for HIP/CUDA backends; add for MAGMA backends
|
| #
874019bc
|
| 31-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #716 from CEED/jed/install-backend.h
Jed/install backend.h
|
| #
ec3da8bc
|
| 26-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Install install backend headers under include/ceed/
This makes it possible to distribute source plugins that provide additional backends. It's also used in MFEM, perhaps temporarily.
Deprecate ceed-backend.h, which was not previously installed, but some users accessed it from an in-place build.
Also install CUDA and HIP headers that allow users to provide CUfunction and hipFunction_t.
Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu>
Requested-by: Andrew T. Barker <barker29@llnl.gov>
|
| #
3d576824
|
| 29-Jan-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
headers - clearify includes to not rely on transitive includes (#701)
* headers - clearify includes to not rely on transitive includes
* style - add header recommendations from 'include-what-you-use'
* style - apply 'include-what-you-use' changes to CUDA backends
* style - 'include-what-you-use' for hip backends
* style - drop ceed.h includes in gallery qf source
* docs - add dev notes for header files
* style - header style and alphabetize
|
| #
2a8ae808
|
| 17-Sep-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Merge branch 'main' into icl/hip-magma
|
| #
18d499f1
|
| 17-Sep-2020 |
Yohann <dudouit1@llnl.gov> |
Enable under-integration for cuda-shared and cuda-gen backends (#620)
* Support under integration in cuda-shared.
* Add under-integration to the cuda-gen backend.
* Fix bugs when under-integ in cuda-shared.
* Factor some code.
* Factor some code in cuda-gen.
* Guard more carefully.
* Introduce T1d.
* Fix a bug in readQuads3d
* Fix bugs in 3D.
* Fix a typo
* Safety init.
* Try something with ContractZ3d.
* Guard the add
* revert add.
* Add more thread guards
* Same as previous
* Fix a bug in add.
* style.
* Check that the bases are tensor in cuda-gen.
* move isTensor
* Add T1d to cuda-gen and guard contractions.
* Fix typos.
* add guards in 1d.
* Rewrite weight functions.
* typo
* CUDA - fix cuda-gen collocated check
* make style.
Co-authored-by: jeremylt <thompson.jeremy.luke@gmail.com>
|
| #
77ad9f29
|
| 15-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Merge branch 'main' into natalie/hip-ref-v2
|
| #
4d36c801
|
| 24-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #568 from CEED/jeremy/cuda-destroy
Small CUDA Tidying
|
| #
7df94212
|
| 23-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
CUDA - clean up includes
|
| #
621cd461
|
| 16-Mar-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #421 from SanderA/sanderarens/fix_ceed_cuda_subclasses
Add Ceed_Cuda struct to Ceed_Cuda_ref/shared/gen.
|
| #
d80fc06a
|
| 24-Feb-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
CUDA - use strides as template parameters for cuda/gen
|
| #
920dcdc4
|
| 14-Feb-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
CUDA - initial impl of strided restrictions in cuda/gen
|