| #
90a4cd18
|
| 27-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Whitespace formatting consistency with libCEED style
|
| #
9b83230a
|
| 27-Oct-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Merge pull request #1385 from CEED/sjg/magma-transpose-opt
Improve transpose basis performance for `magma` backend
|
| #
833aa127
|
| 19-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Use shared-memory for transpose non-tensor basis kernel to accelerate load of A matrix
|
| #
7132caa0
|
| 20-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Remove need to zero out V vector before applying basis transpose for magma backends
|
| #
9d15e85b
|
| 18-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
H(div) and H(curl) basis support for magma backend
|
| #
db7ade31
|
| 17-Oct-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1382 from CEED/sjg/magma-formatting-dev
Lazy RTC compilation of MAGMA basis kernels for small P, Q
|
| #
940a72f1
|
| 10-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Formatting consistency for magma backend with cuda-ref and hip-ref
Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-ten
Formatting consistency for magma backend with cuda-ref and hip-ref
Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-tensor basis CEED_EVAL_WEIGHT mode.
show more ...
|
| #
f80f4a74
|
| 09-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Rename files in magma backend for consistency with other libCEED backends
|
| #
21f16bf6
|
| 13-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1315 from CEED/jeremy/queep
Backend Consistency Fixes
|
| #
38293ee6
|
| 31-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fixes for Magma backends
|
| #
1c66c397
|
| 30-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Interface consistency fixes (#1312)
* format - consistency fixes
* format - test name consistency
* format - drop comma operator
* format - drop git add . in favor coercion
* format - ad
Interface consistency fixes (#1312)
* format - consistency fixes
* format - test name consistency
* format - drop comma operator
* format - drop git add . in favor coercion
* format - add CeedScalarSwap macro
show more ...
|
| #
53f7acb1
|
| 24-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1292 from CEED/sjg/gpu-rstr-unify
Unify `magma` backend restriction with `cuda-ref` and `hip-ref`
|
| #
00fb7a04
|
| 17-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Refactor CeedInit_Magma and CeedInit_Magma_det similar to ceed-cuda-common.h and ceed-hip-common.h
|
| #
63ca1801
|
| 08-Jul-2023 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #1249 from CEED/jeremy/memcheck-isnan
Memcheck - warn or error on NaN after write-only access
|
| #
23d4529e
|
| 07-Jul-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - add debug color enum
|
| #
6e6704a8
|
| 19-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1198 from CEED/jeremy/CeedCheck
Add CeedCheck macro to reduce repetition
|
| #
6574a04f
|
| 18-Apr-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
internal - add CeedCheck macro to reduce repetition
|
| #
49aac155
|
| 24-Mar-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
IWYU fixes (#1182)
* iwyu - include fixes
* iwyu - silence some iwyu output
* minor - clearer macro names
* iwyu - fix suggestion of "ceed/ceed.h" externally
* iwyu - lighter petsc heade
IWYU fixes (#1182)
* iwyu - include fixes
* iwyu - silence some iwyu output
* minor - clearer macro names
* iwyu - fix suggestion of "ceed/ceed.h" externally
* iwyu - lighter petsc headers
* iwyu - ceed/ceed.h -> ceed.h
* iwyu - cuda/hip include fixes
show more ...
|
| #
023b8a51
|
| 25-Jan-2023 |
abdelfattah83 <36712794+abdelfattah83@users.noreply.github.com> |
magma: non-tensor rtc (#1141)
* some refactoring in magma's jit src
* fix path
* fix loading src
* refactor magma nontensor backend
* refactor magma nontensor backend
* [WIP]: new non
magma: non-tensor rtc (#1141)
* some refactoring in magma's jit src
* fix path
* fix loading src
* refactor magma nontensor backend
* refactor magma nontensor backend
* [WIP]: new nontensor basis kernels
* [WIP]: new nontensor basis kernels
* [WIP]: new nontensor basis kernels
* call the new nontensor kernels for low order problems
* multiple compilation for the same kernels but with different tuning parmaters
* magma: allow different nb's for different non-tensor kernels
* tuning data for the non-tensor rtc kernels
* remove no-longer used functions, add new one for tuning the nontensor kernels
* constants for tuning
* tuning functions
* use the tuning functions in compiling/running the new kernels
* bug fix
* fixes
* fixes
* minor
* switch tuning data
* fix name
* fix name
* add function to run cuda kernels with opt-in shared memory feature
* minor fix
* minor fix
* fix calls to batch api
* allow more kernel instances
* temporary timing function
* temporary timing function
* tuning data based on hiprtc
* rollback tuning parameters
* fixes
* fixes
* fix inconsistency in the parameters passed to nvrtc/hiprtc
* minor
* a fix to the nb selector
* cleanup
* merge the opt-in feature in CeedRunKernelDimSharedOptinCuda into CeedRunKernelDimSharedCuda
* fix paths for hip-magma backends
* style
* fixes
* running make format
* undo changes from the last commit
* change HIP_DIR to ROCM_DIR and adjust the paths for magma accordingly
* replace HIP_DIR with ROCM_DIR
show more ...
|
| #
2b730f8b
|
| 17-Nov-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Switch to clang-format (#1051)
* style - switch to clang-format
* ci - use newer libxsmm
* action - update format action
* format - consistent use of {} for multi-line if/for
* make - re
Switch to clang-format (#1051)
* style - switch to clang-format
* ci - use newer libxsmm
* action - update format action
* format - consistent use of {} for multi-line if/for
* make - remove stray newline
* make - simpler 'make format' target
* ci - use newer libxsmm
* doc - minor release note claification
* minor - minor fix
* minor - minor fix
* minor - minor fix
* minor - minor fix
* make format
* format - less aggressive alignment rules
* tidy - check for argument name mismatches
* fix newline
* format - mirror Ratel update to .clang-format
* fix merge error
* fix merge conflict
* fix merge error
* drop style in .phony list
* Update .clang-format
Co-authored-by: Jed Brown <jed@jedbrown.org>
* apply updated format
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| #
01005eab
|
| 30-Aug-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1053 from CEED/natalie/fix-magma-jit-mem
Fix small memory leaks in JIT source code management
|
| #
03f90b05
|
| 26-Aug-2022 |
nbeams <246972+nbeams@users.noreply.github.com> |
magma: free memory used in loading jit kernel source
|
| #
cb03979f
|
| 28-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1002 from CEED/jeremy/format-ceed-types
Add CeedInt_FMT
|
| #
13f886e9
|
| 28-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - consistent use of CeedDebug vs CeedDebug256
|
| #
990fdeb6
|
| 21-Jun-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
fmt - add CeedInt_FMT
|