| 7132caa0 | 20-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Remove need to zero out V vector before applying basis transpose for magma backends |
| 9e0c01fa | 20-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Initial commit to optimize magma backend transpose basis application |
| 833aa127 | 19-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Use shared-memory for transpose non-tensor basis kernel to accelerate load of A matrix |
| 9d15e85b | 18-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
H(div) and H(curl) basis support for magma backend |
| 4018a20a | 25-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
release - update version number and notes |
| 940a72f1 | 10-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Formatting consistency for magma backend with cuda-ref and hip-ref
Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-tensor basis CEED_EVAL_WEIGHT mode.
|
| 3c1e2aff | 11-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Formatting updates for MAGMA JiT kernels |
| f80f4a74 | 09-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Rename files in magma backend for consistency with other libCEED backends |
| 1ff07f3d | 04-Oct-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
ctx - update CeedContextFieldLabelGetDescription with offset |
| a71faab1 | 03-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Remove unused parameter from CeedTensorContractCreate |
| 37eda346 | 29-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - remove CeedOperatorSetNumQuadraturePoints |
| 1c7d1e03 | 20-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1340 from CEED/jeremy/fix-ptsc-orients-copy
Fix CEED_COPY_VALUES for ref rstr at points |
| 506b1a0c | 20-Sep-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Non-square operator full assembly (#1316)
* Consistency formatting for operator tests
* Add (failing) test for non-square operator full assembly
* Add support for CPU-based full assembly of non-square CeedOperators
* Fix full assembly of identity quadrature functions and operators with CEED_BASIS_NONE
Also adds a unit test for full assembly which fails prior to the changes in this commit.
* Minor refactor to improve coverage
|
| 58c07c4f | 20-Sep-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Support CPU shared-memory parallelism with OpenMP (#1279)
* Updates for OpenMP thread safety (one Ceed per thread, potentially with shared output vector)
* Makefile tabs vs. spaces consistency
* Fix unrelated pragma bug for Intel compilers in `backend.h`
* Address PR feedback: Use _OPENMP macro, simplify OpenMP pragma wrappers
* Address PR feedback: Add new statement macro CeedPragmaThreadPrivate to PREDEFINED entries in Doxyfile
* Add OpenMP support to Intel CI workflow for testing
* Add documentation for OPENMP option and Update releasenotes.md
* Revise OpenMP implementation: Rather than enforcing global variables to be threadprivate, just wrap potential race conditions in a critical block (during registration)
* Avoid returning from OpenMP blocks
* Early break on error when registering backends or QFunctions
* Formatting fixes after rebase, newline after variable declarations
* Address PR feedback: Revert some unintentional changes to debug output
* Update codecov exclusions
|
| 07d5dec1 | 20-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
rstr - fix CEED_COPY_VALUES for ref rstr at points |
| 05fa913c | 13-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
rstr - allow restriction to ordered points evec |
| 3ac8f562 | 12-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
rstr - add interface functions for CEED_RESTRICTION_POINTS |
| 363aefef | 14-Sep-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Revert typo from #1315 |
| cfa13e89 | 14-Sep-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Enforce consistent pointer alignment with clang-format |
| 672b0f2a | 14-Sep-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Fix some missing consistency issues from #1315 |
| 21f16bf6 | 13-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1315 from CEED/jeremy/queep
Backend Consistency Fixes |
| ca735530 | 31-Aug-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fixes for CUDA backends |
| 01f0e615 | 18-Jun-2023 |
James Wright <james@jameswright.xyz> |
Add CeedOperatorLinearAssemblePointBlockDiagonalSymbolic |
| 94b7b29b | 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fix header guards |
| 056ec191 | 31-Aug-2023 |
James Wright <james@jameswright.xyz> |
fix: Add `CEED_BASIS_COLLOCATED` to ceed.h
- `CEED_BASIS_COLLOCATED` is deprecated, so should still be available to use |