| 6d5e61d4 | 20-Dec-2023 |
Sebastian Grimberg <sjg@amazon.com> |
make format |
| bf9b6c6b | 20-Dec-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Strided transpose element restriction does not require atomic adds |
| 0836ce8d | 12-Dec-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Minor function signature fix caught with ASan |
| 51d50b59 | 07-Dec-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
memcheck - match nan check for QFunction field size |
| e910d748 | 08-Dec-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cpu - fix memcheck detection of uninitalized outputs |
| 38e83183 | 08-Dec-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
swarm - fix op swarm input access |
| 33bb61d4 | 05-Dec-2023 |
Kris Rowe <kris.rowe@anl.gov> |
Replace calls to deprecated `local_accessor::get_pointer()` |
| 78cbaa95 | 30-Nov-2023 |
Sebastian Grimberg <sebastiangrimb@gmail.com> |
Merge pull request #1300 from CEED/sjg/hcurl-hdiv-basis-cuda-hip
H(div) and H(curl) basis support for `cuda-ref` and `hip-ref` backends |
| 48acf710 | 30-Nov-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
OperatorAtPoints (#1390)
* op - add AtPoints interface
* op - reorder AtPoints QFunction for consistency
* op - add CEED_EVAL_WEIGHT for operator AtPoints
* doc - add CeedOperatorCreateAtPo
OperatorAtPoints (#1390)
* op - add AtPoints interface
* op - reorder AtPoints QFunction for consistency
* op - add CEED_EVAL_WEIGHT for operator AtPoints
* doc - add CeedOperatorCreateAtPoints to release notes
show more ...
|
| 9c34f28e | 28-Nov-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
rstr - reorder AtPoints ordering for consistency |
| d075f50b | 11-Aug-2023 |
Sebastian Grimberg <sjg@amazon.com> |
H(div) and H(curl) basis support for cuda-ref and hip-ref backends |
| 93f4dbf1 | 07-Sep-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Remove legacy CeedChk and CeedChkBackend (should use CeedCall, CeedCallBackend instead) |
| d5c4fb64 | 03-Nov-2023 |
nbeams <246972+nbeams@users.noreply.github.com> |
Add updated MI100 tuning data; make style updates for tuning data |
| e7270687 | 03-Nov-2023 |
nbeams <246972+nbeams@users.noreply.github.com> |
Add updated A100 tuning data |
| 922f4715 | 03-Nov-2023 |
nbeams <246972+nbeams@users.noreply.github.com> |
Add updated MI250X tuning data |
| dc215721 | 27-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Add RTC tuning data for V100 and H100 |
| 78d85032 | 17-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Increment NB by 2 or 4 to speed up tuning |
| 47cb5789 | 13-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Add 2*p quadrature rules to tuning data (previously was just 2*(p-1) |
| 4c387b0f | 05-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Add documentation for MAGMA autotuning |
| 7c7f2ed8 | 04-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
New RTC tuning data format |
| acc0bb12 | 04-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Fixes and improvements to autotuning code |
| 26bdecf3 | 04-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Add Python program to automatically generate tuning data for MAGMA RTC kernels |
| ac8b7a1c | 04-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Add simple test program to generate benchmark data for CeedBasis operations |
| 90a4cd18 | 27-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Whitespace formatting consistency with libCEED style |
| 7132caa0 | 20-Oct-2023 |
Sebastian Grimberg <sjg@amazon.com> |
Remove need to zero out V vector before applying basis transpose for magma backends |