| 11ac676f | 24-Jan-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Update hip basis code to conform to vector interfaces |
| 9123fb08 | 29-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - nontensor gen operators |
| 81ae6159 | 11-Nov-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - split AtPoints basis between Transpose/no |
| de84fe53 | 25-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1698 from CEED/jeremy/debug-cleanup
Truncate jit output |
| c11e12f4 | 23-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
qf - require CeedOpGetQF to be Destroyed |
| 9bc66399 | 22-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
ceed - require *GetCeed ceed to be Destroyed |
| 26ef7cda | 22-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - truncate jit output |
| 9c25dd66 | 18-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda/hip - use new include pattern for JiT |
| 681d0ea7 | 19-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - ReferenceCopy for CeedOperatorFieldGet* |
| c43990d0 | 10-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - simplification |
| 8b0f7348 | 10-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - clean up small leaks |
| 54404f0b | 03-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fix atpoints evec for diag incompatibility |
| 19a04db8 | 26-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - only overwrite portion of basis target used |
| 8bbba8cd | 26-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - use cached work vectors across operators |
| 034f99fd | 25-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - further ref refactoring |
| 43e13fee | 24-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - refactor ref operator |
| 3e961e14 | 19-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - drop unneeded casts |
| 9e511c80 | 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
atpoints - copy when *not* the same
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu> |
| 111870fe | 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
AtPoints - fix transpose basis apply on GPU |
| 2e88d319 | 03-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - counting points correctly |
| 8a213570 | 26-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - reuse evecs for AtPoints where able |
| 41655a23 | 22-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - reuse evecs where able |
| 5a5594ff | 22-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - fix CeedCall() vs CeedCallBackend() in backend code |
| f8a0df59 | 21-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Skip duplicate transpose restrictions (#1645)
* cpu - skip duplicate output rstr
* cuda - skip duplicate output rstr
* hip - skip duplicate output rstr |
| db2becc9 | 13-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Add CeedBasisApplyAdd (#1644)
* basis - add CeedBasisApplyAdd + CPU impl
* basis - add ref GPU ApplyAdd
* basis - add shared GPU ApplyAdd
* basis - add MAGMA ApplyAdd
* basis - add CeedBasisApplyAddAtPoints + default impl
* basis - add GPU ApplyAddAtPoints
* tidy - add extra assert to fix clang-tidy
* Apply suggestions from code review
style - consistently use indexing over pointer arithmetic
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
* style - more pointer fixes
---------
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
|