| de84fe53 | 25-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1698 from CEED/jeremy/debug-cleanup
Truncate jit output |
| c11e12f4 | 23-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
qf - require CeedOpGetQF to be Destroyed |
| 9bc66399 | 22-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
ceed - require *GetCeed ceed to be Destroyed |
| 26ef7cda | 22-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - truncate jit output |
| 9c25dd66 | 18-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda/hip - use new include pattern for JiT |
| 681d0ea7 | 19-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - ReferenceCopy for CeedOperatorFieldGet* |
| c43990d0 | 10-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - simplification |
| 8b0f7348 | 10-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - clean up small leaks |
| 54404f0b | 03-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fix atpoints evec for diag incompatibility |
| 19a04db8 | 26-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - only overwrite portion of basis target used |
| 8bbba8cd | 26-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - use cached work vectors across operators |
| 034f99fd | 25-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - further ref refactoring |
| 43e13fee | 24-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - refactor ref operator |
| 3e961e14 | 19-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - drop unneeded casts |
| 9e511c80 | 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
atpoints - copy when *not* the same
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu> |
| 111870fe | 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
AtPoints - fix transpose basis apply on GPU |
| 2e88d319 | 03-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - counting points correctly |
| 8a213570 | 26-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - reuse evecs for AtPoints where able |
| 41655a23 | 22-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - reuse evecs where able |
| 5a5594ff | 22-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - fix CeedCall() vs CeedCallBackend() in backend code |
| f8a0df59 | 21-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Skip duplicate transpose restrictions (#1645)
* cpu - skip duplicate output rstr
* cuda - skip duplicate output rstr
* hip - skip duplicate output rstr |
| db2becc9 | 13-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Add CeedBasisApplyAdd (#1644)
* basis - add CeedBasisApplyAdd + CPU impl
* basis - add ref GPU ApplyAdd
* basis - add shared GPU ApplyAdd
* basis - add MAGMA ApplyAdd
* basis - add CeedBasisApplyAddAtPoints + default impl
* basis - add GPU ApplyAddAtPoints
* tidy - add extra assert to fix clang-tidy
* Apply suggestions from code review
style - consistently use indexing over pointer arithmetic
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
* style - more pointer fixes
---------
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
|
| 3aab95c0 | 05-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - minor performance improvement for op with repeat input rstr |
| 13062808 | 02-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
atpoints - remove some extra operations |
| 86e10729 | 02-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
atpoints - fix diagonal bug with stale qvec data |