| 18c38aee | 12-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - make tidy happy about leak |
| c9d5affa | 12-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor consistency |
| 124cc107 | 12-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
vec - use memset on GPU when SetValue for 0 |
| 759e0bc3 | 11-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - style consistency |
| 9dafd6df | 27-Jan-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
Vector API compliance for CUDA backends |
| 11ac676f | 24-Jan-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Update hip basis code to conform to vector interfaces |
| 3efc994b | 11-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - fix minor leak |
| 45a787f7 | 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - use struct over array for clarity |
| 9ee499e5 | 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - remove duplicate mats in gen |
| 0a2a6492 | 06-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - remove duplicate mats in gen |
| c9192aca | 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - swap out bitwise assignment operators for bools
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu> |
| 8d12f40e | 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - gen fallback to shared if error |
| ddae5012 | 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - gen fallback to shared if error |
| f82027a4 | 30-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - update gen non-tensor block strategy |
| 9123fb08 | 29-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - nontensor gen operators |
| dc007f05 | 27-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - nontensor gen operators |
| cc3bdf8c | 16-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
vec - update SetArray to keep old arrays for CEED_COPY_VALUES |
| 4cbc44e0 | 15-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - check for interp, grad in shared backends |
| 79881bbe | 14-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1727 from CEED/zach/hip-nontensor-fix
Fix issue in block sizing for GPU shared basis |
| 97011eab | 14-Jan-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Fix issue in block sizing for GPU shared basis |
| fda26546 | 14-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fallback if nontensor shared uses too much mem |
| 2d217acf | 13-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - fix missing template, compile values, fn names |
| 1f6c24fe | 07-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
test - shrink sizes in t319 for non-tensor |
| 6c13bbcb | 07-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - add nontensor shared |
| aa4002ad | 03-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - use gen LoadMatrix in shared |