| 8da1e4ae | 18-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1856 from CEED/zach/remove-op-fallback-parent
ceed - replace usage of `ceed->op_fallback_parent` with `ceed->parent` |
| 46b50f9e | 16-Jul-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
op - Simplify operator fallback to use pre-initialized fallback Ceed context |
| 0db52efc | 16-Jul-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
Reduce code duplication |
| 5da5ab9f | 16-Jul-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
op - Add CeedOperatorApplyAddActive for only summing into active outputs |
| 73b5a3bf | 15-Jul-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
ceed - replace usage of `ceed->op_fallback_parent` with `ceed->parent` |
| 27a8a650 | 12-Jul-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Fix grad basis flop counts |
| dc7b9553 | 11-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor reduction in AtPoints grad FLOPs |
| 802d760a | 10-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor reordering to reduce at-points flops |
| a82cd097 | 09-Jul-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Update flop counts to match GPU templates for 3D |
| 52780386 | 09-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
basis - fix flop counting for gpu at-points |
| c21e34e2 | 09-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - color consistency |
| ca38d01d | 09-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
debug - clearer debug messages for fallback control flow |
| c8758636 | 09-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - fix debug print args |
| 0816752e | 02-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - QFunction assembly for gen |
| 025ec10c | 27-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - consistency with &= |
| 120566fc | 26-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
pc - clearer debug message |
| d3d5610d | 26-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - add GetName helper |
| 54d16554 | 23-Jun-2025 |
Hugh Carson <hughcars@amazon.com> |
Add missing early return |
| a34b87f3 | 18-Jun-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
Update logic for CeedOperatorLinearAssemble to preference single operator assembly over fallback |
| 915834c9 | 17-Jun-2025 |
Zach Atkins <zach.atkins@colorado.edu> |
gen - full assembly at points for CUDA |
| 0183ed61 | 17-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
GPU Assembly AtPoints (#1833)
* cuda - AtPoints diagonal assembly for gen
* hip - AtPoints diagonal assembly for gen
* pc - use subops for LinearAssemble[Add]Diagonal if composite
* gen - turn more numbers into named variables
* gen - fix alignment for assembly
* gen - check for only one active basis in/out
* HIP gen at points syntax error fixes
* hip - embarassing fix
* gen - add Tab helper to manage indentation
---------
Co-authored-by: Zach Atkins <Zach.Atkins@colorado.edu>
|
| d538d163 | 16-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
doc - improve internal API documentation (#1834)
* doc - improve internal API documentation
* doc - clarify delegation
* doc - update wording for clarity
Co-authored-by: James Wright <james@jameswright.xyz>
* doc - clarify memcheck==valgrind
* doc - clarify shared gpu backends
* doc - clarify l/e/q-vecs
* doc - clarify occa backend status
* doc - more inheritance clarification
* doc - minor claification to QF fields
* minor - remove unneeded variables
* doc - update inheritance language
* doc - update shared description
* doc - more gpu notes
* doc - update fallback description
* doc - clarify fallback is only for pc support
* doc - minor updates to user facing GPU section
* doc - minor reordering of major sections
---------
Co-authored-by: James Wright <james@jameswright.xyz>
|
| ec4241e6 | 28-May-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
doc - add missing function param |
| fd326ce8 | 04-May-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Use backend delegates for work vectors |
| 99641342 | 28-Apr-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
pc - multigrid level setup for AtPoints |