| #
97011eab
|
| 14-Jan-2025 |
Zach Atkins <Zach.Atkins@colorado.edu> |
Fix issue in block sizing for GPU shared basis
|
| #
fda26546
|
| 14-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fallback if nontensor shared uses too much mem
|
| #
d01feaa6
|
| 13-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1724 from CEED/jeremy/fix-hip-shared
Fix hip/shared NonTensor
|
| #
2d217acf
|
| 13-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - fix missing template, compile values, fn names
|
| #
1a63be7e
|
| 09-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1721 from CEED/jeremy/shared-nontensor
Add non-tensor shared
|
| #
1f6c24fe
|
| 07-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
test - shrink sizes in t319 for non-tensor
|
| #
6c13bbcb
|
| 07-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - add nontensor shared
|
| #
aa4002ad
|
| 03-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - use gen LoadMatrix in shared
|
| #
5a7f61ca
|
| 11-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1717 from CEED/jeremy/fix-hip-shared-atpoints
Fix hip shared atpoints
|
| #
b4280a96
|
| 11-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
hip - reduce elem per block for 3d shared AtPoints basis
|
| #
290fc47b
|
| 02-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1711 from CEED/jeremy/shared-at-points
GPU Shared AtPoints Bases
|
| #
a8d440fb
|
| 02-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - simplify shared grid counting co-authored-by: zatkins-dev <zach.atkins@colorado.edu>
|
| #
9e1d4b82
|
| 07-Nov-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - shared AtPoints
|
| #
be8d6f55
|
| 12-Nov-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1710 from CEED/jeremy/split-at-points
Split AtPoints basis between Transpose/no
|
| #
81ae6159
|
| 11-Nov-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - split AtPoints basis between Transpose/no
|
| #
e3ae47f6
|
| 23-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1702 from CEED/jeremy/get-ceed-object
Ceed*Get[CeedObject] Needs Destroy
|
| #
9bc66399
|
| 22-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
ceed - require *GetCeed ceed to be Destroyed
|
| #
1dc8b1e6
|
| 21-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1696 from CEED/jeremy/jit-include
JiT include update
|
| #
9c25dd66
|
| 18-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda/hip - use new include pattern for JiT
|
| #
bdd4742d
|
| 02-Oct-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1673 from CEED/jeremy/use-work-vecs
GPU Operators use work vectors
|
| #
19a04db8
|
| 26-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - only overwite portion of basis target used
|
| #
25c4e04a
|
| 05-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1655 from CEED/jeremy/at-points-transpose
AtPoints - fix transpose basis apply on GPU
|
| #
9e511c80
|
| 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
atpoints - copy when *not* the same
Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
|
| #
111870fe
|
| 04-Sep-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
AtPoints - fix transpose basis apply on GPU
|
| #
bd7a0ce7
|
| 22-Aug-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1647 from CEED/jeremy/fix-call
Small Bugfixes
|