| #
d4cc1845
|
| 30-Dec-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1912 from CEED/jeremy/copyright
minor - update copyright to 2026
|
| #
9ba83ac0
|
| 19-Dec-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright to 2026
|
| #
66c2c381
|
| 18-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1866 from CEED/jeremy/fix-mpm
gpu - fix gen AtPoints transpose
|
| #
49337e26
|
| 18-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fix gen AtPoints transpose
|
| #
2f00a501
|
| 17-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1863 from CEED/jeremy/at-points-tune
AtPoints Tuning
|
| #
c6cb50fa
|
| 17-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - reorder AtPoints shuffle to avoid bank conflicts
|
| #
360be29c
|
| 17-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - simplify atpoints if guards to prevent divergence
|
| #
f1f13db4
|
| 11-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1854 from CEED/jeremy/minor-at-points-update
Minor reduction in AtPoints grad FLOPs
|
| #
dc7b9553
|
| 11-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor reduction in AtPoints grad FLOPs
|
| #
65d13065
|
| 10-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1852 from CEED/jeremy/fix-flop-count
basis - fix flop counting for gpu at-points
|
| #
802d760a
|
| 10-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor reordering to reduce at-points flops
|
| #
d6c19ee8
|
| 17-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - clarify __syncthreads usage (#1838)
|
| #
4b6745b1
|
| 21-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1762 from CEED/jeremy/gen-mixed
Mixed Tensor/NonTensor for Gen
|
| #
20a16a5f
|
| 20-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1786 from CEED/jeremy/copy-headers
minor - update copyright to 2025
|
| #
d275d636
|
| 19-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright to 2025
|
| #
f29bd075
|
| 14-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - drop changes in AtPoints
|
| #
343e3094
|
| 26-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - isolate core 2D tensor logic to allow flat version
|
| #
f725b54b
|
| 26-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - add P_1D to template args for AtPoints
|
| #
83153ffa
|
| 10-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1723 from CEED/jeremy/at-points-shifts
Fix AtPoints transpose shift
|
| #
a24d84ea
|
| 09-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - fix AtPoints transpose shift
|
| #
390feb51
|
| 02-Jan-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1715 from CEED/jeremy/at-points-gen
AtPoints for */gen
|
| #
4eda27c2
|
| 13-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - minor fix to 1d AtPoints basis transpose
|
| #
f4112a4e
|
| 10-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1716 from CEED/jeremy/gen-at-points-pre
Gen AtPoints Updates
|
| #
b6a2eb79
|
| 09-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
shared - AtPoints template changes for gen
|
| #
290fc47b
|
| 02-Dec-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1711 from CEED/jeremy/shared-at-points
GPU Shared AtPoints Bases
|