History log of /libCEED/backends/ (Results 76 – 100 of 1139)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
a61b1c9117-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - small fixes

efa41df314-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

fix - harmless warnings

74398b5a14-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - add mixed gen

8014c5e714-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - set default dim to max

259057ed14-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - fix flattened indexing

c8e372f013-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - add 3D mixed support

c433aabc11-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - fix 2D flattening

412e568328-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - use 2d Flat variants in gen

343e309426-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - isolate core 2D tensor logic to allow flat version

f725b54b26-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - add P_1D to template args for AtPoints

90c3037418-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - use blocksize of 1 elem AtPoints

28c1f74713-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - log error to debug on JiT try & fail

6b92dc4b10-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - use BASIS_T_1D in codegen

9942127910-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - use BASIS_T_1D in codegen

826538b307-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - restrict input/output array pointers

59fa3f9206-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - use field names for clarity

0c8fbeed26-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen should use GetArray over GetArrayWrite

087855af24-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen put suboperators on separate streams

c99afcd824-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen ApplyAdd functions

e9c76bdd19-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - allow running shared kernels on stream

ea04d07f11-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - isolate gen ApplyAdd inner logic

a877229113-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - fix bug, need to actually get kernels

af0e6e8913-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - add Transpose/TransposeAdd variants for AtPoints

5a05fad612-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1750 from CEED/jeremy/no-handroll-blas

gpu - prefer cu/hipBlas over handrolls

e84c3ebc12-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - prefer cu/hipBlas over handrolls

12345678910>>...46