History log of /libCEED/backends/cuda-gen/ (Results 26 – 50 of 199)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
c8e372f013-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - add 3D mixed support

c433aabc11-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - fix 2D flattening

412e568328-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - use 2d Flat variants in gen

343e309426-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - isolate core 2D tensor logic to allow flat version

f725b54b26-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - add P_1D to template args for AtPoints

90c3037418-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - use blocksize of 1 elem AtPoints

9942127910-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - use BASIS_T_1D in codegen

826538b307-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - restrict input/output array pointers

59fa3f9206-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gen - use field names for clarity

0c8fbeed26-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen should use GetArray over GetArrayWrite

087855af24-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen put suboperators on separate streams

c99afcd824-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - gen ApplyAdd functions

e9c76bdd19-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - allow running shared kernels on stream

ea04d07f11-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - isolate gen ApplyAdd inner logic

45a787f707-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - use struct over array for clarity

0a2a649206-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - remove duplicate mats in gen

c9192aca07-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - swap out bitwise assignment operators for bools

Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>

8d12f40e07-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - gen fallback to shared if error

ddae501207-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - gen fallback to shared if error

f82027a430-Jan-2025 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - update gen non-tensor block strategy

9123fb0829-Jan-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - nontensor gen operators

dc007f0527-Jan-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - nontensor gen operators


/libCEED/.github/workflows/c-fortran-test-arm64.yml
/libCEED/.github/workflows/c-fortran-test-ppc64le.yml
/libCEED/.gitlab-ci.yml
ceed-cuda-gen-operator-build.cpp
ceed-cuda-gen-operator.c
/libCEED/backends/cuda-ref/ceed-cuda-ref-operator.c
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
/libCEED/backends/cuda/ceed-cuda-common.c
/libCEED/backends/hip-shared/ceed-hip-shared-basis.c
/libCEED/backends/hip-shared/ceed-hip-shared.c
/libCEED/backends/hip-shared/ceed-hip-shared.h
/libCEED/backends/hip/ceed-hip-common.c
/libCEED/backends/memcheck/ceed-memcheck-vector.c
/libCEED/include/ceed/backend.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-nontensor-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-nontensor.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-read-write-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor-at-points-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-nontensor-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-nontensor.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-read-write-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor-at-points-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed.c
/libCEED/python/tests/test-1-vector.py
/libCEED/tests/t128-vector.c
/libCEED/tests/t319-basis.c
3a2968d617-Dec-2024 Jeremy L Thompson <jeremy@jeremylt.org>

hip - AtPoints for hip/gen

8b97b69a05-Dec-2024 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - AtPoints for cuda/gen

f815fac909-Dec-2024 Jeremy L Thompson <jeremy@jeremylt.org>

gen - fun name standardization


ceed-cuda-gen-operator-build.cpp
/libCEED/backends/cuda-ref/ceed-cuda-ref-basis.c
/libCEED/backends/cuda-ref/ceed-cuda-ref.h
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
/libCEED/backends/hip-gen/ceed-hip-gen-operator-build.cpp
/libCEED/backends/hip-ref/ceed-hip-ref-basis.c
/libCEED/backends/hip-ref/ceed-hip-ref.h
/libCEED/backends/hip-shared/ceed-hip-shared-basis.c
/libCEED/backends/hip-shared/ceed-hip-shared.h
/libCEED/examples/fluids/include/mat-ceed-impl.h
/libCEED/examples/fluids/include/mat-ceed.h
/libCEED/examples/fluids/src/differential_filter.c
/libCEED/examples/fluids/src/grid_anisotropy_tensor.c
/libCEED/examples/fluids/src/mat-ceed.c
/libCEED/examples/fluids/src/setuplibceed.c
/libCEED/examples/fluids/src/turb_spanstats.c
/libCEED/examples/fluids/src/velocity_gradient_projection.c
/libCEED/examples/petsc/dmswarm.c
/libCEED/examples/petsc/src/swarmutils.c
/libCEED/include/ceed/backend.h
/libCEED/include/ceed/jit-source/cuda/cuda-gen-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor-at-points-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/hip/hip-gen-templates.h
/libCEED/include/ceed/jit-source/hip/hip-ref-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor-at-points-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/sycl/sycl-types.h
/libCEED/include/ceed/types.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-preconditioning.c
/libCEED/rust/libceed/src/basis.rs
/libCEED/rust/libceed/src/elem_restriction.rs
/libCEED/rust/libceed/src/operator.rs
/libCEED/rust/libceed/src/vector.rs
/libCEED/tests/junit_common.py
/libCEED/tests/t354-basis.c
/libCEED/tests/t595-operator.c
/libCEED/tests/t595-operator.h

12345678