History log of /libCEED/backends/ (Results 176 – 200 of 1139)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
bf84744c22-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

leak - add missing CeedFree for string

5a5594ff22-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

minor - fix CeedCall() vs CeedCallBackend() in backend code

f8a0df5921-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Skip duplicate transpose restrictions (#1645)

* cpu - skip duplicate output rstr

* cuda - skip duplicate output rstr

* hip - skip duplicate output rstr

4b3e95d521-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

GPU Gen Reorganize (#1637)

* cuda - pull out basis setup for gen

* cuda - functions for adding basis, rstr gen actions

* cuda - pull QFunction logic into separate fn for gen

* cuda - minor

GPU Gen Reorganize (#1637)

* cuda - pull out basis setup for gen

* cuda - functions for adding basis, rstr gen actions

* cuda - pull QFunction logic into separate fn for gen

* cuda - minor formatting

* cuda - fix basis errorr

* cuda - rename collograd_parallelization to 3d_slices

* cuda - another gen setup function separated

* hip - update gen source building to match cuda

* gpu - fix min size of QF inputs for gen

show more ...

db2becc913-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Add CeedBasisApplyAdd (#1644)

* basis - add CeedBasisApplyAdd + CPU impl

* basis - add ref GPU ApplyAdd

* basis - add shared GPU ApplyAdd

* basis - add MAGMA ApplyAdd

* basis - add CeedB

Add CeedBasisApplyAdd (#1644)

* basis - add CeedBasisApplyAdd + CPU impl

* basis - add ref GPU ApplyAdd

* basis - add shared GPU ApplyAdd

* basis - add MAGMA ApplyAdd

* basis - add CeedBasisApplyAddAtPoints + default impl

* basis - add GPU ApplyAddAtPoints

* tidy - add extra assert to fix clang-tidy

* Apply suggestions from code review

style - consistently use indexing over pointer arithmatic

Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>

* style - more pointer fixes

---------

Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>

show more ...


cuda-ref/ceed-cuda-ref-basis.c
cuda-shared/ceed-cuda-shared-basis.c
cuda-shared/ceed-cuda-shared.h
hip-ref/ceed-hip-ref-basis.c
hip-shared/ceed-hip-shared-basis.c
hip-shared/ceed-hip-shared.h
magma/ceed-magma-basis.c
magma/ceed-magma.h
ref/ceed-ref-basis.c
/libCEED/include/ceed-impl.h
/libCEED/include/ceed/ceed.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-basis-nontensor-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-basis-tensor.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-operator-assemble.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-restriction-at-points.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-restriction-curl-oriented.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-restriction-offset.h
/libCEED/include/ceed/jit-source/cuda/cuda-ref-restriction-oriented.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-read-write-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor-templates.h
/libCEED/include/ceed/jit-source/cuda/cuda-shared-basis-tensor.h
/libCEED/include/ceed/jit-source/hip/hip-ref-basis-nontensor-templates.h
/libCEED/include/ceed/jit-source/hip/hip-ref-basis-tensor-at-points.h
/libCEED/include/ceed/jit-source/hip/hip-ref-basis-tensor.h
/libCEED/include/ceed/jit-source/hip/hip-ref-operator-assemble.h
/libCEED/include/ceed/jit-source/hip/hip-ref-restriction-at-points.h
/libCEED/include/ceed/jit-source/hip/hip-ref-restriction-curl-oriented.h
/libCEED/include/ceed/jit-source/hip/hip-ref-restriction-offset.h
/libCEED/include/ceed/jit-source/hip/hip-ref-restriction-oriented.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-read-write-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor-templates.h
/libCEED/include/ceed/jit-source/hip/hip-shared-basis-tensor.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-1d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-2d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-3d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-1d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-2d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-3d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-deriv-nontensor.h
/libCEED/include/ceed/jit-source/magma/magma-common-nontensor.h
/libCEED/include/ceed/jit-source/magma/magma-common-tensor.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed.c
/libCEED/tests/README.md
/libCEED/tests/t360-basis.c
/libCEED/tests/t361-basis.c
/libCEED/tests/t362-basis.c
/libCEED/tests/t363-basis.c
/libCEED/tests/t364-basis.c
/libCEED/tests/t365-basis.c
3aab95c005-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

op - minor performance improvement for op with repeat input rstr

1306280802-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

atpoints - remove some extra operations

86e1072902-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

atpoints - fix diagonal bug with stale qvec data

382e9c8302-Aug-2024 Jeremy L Thompson <jeremy@jeremylt.org>

atPoints - fix diagonal assembly for mixed

9b443e3b16-Jul-2024 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - minimum input/output array size of 1

0a5597ce11-Jul-2024 Jeremy L Thompson <jeremy@jeremylt.org>

op - cast to CeedSize when creating rstr

afe3bc8a28-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

op - HIP diagonal assembly AtPoints

349fb27d28-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

op - CUDA diagonal assembly AtPoints

67d9480a20-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

hip - add AtPoints CeedOperator

756ca9e920-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - add AtPoints CeedOperator

ad8059fc10-Jul-2024 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - reduce write conflits for AtPoints basis operations

14950a8e21-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

magma - explicitly exclude BasisApplyAtPoints

f7c9815f20-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

AtPoints - ease memory requirement

2d10e82c17-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

AtPoints - fix gpu thread usage

1dda9c1a17-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - add intial AtPoints to shared mem backends, but using ref impl

1c21e86911-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

hip - add BasisApplyAtPoints

34d1461430-May-2024 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - impl BasisApplyAtPoints

958e607d28-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

ref - drop unused variables in OpAtPoints

c122271124-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - skip unneeded restrictions in OpApply

a7efc11424-Jun-2024 Jeremy L Thompson <jeremy@jeremylt.org>

vec - use min of 2 lengths for gpu impl of CopyStrided

12345678910>>...46