History log of /libCEED/backends/ (Results 501 – 525 of 1139)
Revision Date Author Comments
c47bfe2b 16-Feb-2022 Jed Brown <jed@jedbrown.org>

backends/cuda-shared: limit 1D thread counts

We need to avoid this error:

CUDA_ERROR_LAUNCH_OUT_OF_RESOURCES: max_threads_per_block 512 on block size (24,1,32), shared_size 0, num_regs 106

A proper solution is to use cuOccupancyMaxPotentialBlockSize to place a
number of elements per block that stays within resource limits. This
would involve a bit more refactoring to do cleanly.
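
The arithmetic behind the error can be sketched in plain C. This is a minimal illustration, not the libCEED implementation: the helper names `BlockThreads` and `ClampElemsPerBlock` are hypothetical, and it assumes the third block dimension counts elements per block. The block size (24,1,32) in the error message requests 24 * 1 * 32 = 768 threads against a reported limit of 512, so the element count has to be reduced until the product fits.

```c
#include <assert.h>
#include <stddef.h>

// Hypothetical helper (not part of libCEED): total threads in a 3D thread block.
static size_t BlockThreads(size_t x, size_t y, size_t z) { return x * y * z; }

// Hypothetical helper: clamp the requested elements per block so that
// thread_1d * 1 * elems stays within max_threads_per_block.
// Always returns at least 1 so a launch is still attempted.
static size_t ClampElemsPerBlock(size_t thread_1d, size_t requested_elems, size_t max_threads_per_block) {
  assert(thread_1d > 0);  // a block needs at least one thread per element
  size_t limit = max_threads_per_block / thread_1d;
  if (limit < 1) limit = 1;
  return requested_elems < limit ? requested_elems : limit;
}
```

With the values from the error message, `ClampElemsPerBlock(24, 32, 512)` gives 21 elements per block, i.e. 24 * 21 = 504 threads. A static clamp like this only respects the thread-count limit; the occupancy query named in the commit message would also account for register and shared-memory usage.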

63d3996f 16-Feb-2022 Jed Brown <jed@jedbrown.org>

backends/cuda: more informative error reporting

f190906a 16-Feb-2022 Jed Brown <jed@jedbrown.org>

xsmm: support for 1.17 headers

86e1ed65 05-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

Add launch bounds to HIP QFunction kernels

f71aa81b 01-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

add launch bounds to magma kernels;
add macro definition for y-dim of magma basis kernel threadblocks

Co-authored-by: Ahmad Abdelfattah <ahmad@icl.utk.edu>

b3c5430c 01-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

Add flag to use atomic adds on supported AMD GPU hardware

000294e3 04-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

updated ceed-ref-restriction.c

b435c5a6 04-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

Added CeedElemRestrictionIsOriented function

fc0567d9 01-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

backends/ref: Added oriented element restriction.


/libCEED/.gitlab-ci.yml
ref/ceed-ref-restriction.c
ref/ceed-ref.c
ref/ceed-ref.h
/libCEED/doc/sphinx/source/.gitignore
/libCEED/doc/sphinx/source/releasenotes.md
/libCEED/examples/fluids/README.md
/libCEED/examples/fluids/navierstokes.c
/libCEED/examples/fluids/navierstokes.h
/libCEED/examples/fluids/problems/advection.c
/libCEED/examples/fluids/problems/advection2d.c
/libCEED/examples/fluids/problems/densitycurrent.c
/libCEED/examples/fluids/problems/eulervortex.c
/libCEED/examples/fluids/qfunctions/advection.h
/libCEED/examples/fluids/qfunctions/advection2d.h
/libCEED/examples/fluids/qfunctions/eulervortex.h
/libCEED/examples/fluids/src/cloptions.c
/libCEED/examples/fluids/src/setupdm.c
/libCEED/examples/fluids/src/setuplibceed.c
/libCEED/examples/petsc/README.md
/libCEED/examples/petsc/area.c
/libCEED/examples/petsc/area.h
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/bps.h
/libCEED/examples/petsc/bpssphere.c
/libCEED/examples/petsc/bpssphere.h
/libCEED/examples/petsc/include/areaproblemdata.h
/libCEED/examples/petsc/include/bcfunctions.h
/libCEED/examples/petsc/include/bpsproblemdata.h
/libCEED/examples/petsc/include/libceedsetup.h
/libCEED/examples/petsc/include/matops.h
/libCEED/examples/petsc/include/petscutils.h
/libCEED/examples/petsc/include/petscversion.h
/libCEED/examples/petsc/include/sphereproblemdata.h
/libCEED/examples/petsc/include/structs.h
/libCEED/examples/petsc/multigrid.c
/libCEED/examples/petsc/src/petscutils.c
/libCEED/examples/solids/README.md
/libCEED/examples/solids/elasticity.c
/libCEED/examples/solids/elasticity.h
/libCEED/examples/solids/include/boundary.h
/libCEED/examples/solids/include/cl-options.h
/libCEED/examples/solids/include/matops.h
/libCEED/examples/solids/include/misc.h
/libCEED/examples/solids/include/setup-dm.h
/libCEED/examples/solids/include/setup-libceed.h
/libCEED/examples/solids/include/structs.h
/libCEED/examples/solids/include/utils.h
/libCEED/examples/solids/src/misc.c
/libCEED/examples/solids/src/setup-dm.c
/libCEED/gallery/identity/ceed-identity.c
/libCEED/include/ceed-impl.h
/libCEED/include/ceed/backend.h
/libCEED/include/ceed/ceed.h
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-qfunctioncontext.c
/libCEED/interface/ceed-types.c
/libCEED/interface/ceed.c
/libCEED/tests/t330-basis.c
/libCEED/tests/t330-basis.h
/libCEED/tests/t407-qfunction.c
/libCEED/tests/t525-operator.c
/libCEED/tests/tap.sh
50c301a5 01-Feb-2022 Rezgar Shakeri <42816410+rezgarshakeri@users.noreply.github.com>

H(div) basis constructor (#878)

* backends/interface: added H(div) basis constructor

* tests: CEED_EVAL_INTERP/DIV tested for H(div) basis.

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* interface/ceed-basis.c: delete [-1,1] for non-tensor basis constructor

* Update interface/ceed-basis.c

Co-authored-by: Jed Brown <jed@jedbrown.org>

* include/ceed: added CEEDFESpace enum and used it at ceed-basis.c

* tests: added t330-basis.out and changed xhat-->x_hat

* Changed name convention for CeedElemTopology and CeedFESpace enum.

* make style

* interface/ceed-basis.c: Added FE space/topology of the basis to CeedBasisView function

* tests/output: update ...basis.out file

* Changed CEED_FE_TOPO_ ==> CEED_TOPOLOGY_

* backends/ref: delete CeedBasisApplyHdiv_Ref

* Update interface/ceed-basis.c

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

* Update interface/ceed-basis.c

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

* Update include/ceed-impl.h

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

* interface/ceed-basis.c: update CeedBasisView

* deleted L2 and H(curl) FE spaces

* minor - update output file

* minor - update output file

* style - consistency with main

Co-authored-by: Jed Brown <jed@jedbrown.org>
Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

b3e1519b 31-Jan-2022 nbeams <246972+nbeams@users.noreply.github.com>

Add launch bounds to hip-gen operator kernel

318af0d1 26-Jan-2022 Jeremy L Thompson <jeremy@jeremylt.org>

xsmm - add compatibility for xsmm > 1.17


/libCEED/.gitlab-ci.yml
/libCEED/Doxyfile
/libCEED/Makefile
xsmm/ceed-xsmm-tensor-f32.c
xsmm/ceed-xsmm-tensor-f64.c
xsmm/ceed-xsmm.h
/libCEED/doc/sphinx/source/api/CeedQFunction.rst
/libCEED/examples/fluids/index.md
/libCEED/examples/fluids/src/setupdm.c
/libCEED/examples/fluids/src/setuplibceed.c
/libCEED/examples/fluids/src/setupts.c
/libCEED/examples/petsc/src/libceedsetup.c
/libCEED/examples/solids/include/structs.h
/libCEED/examples/solids/problems/finite-strain-mooney-rivlin-initial-1.c
/libCEED/examples/solids/problems/finite-strain-neo-hookean-current-1.c
/libCEED/examples/solids/problems/finite-strain-neo-hookean-current-2.c
/libCEED/examples/solids/problems/finite-strain-neo-hookean-initial-1.c
/libCEED/examples/solids/problems/finite-strain-neo-hookean-initial-2.c
/libCEED/examples/solids/problems/linear.c
/libCEED/examples/solids/problems/small-strain-neo-hookean.c
/libCEED/examples/solids/src/setup-libceed.c
/libCEED/include/ceed/ceed.h
/libCEED/interface/ceed-preconditioning.c
/libCEED/interface/ceed.c
/libCEED/rust/libceed/Cargo.toml
/libCEED/rust/libceed/README.md
/libCEED/rust/libceed/src/basis.rs
/libCEED/rust/libceed/src/elem_restriction.rs
/libCEED/rust/libceed/src/lib.rs
/libCEED/rust/libceed/src/operator.rs
/libCEED/rust/libceed/src/qfunction.rs
/libCEED/rust/libceed/src/vector.rs
/libCEED/tests/output/t504-operator-f.out
/libCEED/tests/output/t504-operator.out
/libCEED/tests/output/t523-operator-f.out
/libCEED/tests/output/t523-operator.out
/libCEED/tests/t500-operator-f.f90
/libCEED/tests/t500-operator.c
/libCEED/tests/t501-operator-f.f90
/libCEED/tests/t501-operator.c
/libCEED/tests/t502-operator-f.f90
/libCEED/tests/t502-operator.c
/libCEED/tests/t503-operator-f.f90
/libCEED/tests/t503-operator.c
/libCEED/tests/t504-operator-f.f90
/libCEED/tests/t504-operator.c
/libCEED/tests/t505-operator-f.f90
/libCEED/tests/t505-operator.c
/libCEED/tests/t506-operator-f.f90
/libCEED/tests/t506-operator.c
/libCEED/tests/t507-operator.c
/libCEED/tests/t510-operator-f.f90
/libCEED/tests/t510-operator.c
/libCEED/tests/t511-operator-f.f90
/libCEED/tests/t511-operator.c
/libCEED/tests/t520-operator-f.f90
/libCEED/tests/t520-operator.c
/libCEED/tests/t521-operator-f.f90
/libCEED/tests/t521-operator.c
/libCEED/tests/t522-operator-f.f90
/libCEED/tests/t522-operator.c
/libCEED/tests/t523-operator-f.f90
/libCEED/tests/t523-operator.c
/libCEED/tests/t524-operator-f.f90
/libCEED/tests/t524-operator.c
/libCEED/tests/t530-operator-f.f90
/libCEED/tests/t530-operator.c
/libCEED/tests/t531-operator-f.f90
/libCEED/tests/t531-operator.c
/libCEED/tests/t532-operator-f.f90
/libCEED/tests/t532-operator.c
/libCEED/tests/t533-operator-f.f90
/libCEED/tests/t533-operator.c
/libCEED/tests/t534-operator-f.f90
/libCEED/tests/t534-operator.c
/libCEED/tests/t535-operator-f.f90
/libCEED/tests/t535-operator.c
/libCEED/tests/t536-operator-f.f90
/libCEED/tests/t536-operator.c
/libCEED/tests/t537-operator.c
/libCEED/tests/t540-operator-f.f90
/libCEED/tests/t540-operator.c
/libCEED/tests/t550-operator-f.f90
/libCEED/tests/t550-operator.c
/libCEED/tests/t551-operator.c
/libCEED/tests/t552-operator-f.f90
/libCEED/tests/t552-operator.c
/libCEED/tests/t560-operator.c
/libCEED/tests/t561-operator.c
/libCEED/tests/t562-operator.c
/libCEED/tests/t563-operator.c
/libCEED/tests/t564-operator.c
9faa5937 19-Jan-2022 Natalie Beams <246972+nbeams@users.noreply.github.com>

Slight modifications for hiprtc usage in ROCm 4.5 (#850)

a81c17c8 18-Jan-2022 Jeremy L Thompson <jeremy@jeremylt.org>

cpu - drop some empty TensorContract destructors

706efda3 18-Jan-2022 Jeremy L Thompson <jeremy@jeremylt.org>

vec/ctx - make backend restore function optional

d7d111ec 23-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - style consistency

43c928f4 23-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - minor style

46dc0734 23-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - improved human-readability of debugging output

437930d1 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - pull quoted kernels into separate files

0d0321e0 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

style - consistent naming and style for gpu backends

f87d896c 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - drop unused argument in init

d7c981f3 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - drop unused [HIP, CUDA]_MAX_PATH constant

7fcac036 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - split common cuda/hip data into separate folder

6d69246a 21-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - separate compile functionality into new header

030d8ffd 20-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

cpu - fall through now on INTERP/GRAD cases
