History log of /libCEED/backends/ (Results 326 – 350 of 1139)
Revision Date Author Comments
833aa127 19-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Use shared-memory for transpose non-tensor basis kernel to accelerate load of A matrix

9d15e85b 18-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

H(div) and H(curl) basis support for magma backend

1d281a7b 17-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Remove accidental WIP tuning files from #1382


913f8461 17-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Adjust include groupings

e4e1133f 17-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Fix mismatching declaration

940a72f1 10-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Formatting consistency for magma backend with cuda-ref and hip-ref

Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-tensor basis CEED_EVAL_WEIGHT mode.

f80f4a74 09-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Rename files in magma backend for consistency with other libCEED backends


/libCEED/.gitlab-ci.yml
/libCEED/Makefile
magma/ceed-magma-basis.c
magma/ceed-magma-gemm-nontensor.c
magma/ceed-magma-gemm-selector.cpp
magma/tuning/a100.h
magma/tuning/a100_grad_rtc.h
magma/tuning/a100_interp_rtc.h
magma/tuning/indices.h
magma/tuning/mi100.h
magma/tuning/mi250x.h
magma/tuning/mi250x_grad_rtc.h
magma/tuning/mi250x_interp_rtc.h
magma/tuning/v100.h
/libCEED/benchmarks/petsc-bps.sh
/libCEED/benchmarks/petsc-bpsraw.sh
/libCEED/benchmarks/postprocess_base.py
/libCEED/benchmarks/postprocess_plot.py
/libCEED/benchmarks/postprocess_table.py
/libCEED/examples/fluids/blasius.yaml
/libCEED/examples/fluids/conv_plot.py
/libCEED/examples/fluids/index.md
/libCEED/examples/fluids/navierstokes.c
/libCEED/examples/fluids/postprocess/vortexshedding.py
/libCEED/examples/fluids/problems/freestream_bc.c
/libCEED/examples/fluids/problems/newtonian.c
/libCEED/examples/fluids/qfunctions/blasius.h
/libCEED/examples/fluids/qfunctions/channel.h
/libCEED/examples/fluids/qfunctions/densitycurrent.h
/libCEED/examples/fluids/qfunctions/differential_filter.h
/libCEED/examples/fluids/qfunctions/freestream_bc.h
/libCEED/examples/fluids/qfunctions/gaussianwave.h
/libCEED/examples/fluids/qfunctions/newtonian.h
/libCEED/examples/fluids/qfunctions/newtonian_state.h
/libCEED/examples/fluids/qfunctions/sgs_dd_model.h
/libCEED/examples/fluids/qfunctions/stabilization.h
/libCEED/examples/fluids/qfunctions/taylorgreen.h
/libCEED/examples/fluids/qfunctions/turb_spanstats.h
/libCEED/examples/fluids/qfunctions/velocity_gradient_projection.h
/libCEED/examples/fluids/src/setupts.c
/libCEED/examples/fluids/tests-output/fluids-navierstokes-channel-prim.bin
/libCEED/examples/fluids/tests-output/fluids-navierstokes-channel.bin
/libCEED/examples/fluids/tests-output/fluids-navierstokes-dc-explicit.bin
/libCEED/examples/fluids/tests-output/fluids-navierstokes-dc-implicit-stab-none.bin
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-1d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-2d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-3d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-grad-nontensor.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-1d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-2d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-3d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-interp-nontensor.h
/libCEED/include/ceed/jit-source/magma/magma-basis-weight-1d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-weight-2d.h
/libCEED/include/ceed/jit-source/magma/magma-basis-weight-3d.h
/libCEED/include/ceed/jit-source/magma/magma-common-defs.h
/libCEED/include/ceed/jit-source/magma/magma-common-nontensor.h
/libCEED/include/ceed/jit-source/magma/magma-common-tensor.h
/libCEED/interface/ceed-operator.c
/libCEED/rust/libceed/src/lib.rs
/libCEED/rust/libceed/src/operator.rs
/libCEED/tests/junit.py
/libCEED/tests/junit_common.py
0cb85d04 04-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Revert opt/blocked backend delegate, this is no longer needed after #1362

51888a71 03-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Correct opt/blocked fallback to opt/serial for things like QFunction assembly

8130dc29 03-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Minor improvement to consistency between ref/blocked and opt/blocked backends for readability

3bf1f308 04-Oct-2023 Jeremy L Thompson <jeremy@jeremylt.org>

cpu - clean up delegation/fallback between CPU backends

a71faab1 03-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Remove unused parameter from CeedTensorContractCreate

d53ea278 29-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

magma - fix make tidy issues

1411c262 28-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

memcheck - move codecov markers

35aed383 28-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

ref - fix rstr parent resource name check

b31f666e 28-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

ref - move codecov markers

1c7d1e03 20-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1340 from CEED/jeremy/fix-ptsc-orients-copy

Fix CEED_COPY_VALUES for ref rstr at points

506b1a0c 20-Sep-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Non-square operator full assembly (#1316)

* Consistency formatting for operator tests

* Add (failing) test for non-square operator full assembly

* Add support for CPU-based full assembly of non-square CeedOperators

* Fix full assembly of identity quadrature functions and operators with CEED_BASIS_NONE

Also adds a unit test for full assembly which fails prior to the changes in this commit.

* Minor refactor to improve coverage


blocked/ceed-blocked-operator.c
cuda-ref/ceed-cuda-ref-operator.c
cuda-ref/ceed-cuda-ref.h
hip-ref/ceed-hip-ref-operator.c
opt/ceed-opt-operator.c
ref/ceed-ref-operator.c
sycl-ref/ceed-sycl-ref-operator.sycl.cpp
/libCEED/doc/sphinx/source/libCEEDapi.md
/libCEED/doc/sphinx/source/libCEEDdev.md
/libCEED/doc/sphinx/source/releasenotes.md
/libCEED/examples/petsc/qfunctions/area/areacube.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed/backend.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-preconditioning.c
/libCEED/interface/ceed-qfunction-register.c
/libCEED/interface/ceed-register.c
/libCEED/interface/ceed.c
/libCEED/tests/README.md
/libCEED/tests/t101-vector.c
/libCEED/tests/t105-vector.c
/libCEED/tests/t203-elemrestriction.c
/libCEED/tests/t204-elemrestriction.c
/libCEED/tests/t208-elemrestriction-f.f90
/libCEED/tests/t208-elemrestriction.c
/libCEED/tests/t209-elemrestriction.c
/libCEED/tests/t213-elemrestriction.c
/libCEED/tests/t300-basis.c
/libCEED/tests/t307-basis.c
/libCEED/tests/t320-basis-f.h
/libCEED/tests/t330-basis.h
/libCEED/tests/t356-basis.c
/libCEED/tests/t406-qfunction-scales.h
/libCEED/tests/t507-operator.c
/libCEED/tests/t509-operator.c
/libCEED/tests/t523-operator.c
/libCEED/tests/t539-operator.c
/libCEED/tests/t553-operator.c
/libCEED/tests/t560-operator.c
/libCEED/tests/t561-operator.c
/libCEED/tests/t562-operator.c
/libCEED/tests/t563-operator.c
/libCEED/tests/t564-operator.c
/libCEED/tests/t565-operator.c
/libCEED/tests/t566-operator.c
/libCEED/tests/t567-operator.c
/libCEED/tests/t568-operator.c
/libCEED/tests/t569-operator.c
/libCEED/tests/t570-operator.c
/libCEED/tests/t580-operator.c
/libCEED/tests/t580-operator.h
/libCEED/tests/t581-operator.c
/libCEED/tests/t582-operator.c
/libCEED/tests/t583-operator.c
58c07c4f 20-Sep-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Support CPU shared-memory parallelism with OpenMP (#1279)

* Updates for OpenMP thread safety (one Ceed per thread, potentially with shared output vector)

* Makefile tabs vs. spaces consistency

* Fix unrelated pragma bug for Intel compilers in `backend.h`

* Address PR feedback: Use _OPENMP macro, simplify OpenMP pragma wrappers

* Address PR feedback: Add new statement macro CeedPragmaThreadPrivate to PREDEFINED entries in Doxyfile

* Add OpenMP support to Intel CI workflow for testing

* Add documentation for OPENMP option and Update releasenotes.md

* Revise OpenMP implementation: Rather than enforcing global variables to be threadprivate, just wrap potential race conditions in a critical block (during registration)

* Avoid returning from OpenMP blocks

* Early break on error when registering backends or QFunctions

* Formatting fixes after rebase, newline after variable declarations

* Address PR feedback: Revert some unintentional changes to debug output

* Update codecov exclusions

07d5dec1 20-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

rstr - fix CEED_COPY_VALUES for ref rstr at points

1249ccc5 19-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

rstr - renaming for clarity

0930e4e7 14-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

rstr - add tests for AtPoints

05fa913c 13-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

rstr - allow restriction to ordered points evec

2c7e7413 12-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

rstr - initial CPU implementation

397164e9 15-Sep-2023 Sebastian Grimberg <sjg@amazon.com>

Small update for cuda-shared/hip-shared consistency
