History log of /libCEED/backends/xsmm/ceed-xsmm.h (Results 1 – 25 of 42)
Revision Date Author Comments
# d4cc1845 30-Dec-2025 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1912 from CEED/jeremy/copyright

minor - update copyright to 2026


# 9ba83ac0 19-Dec-2025 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright to 2026


# 20a16a5f 20-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1786 from CEED/jeremy/copy-headers

minor - update copyright to 2025


# d275d636 19-Mar-2025 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright to 2025


# 509d4af6 28-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Drop JiT Guards in Most QF Source (#1540)

* backend - use pragma once

* gallery - drop source header guards

* ex - drop some qfunction source header guards

* fluids - drop guards on all singly included headers

* jit - drop most guards on backend JiT files

* sycl - drop extra header guards

* jit - enable #pragma once for QF source

* fluids - use #pragma once for util/helper qf source

* test - check different multiple includes

* fluids - fix odd include

* jit - update interface for building JiT string from multiple files


# a171b6ef 27-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1537 from CEED/jeremy/pragma-once

Use #pragma once for non-JiT headers


# 5aed82e4 27-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright headers


# d663a15a 04-Oct-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Merge pull request #1360 from CEED/sjg/tensor-contract-constructor


# a71faab1 03-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Remove unused parameter from CeedTensorContractCreate


# c6c10e09 01-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1322 from CEED/sjg/header-guards-fix

Fix a few header includes following PR #1320


# 17b5e52f 01-Sep-2023 Sebastian Grimberg <sjg@amazon.com>

Fix a few header includes following PR #1320


# 4548da4e 12-Jul-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unnecessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version


# 49aac155 24-Mar-2023 Jeremy L Thompson <jeremy@jeremylt.org>

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc headers

* iwyu - ceed/ceed.h -> ceed.h

* iwyu - cuda/hip include fixes


# 2b730f8b 17-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - remove stray newline

* make - simpler 'make format' target

* ci - use newer libxsmm

* doc - minor release note clarification

* minor - minor fix

* minor - minor fix

* minor - minor fix

* minor - minor fix

* make format

* format - less aggressive alignment rules

* tidy - check for argument name mismatches

* fix newline

* format - mirror Ratel update to .clang-format

* fix merge error

* fix merge conflict

* fix merge error

* drop style in .phony list

* Update .clang-format

Co-authored-by: Jed Brown <jed@jedbrown.org>

* apply updated format

Co-authored-by: Jed Brown <jed@jedbrown.org>


# ce18bed9 17-Mar-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #858 from CEED/jeremy/dump-copy-stuff

Strip redundant/outdated license info duplication


# 3d8e8822 17-Mar-2022 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright headers


# 8b036261 16-Feb-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #899 from CEED/jed/cu-lv-cuda-11.6

CI: update lv for cuda-11.6


# f190906a 16-Feb-2022 Jed Brown <jed@jedbrown.org>

xsmm: support for 1.17 headers


# dde41789 26-Jan-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #882 from CEED/jeremy/xsmm-1.17

CI - update to xsmm 1.17


# 318af0d1 26-Jan-2022 Jeremy L Thompson <jeremy@jeremylt.org>

xsmm - add compatibility for xsmm > 1.17


# 80a9ef05 02-Sep-2021 Natalie Beams <246972+nbeams@users.noreply.github.com>

Allow CeedScalar to be single precision (#788)

One can modify `ceed.h` to include `ceed-f32.h` and then use single precision. This is tested for C in CI and has been tested by developers with Rust, Julia, and Python. This interface is evolving and should be considered experimental at this time (thus lack of automated build support).

* Introduce CeedScalarType enum

* WIP changes to allow different definitions of CeedScalar

* Introduce new header files for float and double

* Only use avx tensor contract and MAGMA non-tensor basis if CeedScalar is double

* WIP changes to allow CeedScalar to be float

* WIP start trying to adjust test tolerances for float or double

* fix typos in comments

* install ceed-f32/64 headers

* Fix missing casts for hipMAGMA element restrictions

* make CeedQFunctionContextGetContextSize available for Python bindings

* Changes to Python bindings to allow CeedScalar to be float

* WIP adjust Python tests for float or double

* make style

* remove QFunctionContextGetContextSize from backend header

* Use quotes instead of <> in include statement

* Remove unnecessary includes

* Update tolerances for tests

* [Julia] allow CeedScalar to be Float32

* [Julia] Use Preferences instead of custom build configuration

# Conflicts:
# julia/LibCEED.jl/src/C.jl

* [Makefile] Change definition of CC_VENDOR so it works with cross-compilation

* [Julia] Use Preferences in CI

# Conflicts:
# .github/workflows/julia-test-with-style.yml

* [Julia] Update docs about preferences

* [Julia] Add test/Project.toml workaround for Preferences

* Add CeedGetScalarType to get the type of CeedScalar at runtime

* [Julia] Move functions from Ceed.jl to LibCEED.jl

* [Julia] Add support for getting library path and scalar type at runtime

* [Julia] Minor change to checking if CUDA is loaded

* [Julia] Check correct CeedScalar types in basis functions

* [Julia] Fix tests comparing with output file

* [Julia] Change devtests to use CeedScalar instead of Float64

* Update test 402 so context will be same size in double or float

* Update tolerances for ceed examples

* [Julia] CUDA fixes

* remove unused variable in t208

* SchurDecomposition: do not compute tau on final iteration

* Update tolerances for some basis tests (for single precision)

* Make style

* Python style fixes for basis test

* Add single precision output for t300 and t320 and adjust checks; skip t541 in single

* Add LCOV exclusions after moving to new line

* fix spacing

* Python: make CEED_EPSILON available as libceed.EPSILON

* Python: optional parameter to specify different output file for test comparison

* Python: update tests' use of EPSILON and change test_300 output file for single precision

* Python: add convenience function for getting dtype corresponding to CeedScalar

* rust - add single precision support

* [Julia] Fall back on Float64 if CeedGetScalarType is not available

* [Julia] style

* Adjust tolerance for t301

* xsmm - add single precision support

* avx - add single precision support

* Add initial single precision support for MAGMA non-tensor basis

* Skip t300 and t320 in single precision; revert Python t300 changes

* Revert output changes for t300 and t320 in junit

* [Julia] Changes to autogenerated bindings for mixed precision

* [Julia] style

* [Julia] Check scalar type when changing libceed library path

The check is also performed when the package is loaded. This prevents having to
restart the Julia session twice

* [Julia] Require JLLWrappers version 1.3

This is needed to use Preferences to change the library path

* Add documentation page for precision development

Co-authored-by: Will Pazner <will.e.p@gmail.com>

* Cleanup from merge: remove old README

* Return CEED_ALIGN to backend.h

* Make Fortran compiler (FC) optional; empty skips Fortran tests

Use in Python and Rust builds, which may not have a Fortran compiler
installed and thus would produce confusing output.

* Add single precision CI test for Noether

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

Co-authored-by: Will Pazner <will.e.p@gmail.com>
Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: Jed Brown <jed@jedbrown.org>


# 7f565272 09-Apr-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #735 from CEED/jeremy/style-guide

Style Unification


# d1d35e2f 09-Apr-2021 jeremylt <thompson.jeremy.luke@gmail.com>

style - updating C variable style


# 874019bc 31-Mar-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #716 from CEED/jed/install-backend.h

Jed/install backend.h


# ec3da8bc 26-Mar-2021 Jed Brown <jed@jedbrown.org>

Install backend headers under include/ceed/

This makes it possible to distribute source plugins that provide
additional backends. It's also used in MFEM, perhaps temporarily.

Deprecate ceed-backend.h, which was not previously installed, but some
users accessed it from an in-place build.

Also install CUDA and HIP headers that allow users to provide CUfunction
and hipFunction_t.

Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu>
Requested-by: Andrew T. Barker <barker29@llnl.gov>
