Home
last modified time | relevance | path

Searched hist:"4548 da4e4ef44dc0f2704ad6d48ac0ca4a16bc83" (Results 1 – 9 of 9) sorted by relevance

/libCEED/backends/opt/
H A Dceed-opt.hdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
/libCEED/backends/xsmm/
H A Dceed-xsmm-tensor.cdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
H A Dceed-xsmm.hdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
H A Dceed-xsmm-serial.cdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
H A Dceed-xsmm-blocked.cdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
/libCEED/interface/
H A Dceed-tensor.cdiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
/libCEED/
H A DDoxyfilediff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
H A D.gitlab-ci.ymldiff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version
H A DMakefilediff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)

* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm

* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)

* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile

* LIBXSMM does not require kernels to be released

See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.

* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1

* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation

* Remove an unused macro

* make format

* Rely on LIBXSMM to cache JIT'd kernels

* LIBXSMM dispatched kernels for xsmm/serial backend

* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions

* Address PR comments

* Update GitLab CI LIBXSMM version