Searched hist:"4548 da4e4ef44dc0f2704ad6d48ac0ca4a16bc83" (Results 1 – 9 of 9) sorted by relevance
| /libCEED/backends/opt/ |
| H A D | ceed-opt.h | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| /libCEED/backends/xsmm/ |
| H A D | ceed-xsmm-tensor.c | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| H A D | ceed-xsmm.h | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| H A D | ceed-xsmm-serial.c | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| H A D | ceed-xsmm-blocked.c | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| /libCEED/interface/ |
| H A D | ceed-tensor.c | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| /libCEED/ |
| H A D | Doxyfile | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| H A D | .gitlab-ci.yml | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|
| H A D | Makefile | diff 4548da4e4ef44dc0f2704ad6d48ac0ca4a16bc83 Wed Jul 12 19:35:51 UTC 2023 Sebastian Grimberg <sebastiangrimb@gmail.com> Update LIBXSMM backend (#1248)
* Fix LIBXSMM kernel generation calls after 9c0e481 in https://github.com/libxsmm/libxsmm
* Update LIBXSMM interface to work with main branch after commit 1f4cdad (in preparation for v2)
* Allow user specified BLAS_LIB for LIBXSMM dependency in Makefile
* LIBXSMM does not require kernels to be released
See https://github.com/libxsmm/libxsmm/issues/783\#issuecomment-1596655284.
* Improvements for non-tensor CPU-based CeedBasisApply for q_comp > 1
* Revert previous commit since it's faster to apply in P*Q panels, remove an unncessary LIBXSMM kernel compilation
* Remove an unused macro
* make format
* Rely on LIBXSMM to cache JIT'd kernels
* LIBXSMM dispatched kernels for xsmm/serial backend
* Combine ceed-xsmm-tensor-fp64 and -fp32 into single file for all precisions
* Address PR comments
* Update GitLab CI LIBXSMM version
|