History log of /libCEED/rust/libceed-sys/c-src/backends/xsmm/ (Results 51 – 55 of 55)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
84a01de512-Mar-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Serial and Blocked AVX Backends (#198)

* Add serial AVX backend

* Style and README changes

* Simplify AVX serial tensor loop

* Minor performance improvement

* C=1 AVX scalar case

* In

Serial and Blocked AVX Backends (#198)

* Add serial AVX backend

* Style and README changes

* Simplify AVX serial tensor loop

* Minor performance improvement

* C=1 AVX scalar case

* Increase use of AVX commands for edge cases

* Prep for eventual Tensor Object

* Comment updates

* Readme update

* Update README

* Refactor to reduce code

* Increase vectorization in remainder of columns

* Vectorize column remainder on C=1 case

* Switch to static inlining for AVX tensor contract

* Tidying for merge

* make style

* Style cleanup

* Full register use for columns

* Make style

show more ...

cdf4f91809-Mar-2019 jeremylt <jeremy.thompson@colorado.edu>

Apply style changes

856142e106-Feb-2019 jeremylt <jeremy.thompson@colorado.edu>

Backend naming adjustment


/libCEED/.travis.yml
/libCEED/LICENSE
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx.c
/libCEED/backends/blocked/ceed-blocked-basis.c
/libCEED/backends/blocked/ceed-blocked-operator.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/occa/ceed-occa-restrict.c
/libCEED/backends/occa/ceed-occa-restrict.okl
/libCEED/backends/occa/ceed-occa.h
/libCEED/backends/ref/ceed-ref-operator.c
/libCEED/backends/ref/ceed-ref.c
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/benchmarks/.gitignore
/libCEED/benchmarks/README.md
/libCEED/benchmarks/benchmark.sh
/libCEED/benchmarks/petsc-bp1.sh
/libCEED/benchmarks/petsc-bp3.sh
/libCEED/benchmarks/postprocess-base.py
/libCEED/benchmarks/postprocess-plot.py
/libCEED/benchmarks/postprocess-table.py
/libCEED/examples/Makefile
/libCEED/examples/README.md
/libCEED/examples/ceed/ex1.cu
/libCEED/examples/mfem/bp1.cu
/libCEED/examples/mfem/bp3.cu
/libCEED/examples/nek5000/README.md
/libCEED/examples/nek5000/bp1.cu
/libCEED/examples/nek5000/bp1.okl
/libCEED/examples/nek5000/bp1.usr
/libCEED/examples/nek5000/bp3.cu
/libCEED/examples/nek5000/bp3.okl
/libCEED/examples/nek5000/bp3.usr
/libCEED/examples/nek5000/make-nek-examples.sh
/libCEED/examples/nek5000/make-nek-tests.sh
/libCEED/examples/nek5000/run-nek-example.sh
/libCEED/examples/petsc/Makefile
/libCEED/examples/petsc/bp1.c
/libCEED/examples/petsc/bp1.cu
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp3.c
/libCEED/examples/petsc/bp3.cu
/libCEED/examples/petsc/bp3.h
/libCEED/tests/t000-init-f.f90
/libCEED/tests/t100-vec-f.f90
/libCEED/tests/t101-vec-f.f90
/libCEED/tests/t102-vec-f.f90
/libCEED/tests/t103-vec-f.f90
/libCEED/tests/t104-vec-f.f90
/libCEED/tests/t105-vec-f.f90
/libCEED/tests/t106-vec-f.f90
/libCEED/tests/t107-vec-f.f90
/libCEED/tests/t108-vec-f.f90
/libCEED/tests/t200-elemrestriction-f.f90
/libCEED/tests/t201-elemrestriction-f.f90
/libCEED/tests/t202-elemrestriction-f.f90
/libCEED/tests/t203-elemrestriction-f.f90
/libCEED/tests/t204-elemrestriction-f.f90
/libCEED/tests/t205-elemrestriction-f.f90
/libCEED/tests/t206-elemrestriction-f.f90
/libCEED/tests/t207-elemrestriction-f.f90
/libCEED/tests/t300-basis-f.f90
/libCEED/tests/t301-basis-f.f90
/libCEED/tests/t302-basis-f.f90
/libCEED/tests/t303-basis-f.f90
/libCEED/tests/t304-basis-f.f90
/libCEED/tests/t305-basis-f.f90
/libCEED/tests/t306-basis-f.f90
/libCEED/tests/t307-basis-f.f90
/libCEED/tests/t310-basis-f.f90
/libCEED/tests/t311-basis-f.f90
/libCEED/tests/t312-basis-f.f90
/libCEED/tests/t313-basis-f.f90
/libCEED/tests/t400-qfunction-f.cu
/libCEED/tests/t400-qfunction-f.f90
/libCEED/tests/t400-qfunction.cu
/libCEED/tests/t401-qfunction-f.cu
/libCEED/tests/t401-qfunction-f.f90
/libCEED/tests/t401-qfunction.cu
/libCEED/tests/t500-operator-f.cu
/libCEED/tests/t500-operator-f.f90
/libCEED/tests/t500-operator.cu
/libCEED/tests/t501-operator-f.cu
/libCEED/tests/t501-operator-f.f90
/libCEED/tests/t501-operator.cu
/libCEED/tests/t502-operator-f.cu
/libCEED/tests/t502-operator-f.f90
/libCEED/tests/t502-operator.cu
/libCEED/tests/t510-operator-f.f90
/libCEED/tests/t511-operator-f.f90
/libCEED/tests/tap.sh
c286a8bf14-Jan-2019 jeremylt <jeremy.thompson@colorado.edu>

Switch libXSMM serial basis apply to Nek style

8d713cf620-Dec-2018 jeremylt <jeremy.thompson@colorado.edu>

Initial libXSMM backend


/libCEED/.asanignore
/libCEED/.astylerc
/libCEED/.codecov.yml
/libCEED/.codedocs
/libCEED/.gitignore
/libCEED/.mailmap
/libCEED/.travis.yml
/libCEED/CODE_OF_CONDUCT.md
/libCEED/CONTRIBUTING.md
/libCEED/Doxyfile
/libCEED/LICENSE
/libCEED/Makefile
/libCEED/NOTICE
/libCEED/README.md
/libCEED/azure-pipelines.yml
/libCEED/backends/avx/ceed-avx-basis.c
/libCEED/backends/avx/ceed-avx.c
/libCEED/backends/avx/ceed-avx.h
/libCEED/backends/blocked/ceed-blocked-basis.c
/libCEED/backends/blocked/ceed-blocked-operator.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/blocked/ceed-blocked.h
/libCEED/backends/cuda/ceed-cuda-basis.c
/libCEED/backends/cuda/ceed-cuda-operator.c
/libCEED/backends/cuda/ceed-cuda-qfunction.c
/libCEED/backends/cuda/ceed-cuda-restriction.c
/libCEED/backends/cuda/ceed-cuda-vec.c
/libCEED/backends/cuda/ceed-cuda.c
/libCEED/backends/cuda/ceed-cuda.h
/libCEED/backends/cuda/cuda-vec.cu
/libCEED/backends/magma/atomics.cuh
/libCEED/backends/magma/ceed-magma.c
/libCEED/backends/magma/ceed-magma.h
/libCEED/backends/magma/gccm.py
/libCEED/backends/magma/magma_check_cudaerror.h
/libCEED/backends/magma/magma_dtensor_contract.c
/libCEED/backends/occa/ceed-occa-basis.c
/libCEED/backends/occa/ceed-occa-basis.okl
/libCEED/backends/occa/ceed-occa-okl.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/occa/ceed-occa-qfunction-noop.c
/libCEED/backends/occa/ceed-occa-qfunction-op.c
/libCEED/backends/occa/ceed-occa-qfunction.c
/libCEED/backends/occa/ceed-occa-restrict.c
/libCEED/backends/occa/ceed-occa-restrict.okl
/libCEED/backends/occa/ceed-occa-vector.c
/libCEED/backends/occa/ceed-occa.c
/libCEED/backends/occa/ceed-occa.h
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/backends/ref/ceed-ref-operator.c
/libCEED/backends/ref/ceed-ref-qfunction.c
/libCEED/backends/ref/ceed-ref-restriction.c
/libCEED/backends/ref/ceed-ref-vec.c
/libCEED/backends/ref/ceed-ref.c
/libCEED/backends/ref/ceed-ref.h
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-basis.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/backends/xsmm/ceed-xsmm.h
/libCEED/ceed.pc.template
/libCEED/doc/doxygenlayout.xml
/libCEED/doc/libCEED.png
/libCEED/doc/libCEEDapi.md
/libCEED/doc/libCEEDdev.md
/libCEED/doc/logo.svg
/libCEED/examples/.gitignore
/libCEED/examples/BPdescription.md
/libCEED/examples/Makefile
/libCEED/examples/ceed/Makefile
/libCEED/examples/ceed/ex1.c
/libCEED/examples/ceed/ex1.cu
/libCEED/examples/ceed/ex1.okl
/libCEED/examples/mfem/.gitignore
/libCEED/examples/mfem/Makefile
/libCEED/examples/mfem/bp1.cpp
/libCEED/examples/mfem/bp1.cu
/libCEED/examples/mfem/bp1.hpp
/libCEED/examples/mfem/bp1.okl
/libCEED/examples/mfem/bp3.cpp
/libCEED/examples/mfem/bp3.cu
/libCEED/examples/mfem/bp3.hpp
/libCEED/examples/mfem/bp3.okl
/libCEED/examples/nek5000/.gitignore
/libCEED/examples/nek5000/README.md
/libCEED/examples/nek5000/SIZE.in
/libCEED/examples/nek5000/boxes/b.box
/libCEED/examples/nek5000/boxes/b1e.rea
/libCEED/examples/nek5000/bp1.okl
/libCEED/examples/nek5000/bp1.usr
/libCEED/examples/nek5000/bp3.okl
/libCEED/examples/nek5000/bp3.usr
/libCEED/examples/nek5000/generate-boxes.sh
/libCEED/examples/nek5000/make-nek-examples.sh
/libCEED/examples/nek5000/run-nek-example.sh
/libCEED/examples/petsc/.gitignore
/libCEED/examples/petsc/Makefile
/libCEED/examples/petsc/bp1.c
/libCEED/examples/petsc/bp1.cu
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp1.okl
/libCEED/examples/petsc/bp3.c
/libCEED/examples/petsc/bp3.cu
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp3.okl
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-fortran-name.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/include/ceedf.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-vec.c
/libCEED/interface/ceed.c
/libCEED/output/ex1.out
/libCEED/output/t202-elemrestriction-f.out
/libCEED/output/t202-elemrestriction.out
/libCEED/output/t203-elemrestriction-f.out
/libCEED/output/t203-elemrestriction.out
/libCEED/output/t300-basis-f.out
/libCEED/output/t300-basis.out
/libCEED/output/t306-basis-f.out
/libCEED/output/t306-basis.out
/libCEED/output/t307-basis-f.out
/libCEED/output/t307-basis.out
/libCEED/output/t310-basis-f.out
/libCEED/output/t310-basis.out
/libCEED/tests/t000-init-f.f
/libCEED/tests/t000-init.c
/libCEED/tests/t100-vec-f.f
/libCEED/tests/t100-vec.c
/libCEED/tests/t101-vec-f.f
/libCEED/tests/t101-vec.c
/libCEED/tests/t102-vec-f.f
/libCEED/tests/t102-vec.c
/libCEED/tests/t103-vec-f.f
/libCEED/tests/t103-vec.c
/libCEED/tests/t104-vec-f.f
/libCEED/tests/t104-vec.c
/libCEED/tests/t105-vec-f.f
/libCEED/tests/t105-vec.c
/libCEED/tests/t106-vec-f.f
/libCEED/tests/t106-vec.c
/libCEED/tests/t107-vec-f.f
/libCEED/tests/t107-vec.c
/libCEED/tests/t108-vec-f.f
/libCEED/tests/t108-vec.c
/libCEED/tests/t200-elemrestriction-f.f
/libCEED/tests/t200-elemrestriction.c
/libCEED/tests/t201-elemrestriction-f.f
/libCEED/tests/t201-elemrestriction.c
/libCEED/tests/t202-elemrestriction-f.f
/libCEED/tests/t202-elemrestriction.c
/libCEED/tests/t203-elemrestriction-f.f
/libCEED/tests/t203-elemrestriction.c
/libCEED/tests/t204-elemrestriction-f.f
/libCEED/tests/t204-elemrestriction.c
/libCEED/tests/t205-elemrestriction-f.f
/libCEED/tests/t205-elemrestriction.c
/libCEED/tests/t206-elemrestriction-f.f
/libCEED/tests/t206-elemrestriction.c
/libCEED/tests/t207-elemrestriction-f.f
/libCEED/tests/t207-elemrestriction.c
/libCEED/tests/t300-basis-f.f
/libCEED/tests/t300-basis.c
/libCEED/tests/t301-basis-f.f
/libCEED/tests/t301-basis.c
/libCEED/tests/t302-basis-f.f
/libCEED/tests/t302-basis.c
/libCEED/tests/t303-basis-f.f
/libCEED/tests/t303-basis.c
/libCEED/tests/t304-basis-f.f
/libCEED/tests/t304-basis.c
/libCEED/tests/t305-basis-f.f
/libCEED/tests/t305-basis.c
/libCEED/tests/t306-basis-f.f
/libCEED/tests/t306-basis.c
/libCEED/tests/t307-basis-f.f
/libCEED/tests/t307-basis.c
/libCEED/tests/t310-basis-f.f
/libCEED/tests/t310-basis-f.h
/libCEED/tests/t310-basis.c
/libCEED/tests/t310-basis.h
/libCEED/tests/t311-basis-f.f
/libCEED/tests/t311-basis.c
/libCEED/tests/t312-basis-f.f
/libCEED/tests/t312-basis.c
/libCEED/tests/t313-basis-f.f
/libCEED/tests/t313-basis.c
/libCEED/tests/t400-qfunction-f.cu
/libCEED/tests/t400-qfunction-f.f
/libCEED/tests/t400-qfunction-f.okl
/libCEED/tests/t400-qfunction.c
/libCEED/tests/t400-qfunction.cu
/libCEED/tests/t400-qfunction.okl
/libCEED/tests/t401-qfunction-f.cu
/libCEED/tests/t401-qfunction-f.f
/libCEED/tests/t401-qfunction-f.okl
/libCEED/tests/t401-qfunction.c
/libCEED/tests/t401-qfunction.cu
/libCEED/tests/t401-qfunction.okl
/libCEED/tests/t500-operator-f.cu
/libCEED/tests/t500-operator-f.f
/libCEED/tests/t500-operator-f.okl
/libCEED/tests/t500-operator.c
/libCEED/tests/t500-operator.cu
/libCEED/tests/t500-operator.okl
/libCEED/tests/t501-operator-f.cu
/libCEED/tests/t501-operator-f.f
/libCEED/tests/t501-operator-f.okl
/libCEED/tests/t501-operator.c
/libCEED/tests/t501-operator.cu
/libCEED/tests/t501-operator.okl
/libCEED/tests/t502-operator-f.cu
/libCEED/tests/t502-operator-f.f
/libCEED/tests/t502-operator-f.okl
/libCEED/tests/t502-operator.c
/libCEED/tests/t502-operator.cu
/libCEED/tests/t502-operator.okl
/libCEED/tests/t510-operator-f.f
/libCEED/tests/t510-operator-f.okl
/libCEED/tests/t510-operator.c
/libCEED/tests/t510-operator.okl
/libCEED/tests/t511-operator-f.f
/libCEED/tests/t511-operator-f.okl
/libCEED/tests/t511-operator.c
/libCEED/tests/t511-operator.okl
/libCEED/tests/tap.sh

123