History log of /libCEED/backends/cuda/ (Results 151 – 175 of 180)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
1d102b4803-Oct-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Assemble Linear QFunction (#306)

* Operator - add interface for AssembleLinearQF

* Operator - Refactor Ref/Blocked/Opt basis apply

* Operator - Add AssembleLinearQF impl for Ref/Blocked/Opt, n

Assemble Linear QFunction (#306)

* Operator - add interface for AssembleLinearQF

* Operator - Refactor Ref/Blocked/Opt basis apply

* Operator - Add AssembleLinearQF impl for Ref/Blocked/Opt, not impl message for OCCA/CUDA

* AssembleQF - Add grad test and clean up code

* CPU - Add operator eval mode error, remove opt inlineing

* Operator - clarify QF assemble documentation, style updates

* Interface - style and consistency updates

* Tests - add more complex assembled qfunction test

* Tests - add fortran test for assemble linear qfunction

* Tests - Update t53* tests for new Fortran source macro

* Merge - small fixes

* Operator - convert to backend creating assembled qdata vector

* Operator - zero qvecs before using in assembly of qf

* Operator - expand assemble QF documentation

* CPU - minor fix in AssembleLineaorQF to prevent uninitalized memory

* Tests - fix wording in t531, t532

show more ...


/libCEED/.gitignore
/libCEED/.mailmap
/libCEED/.travis.yml
/libCEED/Doxyfile
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx-blocked.c
/libCEED/backends/avx/ceed-avx-serial.c
/libCEED/backends/blocked/ceed-blocked-operator.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator.c
ceed-cuda-operator.c
/libCEED/backends/magma/atomics.cuh
/libCEED/backends/magma/ceed-magma-basis.c
/libCEED/backends/magma/ceed-magma.c
/libCEED/backends/magma/ceed-magma.h
/libCEED/backends/magma/gccm.py
/libCEED/backends/magma/magma_check_cudaerror.h
/libCEED/backends/magma/magma_dbasisApply_grad.cu
/libCEED/backends/magma/magma_dbasisApply_grad_device.cuh
/libCEED/backends/magma/magma_dbasisApply_interp.cu
/libCEED/backends/magma/magma_dbasisApply_interp_device.cuh
/libCEED/backends/magma/magma_dbasisApply_weight.cu
/libCEED/backends/magma/magma_devptr.cu
/libCEED/backends/magma/magma_tc_device.cuh
/libCEED/backends/memcheck/ceed-memcheck.c
/libCEED/backends/occa/ceed-occa-okl.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/opt/ceed-opt-blocked.c
/libCEED/backends/opt/ceed-opt-operator.c
/libCEED/backends/opt/ceed-opt-serial.c
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/backends/ref/ceed-ref-operator.c
/libCEED/backends/ref/ceed-ref-restriction.c
/libCEED/backends/ref/ceed-ref-vec.c
/libCEED/backends/ref/ceed-ref.c
/libCEED/backends/template/ceed-tmpl-sub.c
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/backends/xsmm/ceed-xsmm-tensor.c
/libCEED/ceed.pc.template
/libCEED/examples/mfem/bp1.hpp
/libCEED/examples/mfem/bp3.hpp
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/bpsdmplex.c
/libCEED/gallery/ceed-gallerytemplate.h
/libCEED/gallery/identity/ceed-identity.c
/libCEED/gallery/identity/ceed-identity.h
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-tensor.c
/libCEED/interface/ceed-vec.c
/libCEED/interface/ceed.c
/libCEED/tests/README.md
/libCEED/tests/junit.py
/libCEED/tests/output/t210-elemrestriction-f.out
/libCEED/tests/output/t210-elemrestriction.out
/libCEED/tests/output/t211-elemrestriction.out
/libCEED/tests/output/t302-basis-f.out
/libCEED/tests/output/t302-basis.out
/libCEED/tests/t103-vec-f.f90
/libCEED/tests/t103-vec.c
/libCEED/tests/t104-vec-f.f90
/libCEED/tests/t104-vec.c
/libCEED/tests/t105-vec-f.f90
/libCEED/tests/t105-vec.c
/libCEED/tests/t110-vec-f.f90
/libCEED/tests/t110-vec.c
/libCEED/tests/t111-vec-f.f90
/libCEED/tests/t111-vec.c
/libCEED/tests/t112-vec-f.f90
/libCEED/tests/t112-vec.c
/libCEED/tests/t113-vec-f.f90
/libCEED/tests/t113-vec.c
/libCEED/tests/t114-vec-f.f90
/libCEED/tests/t114-vec.c
/libCEED/tests/t115-vec-f.f90
/libCEED/tests/t115-vec.c
/libCEED/tests/t116-vec-f.f90
/libCEED/tests/t116-vec.c
/libCEED/tests/t117-vec-f.f90
/libCEED/tests/t117-vec.c
/libCEED/tests/t118-vec-f.f90
/libCEED/tests/t118-vec.c
/libCEED/tests/t210-elemrestriction-f.f90
/libCEED/tests/t210-elemrestriction.c
/libCEED/tests/t211-elemrestriction.c
/libCEED/tests/t302-basis-f.f90
/libCEED/tests/t302-basis.c
/libCEED/tests/t304-basis-f.f90
/libCEED/tests/t305-basis-f.f90
/libCEED/tests/t306-basis-f.f90
/libCEED/tests/t306-basis.c
/libCEED/tests/t315-basis-f.f90
/libCEED/tests/t315-basis.c
/libCEED/tests/t316-basis.c
/libCEED/tests/t411-qfunction-f.f90
/libCEED/tests/t411-qfunction.c
/libCEED/tests/t412-qfunction-f.f90
/libCEED/tests/t412-qfunction.c
/libCEED/tests/t530-operator-f.f90
/libCEED/tests/t530-operator.c
/libCEED/tests/t530-operator.h
/libCEED/tests/t530-operator.okl
/libCEED/tests/t531-operator-f.f90
/libCEED/tests/t531-operator.c
/libCEED/tests/t531-operator.h
/libCEED/tests/t531-operator.okl
/libCEED/tests/t532-operator-f.f90
/libCEED/tests/t532-operator.c
/libCEED/tests/t532-operator.h
/libCEED/tests/t532-operator.okl
/libCEED/tests/tap.sh
ac421f3917-Sep-2019 Yohann <dudouit1@llnl.gov>

Improved performance of cuda-gen backend (#341)

Thanks-to: Tim Warburton
Some of these optimizations are the results of the knowledge and experience gathered by Tim Warburton and his team in libPar

Improved performance of cuda-gen backend (#341)

Thanks-to: Tim Warburton
Some of these optimizations are the results of the knowledge and experience gathered by Tim Warburton and his team in libParanumal and then ported to libCEED.

* Add colocated gradient in 3D.

* Treat the qFunction by slice in 3d to avoid using too many registers.

* Minor fix

* Minor fix.

* Minor fix

* Compute the colocated gradient slice by slice.

* Add synchthreads after initialization of the matrices.

* Remove code print.

* Add a critical #pragma unroll

* Fix typo on "collocated".

* Remove dead code.

* Use ColloGrad3d functions.

* Fix cuda-gen backend when collocated gradient is not available.

* make style

* make style

* Add some comments.

* Replace int by CeedInt.

show more ...


/libCEED/Makefile
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator-build.cpp
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator.c
/libCEED/backends/cuda-gen/ceed-cuda-gen.h
/libCEED/backends/cuda-reg/ceed-cuda-reg-basis.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
ceed-cuda-basis.c
ceed-cuda-restriction.c
ceed-cuda.h
/libCEED/benchmarks/README.md
/libCEED/examples/ceed/ex1.c
/libCEED/examples/ceed/ex1.h
/libCEED/examples/ceed/ex2.h
/libCEED/examples/mfem/bp1.h
/libCEED/examples/mfem/bp1.hpp
/libCEED/examples/mfem/bp3.h
/libCEED/examples/mfem/bp3.hpp
/libCEED/examples/navier-stokes/common.h
/libCEED/examples/nek/bps/bps.h
/libCEED/examples/nek/bps/bps.usr
/libCEED/examples/nek/nek-examples.sh
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp2.h
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp4.h
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/bpsdmplex.c
/libCEED/include/ceed.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-qfunction.c
/libCEED/tests/README.md
/libCEED/tests/junit.py
/libCEED/tests/output/t301-basis-f.out
/libCEED/tests/output/t301-basis.out
/libCEED/tests/output/t302-basis-f.out
/libCEED/tests/output/t302-basis.out
/libCEED/tests/output/t304-basis-f.out
/libCEED/tests/output/t304-basis.out
/libCEED/tests/output/t305-basis-f.out
/libCEED/tests/output/t305-basis.out
/libCEED/tests/output/t320-basis-f.out
/libCEED/tests/output/t320-basis.out
/libCEED/tests/t301-basis-f.f90
/libCEED/tests/t301-basis.c
/libCEED/tests/t302-basis-f.f90
/libCEED/tests/t302-basis.c
/libCEED/tests/t303-basis-f.f90
/libCEED/tests/t303-basis.c
/libCEED/tests/t304-basis-f.f90
/libCEED/tests/t304-basis.c
/libCEED/tests/t305-basis-f.f90
/libCEED/tests/t305-basis.c
/libCEED/tests/t310-basis-f.f90
/libCEED/tests/t310-basis.c
/libCEED/tests/t311-basis-f.f90
/libCEED/tests/t311-basis.c
/libCEED/tests/t312-basis-f.f90
/libCEED/tests/t312-basis.c
/libCEED/tests/t313-basis-f.f90
/libCEED/tests/t313-basis.c
/libCEED/tests/t314-basis-f.f90
/libCEED/tests/t314-basis.c
/libCEED/tests/t315-basis-f.f90
/libCEED/tests/t315-basis.c
/libCEED/tests/t316-basis-f.f90
/libCEED/tests/t316-basis.c
/libCEED/tests/t320-basis-f.f90
/libCEED/tests/t320-basis-f.h
/libCEED/tests/t320-basis.c
/libCEED/tests/t320-basis.h
/libCEED/tests/t321-basis-f.f90
/libCEED/tests/t321-basis.c
/libCEED/tests/t322-basis-f.f90
/libCEED/tests/t322-basis.c
/libCEED/tests/t323-basis-f.f90
/libCEED/tests/t323-basis.c
/libCEED/tests/t400-qfunction-f.f90
/libCEED/tests/t401-qfunction-f.f90
/libCEED/tests/t500-operator-f.f90
/libCEED/tests/t501-operator-f.f90
/libCEED/tests/t502-operator-f.f90
/libCEED/tests/t510-operator-f.f90
/libCEED/tests/t510-operator.c
/libCEED/tests/t511-operator-f.f90
/libCEED/tests/t511-operator.c
/libCEED/tests/t520-operator-f.f90
/libCEED/tests/t520-operator.c
/libCEED/tests/t521-operator-f.f90
/libCEED/tests/t521-operator.c
/libCEED/tests/tap.sh
288c044313-Sep-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

QFunction Create by Name (#311)

This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.

Closes issue #37, issue #340

* Add QFunction gallery, rename focca

QFunction Create by Name (#311)

This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.

Closes issue #37, issue #340

* Add QFunction gallery, rename focca

* Gallery - add initial QFunctions

* Add a test for using the QF gallery

* Modify ex1 to use gallery

* Add multiple test configs to tap

* Move output to test directory

* Update junit

* Add OCCA galley exception

* Add ex2

* Update ex2 for dim->ncompx

* Gallery - modify to work for CUDA as is

* Update Documentation

* Gallery - typo fix

* Gallery - convention change, postappend qfunction family variant

* Gallery - update template with new name checking convention

* Gallery - condense diff3DBuild QFunction

* Gallery - rename diff -> poisson

* Gallery - clarify poisson3DBuild comment

* Gallery - use Pragma SIMD, store Qdata in Voigt convention

* Examples - Convert BP3-6 to Voigt convention

* Examples - add cl option to switch between header and gallery qfs in CEED examples

* Examples - clean up construction of QF name

* Gallery - Switch to PascalCase for gallery names

* Doc - fix function type page

* Interface - Make sure strncpy result is null terminated

* Gallery - Update Poisson 2/3D Apply to new QF body

* make style

* make style - fix worst style problems

* make style - add gallery to make style

* Doc - update documentation errors and inconsistencies

* Examples - test ex1 ex2 with and without gallary

* Examples - reduce testing of ex1/ex2 without gallery, clean up non-gallery qfunctions

* MFEM - revert another make style mistake

* Manual make style updates

* Doc - update function documentation page

* Style updates, document test numbering conventions

* doc: resolve ambiguous image location warning, allow more Dot nodes

* Tests - style and cast cleanup

* Tests - fix README indentation

show more ...


/libCEED/CONTRIBUTING.md
/libCEED/Doxyfile
/libCEED/Makefile
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator.c
/libCEED/backends/cuda-gen/ceed-cuda-gen-qfunction.c
/libCEED/backends/cuda-gen/ceed-cuda-gen.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-basis.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
ceed-cuda-basis.c
ceed-cuda-operator.c
ceed-cuda-qfunction.c
ceed-cuda-restriction.c
ceed-cuda-vec.c
ceed-cuda.h
/libCEED/backends/magma/ceed-magma.c
/libCEED/backends/occa/ceed-occa-qfunction.c
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/benchmarks/README.md
/libCEED/doc/doxygenlayout.xml
/libCEED/doc/img/libCEED.png
/libCEED/doc/img/logo.svg
/libCEED/examples/README.md
/libCEED/examples/ceed/README.md
/libCEED/examples/ceed/ex1.c
/libCEED/examples/ceed/ex1.h
/libCEED/examples/ceed/ex2.c
/libCEED/examples/ceed/ex2.h
/libCEED/examples/mfem/bp1.hpp
/libCEED/examples/mfem/bp3.h
/libCEED/examples/mfem/bp3.hpp
/libCEED/examples/navier-stokes/README.md
/libCEED/examples/navier-stokes/advection.h
/libCEED/examples/navier-stokes/densitycurrent.h
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/nek/README.md
/libCEED/examples/nek/bps/bps.h
/libCEED/examples/nek/bps/bps.usr
/libCEED/examples/petsc/README.md
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp4.h
/libCEED/examples/petsc/bps.c
/libCEED/gallery/ceed-gallerytemplate.c
/libCEED/gallery/ceed-gallerytemplate.h
/libCEED/gallery/mass1d/ceed-mass1dbuild.c
/libCEED/gallery/mass1d/ceed-mass1dbuild.h
/libCEED/gallery/mass1d/ceed-massapply.c
/libCEED/gallery/mass1d/ceed-massapply.h
/libCEED/gallery/mass2d/ceed-mass2dbuild.c
/libCEED/gallery/mass2d/ceed-mass2dbuild.h
/libCEED/gallery/mass3d/ceed-mass3dbuild.c
/libCEED/gallery/mass3d/ceed-mass3dbuild.h
/libCEED/gallery/poisson1d/ceed-poisson1dapply.c
/libCEED/gallery/poisson1d/ceed-poisson1dapply.h
/libCEED/gallery/poisson1d/ceed-poisson1dbuild.c
/libCEED/gallery/poisson1d/ceed-poisson1dbuild.h
/libCEED/gallery/poisson2d/ceed-poisson2dapply.c
/libCEED/gallery/poisson2d/ceed-poisson2dapply.h
/libCEED/gallery/poisson2d/ceed-poisson2dbuild.c
/libCEED/gallery/poisson2d/ceed-poisson2dbuild.h
/libCEED/gallery/poisson3d/ceed-poisson3dapply.c
/libCEED/gallery/poisson3d/ceed-poisson3dapply.h
/libCEED/gallery/poisson3d/ceed-poisson3dbuild.c
/libCEED/gallery/poisson3d/ceed-poisson3dbuild.h
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/include/ceedf.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-tensor.c
/libCEED/interface/ceed-vec.c
/libCEED/interface/ceed.c
/libCEED/tests/README.md
/libCEED/tests/junit.py
/libCEED/tests/output/t202-elemrestriction-f.out
/libCEED/tests/output/t202-elemrestriction.out
/libCEED/tests/output/t203-elemrestriction-f.out
/libCEED/tests/output/t203-elemrestriction.out
/libCEED/tests/output/t208-elemrestriction-f.out
/libCEED/tests/output/t208-elemrestriction.out
/libCEED/tests/output/t300-basis-f.out
/libCEED/tests/output/t300-basis.out
/libCEED/tests/output/t306-basis-f.out
/libCEED/tests/output/t306-basis.out
/libCEED/tests/output/t307-basis-f.out
/libCEED/tests/output/t307-basis.out
/libCEED/tests/output/t310-basis-f.out
/libCEED/tests/output/t310-basis.out
/libCEED/tests/t001-ceed.c
/libCEED/tests/t100-vec.c
/libCEED/tests/t101-vec.c
/libCEED/tests/t102-vec.c
/libCEED/tests/t103-vec.c
/libCEED/tests/t104-vec.c
/libCEED/tests/t105-vec.c
/libCEED/tests/t106-vec.c
/libCEED/tests/t108-vec.c
/libCEED/tests/t109-vec.c
/libCEED/tests/t110-vec.c
/libCEED/tests/t200-elemrestriction.c
/libCEED/tests/t201-elemrestriction.c
/libCEED/tests/t202-elemrestriction.c
/libCEED/tests/t203-elemrestriction.c
/libCEED/tests/t204-elemrestriction.c
/libCEED/tests/t205-elemrestriction.c
/libCEED/tests/t206-elemrestriction.c
/libCEED/tests/t207-elemrestriction.c
/libCEED/tests/t208-elemrestriction.c
/libCEED/tests/t209-elemrestriction.c
/libCEED/tests/t300-basis.c
/libCEED/tests/t301-basis.c
/libCEED/tests/t302-basis.c
/libCEED/tests/t303-basis.c
/libCEED/tests/t304-basis.c
/libCEED/tests/t305-basis.c
/libCEED/tests/t309-basis.c
/libCEED/tests/t310-basis.c
/libCEED/tests/t311-basis.c
/libCEED/tests/t312-basis.c
/libCEED/tests/t313-basis.c
/libCEED/tests/t314-basis.c
/libCEED/tests/t400-qfunction.c
/libCEED/tests/t401-qfunction.c
/libCEED/tests/t410-qfunction-f.f90
/libCEED/tests/t410-qfunction.c
/libCEED/tests/t500-operator.c
/libCEED/tests/t501-operator.c
/libCEED/tests/t502-operator.c
/libCEED/tests/tap.sh
ee07ded211-Sep-2019 Valeria Barra <39932030+valeriabarra@users.noreply.github.com>

Add CeedPragmaOMP to bps (#338)


* Convert petsc BP3&4 to loops

* Update petsc/bp4.h looping

* Switch to CeedPragmaSIMD and make examples/petsc/bp3.h consistent with bp4.h

Remove CeedPragm

Add CeedPragmaOMP to bps (#338)


* Convert petsc BP3&4 to loops

* Update petsc/bp4.h looping

* Switch to CeedPragmaSIMD and make examples/petsc/bp3.h consistent with bp4.h

Remove CeedPragmaOMP directive in Nek example and update documentation

* Remove restric qualifier in petsc/bp3.h and update documentation

show more ...

dc729fe205-Sep-2019 jeremylt <jeremy.thompson@colorado.edu>

CUDA - Rename SyncState -> CeedCudaSyncState

4d537eea02-Sep-2019 Yohann <dudouit1@llnl.gov>

Single Source QFunction (#304)

Introduce a new macro CEED_QFUNCTION that allows to define qFunctions in a single source code independently of the targeted backend.

Thanks-to: Jeremy Thompson
Tha

Single Source QFunction (#304)

Introduce a new macro CEED_QFUNCTION that allows to define qFunctions in a single source code independently of the targeted backend.

Thanks-to: Jeremy Thompson
Thanks-to: Jed Brown
This work is the result of a fruitful discussion between Jed Brown, Jeremy Thompson and Yohann Dudouit. Jeremy Thompson also implemented important features in this commit and was very active and helpful all along the progress of this work.

[NEWS] Breaking change: QFunctionField parameter 'ncomp' changed to 'size'. This change requires setting the previous value of 'ncomp' to 'ncomp*dim' when adding a QFunctionField with eval mode 'CEED_EVAL_GRAD'.

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generation skeleton is ready.

* Hack with the qfunction to make the operator kernel compile.

* Some tweaks in the makefile + Input fields structure change.

* Remove using cout.

* 1d interp and grad device functions.

* 1d readDofs, readQuads, writeDofs, writeQuads.

* Remove dead code.

* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d

* 2d interp and grad

* 3d interp and grad

* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel

* Add the qFunction reader + inlining

* Add qf files for the tests.

* Add qf file for ceed/ex1

* Add qf file for mfem/bp1

* All tests pass.

* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.

* mfem/bp1 passes + remove dead code

* Fix a bug in n_quads_out for writeQuads

* mfem/bp3 passes.

* All tests all examples pass.

* Temporary tweaks for mfem benchmarking

* Add Context management.

* Modify .qf files to take into account the context.

* Enable optimizations.

* First set of optimization for 2D and 3D.

* double pointer format for the qFunction.

* Change the .qf files to have the same code as the C functions.

* Make previous Cuda backends use .qf files.

* Add a return value to qFunctions.

* Make cpu backends use .qf files.

* Minor: clean commented code.

* Add guarded math.h for petsc examples.

* Remove previous nek qf files.

* Remove .cu files.

* Remove .qf files.

* Remove dead code in the tests.

* make style

* Make style fix.

* more make style fixes.

* CEED_QFUNCTION - improve macro for CPU filenames

* Add CEED_QFUNCTION macro to navierstokes.c

* Fix PETSc gitignore

* Change default NS problemtype to density_current (#307) in navierstokes.c

* Fix petsc bp1.h

* Real Fix for petsc bp1.h...

* fix

* README - Add /gpu/cuda/gen

* PETSc - Update dmplex example to use *_loc

* cuda/reg - fix typo

* Revert a couple of small changes

* Fix a bug in mfem bp3 similar to the previous bug in petsc bp3.

* Make PETSc qfunctions look closer to master, and minor style for debugging.

* More uniformity changes

* Fix a strange CUDA_OUT_OF_RESSOURCE bug.

* NS - fix fname variables

* Use a different convention for qFunction ncomp.

* update cuda-gen backend and bpsdmplex.

* PETSc - style update

* update mfem bp1 and bp3.

* Interface - Use size instead of ncomp for QFunction fields

* update ceed example and tests.

* Tests - Update ncomp to size

* CPU Backends - Update ncomp to size

* CPU Backends - style

* Nek - Update ncomp to size

* Opt - fix style

* CUDA - update ncomp to size

* Doc - Update API documentation for QFunction \ncomp->size

* OCCA - Patch QFunction ncomp -> size, work but revamp will be better

* OCCA - assert dim>0 for clang-tidy

* CUDA - Change GetNumComp to GetSize

* Basis - Shift check for dim > 0 to interface

* Doc update

* Update NS field size

* NS - Fix problem options

show more ...


/libCEED/.travis.yml
/libCEED/Doxyfile
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/blocked/ceed-blocked-operator.c
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator-build.cpp
/libCEED/backends/cuda-gen/ceed-cuda-gen-qfunction.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-basis.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
ceed-cuda-operator.c
ceed-cuda-qfunction-load.cpp
ceed-cuda-qfunction-load.h
ceed-cuda-qfunction.c
ceed-cuda-restriction.c
ceed-cuda.h
/libCEED/backends/memcheck/ceed-memcheck-qfunction.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/occa/ceed-occa-qfunction-noop.c
/libCEED/backends/occa/ceed-occa-qfunction-op.c
/libCEED/backends/occa/ceed-occa-qfunction.c
/libCEED/backends/occa/ceed-occa-restrict.c
/libCEED/backends/opt/ceed-opt-operator.c
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/backends/ref/ceed-ref-operator.c
/libCEED/backends/ref/ceed-ref-restriction.c
/libCEED/backends/ref/ceed-ref.h
/libCEED/doc/libCEEDapi.md
/libCEED/examples/Makefile
/libCEED/examples/ceed/ex1.c
/libCEED/examples/ceed/ex1.h
/libCEED/examples/mfem/bp1.cpp
/libCEED/examples/mfem/bp1.h
/libCEED/examples/mfem/bp1.hpp
/libCEED/examples/mfem/bp3.cpp
/libCEED/examples/mfem/bp3.h
/libCEED/examples/mfem/bp3.hpp
/libCEED/examples/navier-stokes/advection.h
/libCEED/examples/navier-stokes/common.h
/libCEED/examples/navier-stokes/densitycurrent.h
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/nek/bps/bps.h
/libCEED/examples/nek/bps/bps.usr
/libCEED/examples/petsc/.gitignore
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp2.h
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp4.h
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/bpsdmplex.c
/libCEED/examples/petsc/common.h
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed.c
/libCEED/tests/t001-ceed-f.f90
/libCEED/tests/t001-ceed.c
/libCEED/tests/t100-vec-f.f90
/libCEED/tests/t100-vec.c
/libCEED/tests/t101-vec-f.f90
/libCEED/tests/t101-vec.c
/libCEED/tests/t103-vec-f.f90
/libCEED/tests/t103-vec.c
/libCEED/tests/t104-vec-f.f90
/libCEED/tests/t104-vec.c
/libCEED/tests/t105-vec-f.f90
/libCEED/tests/t105-vec.c
/libCEED/tests/t106-vec-f.f90
/libCEED/tests/t106-vec.c
/libCEED/tests/t107-vec-f.f90
/libCEED/tests/t107-vec.c
/libCEED/tests/t108-vec-f.f90
/libCEED/tests/t108-vec.c
/libCEED/tests/t109-vec-f.f90
/libCEED/tests/t109-vec.c
/libCEED/tests/t110-vec-f.f90
/libCEED/tests/t110-vec.c
/libCEED/tests/t200-elemrestriction-f.f90
/libCEED/tests/t200-elemrestriction.c
/libCEED/tests/t201-elemrestriction-f.f90
/libCEED/tests/t201-elemrestriction.c
/libCEED/tests/t204-elemrestriction-f.f90
/libCEED/tests/t204-elemrestriction.c
/libCEED/tests/t205-elemrestriction-f.f90
/libCEED/tests/t205-elemrestriction.c
/libCEED/tests/t206-elemrestriction-f.f90
/libCEED/tests/t206-elemrestriction.c
/libCEED/tests/t207-elemrestriction-f.f90
/libCEED/tests/t207-elemrestriction.c
/libCEED/tests/t208-elemrestriction.c
/libCEED/tests/t209-elemrestriction-f.f90
/libCEED/tests/t209-elemrestriction.c
/libCEED/tests/t301-basis-f.f90
/libCEED/tests/t301-basis.c
/libCEED/tests/t302-basis-f.f90
/libCEED/tests/t302-basis.c
/libCEED/tests/t303-basis-f.f90
/libCEED/tests/t303-basis.c
/libCEED/tests/t304-basis-f.f90
/libCEED/tests/t304-basis.c
/libCEED/tests/t305-basis-f.f90
/libCEED/tests/t305-basis.c
/libCEED/tests/t306-basis-f.f90
/libCEED/tests/t307-basis-f.f90
/libCEED/tests/t308-basis-f.f90
/libCEED/tests/t308-basis.c
/libCEED/tests/t309-basis-f.f90
/libCEED/tests/t309-basis.c
/libCEED/tests/t311-basis-f.f90
/libCEED/tests/t311-basis.c
/libCEED/tests/t312-basis-f.f90
/libCEED/tests/t313-basis-f.f90
/libCEED/tests/t313-basis.c
/libCEED/tests/t314-basis-f.f90
/libCEED/tests/t400-qfunction-f.f90
/libCEED/tests/t400-qfunction-f.h
/libCEED/tests/t400-qfunction.c
/libCEED/tests/t400-qfunction.h
/libCEED/tests/t401-qfunction-f.f90
/libCEED/tests/t401-qfunction-f.h
/libCEED/tests/t401-qfunction.c
/libCEED/tests/t401-qfunction.h
/libCEED/tests/t500-operator-f.f90
/libCEED/tests/t500-operator-f.h
/libCEED/tests/t500-operator.c
/libCEED/tests/t500-operator.h
/libCEED/tests/t501-operator-f.f90
/libCEED/tests/t501-operator-f.h
/libCEED/tests/t501-operator.c
/libCEED/tests/t501-operator.h
/libCEED/tests/t502-operator-f.f90
/libCEED/tests/t502-operator-f.h
/libCEED/tests/t502-operator.c
/libCEED/tests/t502-operator.h
/libCEED/tests/t510-operator-f.f90
/libCEED/tests/t510-operator-f.h
/libCEED/tests/t510-operator.c
/libCEED/tests/t510-operator.h
/libCEED/tests/t511-operator-f.f90
/libCEED/tests/t511-operator-f.h
/libCEED/tests/t511-operator.c
/libCEED/tests/t511-operator.h
/libCEED/tests/t520-operator-f.f90
/libCEED/tests/t520-operator-f.h
/libCEED/tests/t520-operator.c
/libCEED/tests/t520-operator.h
/libCEED/tests/t521-operator-f.f90
/libCEED/tests/t521-operator-f.h
/libCEED/tests/t521-operator.c
/libCEED/tests/t521-operator.h
8795c94522-Aug-2019 jeremylt <jeremy.thompson@colorado.edu>

Rename NDoF to NNodes and style updates

241a4b8325-Jul-2019 Yohann <yohann.dudouit@gmail.com>

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generatio

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generation skeleton is ready.

* Hack with the qfunction to make the operator kernel compile.

* Some tweaks in the makefile + Input fields structure change.

* Remove using cout.

* 1d interp and grad device functions.

* 1d readDofs, readQuads, writeDofs, writeQuads.

* Remove dead code.

* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d

* 2d interp and grad

* 3d interp and grad

* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel

* Add the qFunction reader + inlining

* Add qf files for the tests.

* Add qf file for ceed/ex1

* Add qf file for mfem/bp1

* All tests pass.

* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.

* mfem/bp1 passes + remove dead code

* Fix a bug in n_quads_out for writeQuads

* mfem/bp3 passes.

* All tests all examples pass.

* Temporary tweaks for mfem benchmarking

* Add Context management.

* Modify .qf files to take into account the context.

* Enable optimizations.

* First set of optimization for 2D and 3D.

* Makefile tweaks and destructor code.

* make style.

* Add -MP flag.

* Fix linking issues with the tests.

* Update .qf files for the tests.

* Add .qf files for nek5000 examples.

* Use shared memory for B and G matrices.

* Fix bug introduced in previous commit.

show more ...


/libCEED/.travis.yml
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx-blocked.c
/libCEED/backends/avx/ceed-avx-serial.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator-build.cpp
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator-build.h
/libCEED/backends/cuda-gen/ceed-cuda-gen-operator.c
/libCEED/backends/cuda-gen/ceed-cuda-gen-qfunction.c
/libCEED/backends/cuda-gen/ceed-cuda-gen.c
/libCEED/backends/cuda-gen/ceed-cuda-gen.h
ceed-cuda.h
/libCEED/backends/memcheck/ceed-memcheck.c
/libCEED/backends/opt/ceed-opt-blocked.c
/libCEED/backends/opt/ceed-opt-serial.c
/libCEED/backends/ref/ceed-ref.c
/libCEED/backends/template/ceed-tmpl-sub.c
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/examples/Makefile
/libCEED/examples/README.md
/libCEED/examples/ceed/ex1.qf
/libCEED/examples/mfem/bp1.qf
/libCEED/examples/mfem/bp3.qf
/libCEED/examples/navier-stokes/densitycurrent.h
/libCEED/examples/nek/.gitignore
/libCEED/examples/nek/Makefile
/libCEED/examples/nek/README.md
/libCEED/examples/nek/SIZE.in
/libCEED/examples/nek/boxes/b.box
/libCEED/examples/nek/boxes/b1e.rea
/libCEED/examples/nek/bps/bps.cu
/libCEED/examples/nek/bps/bps.okl
/libCEED/examples/nek/bps/bps.usr
/libCEED/examples/nek/nek-examples.sh
/libCEED/examples/nek5000/bp1.qf
/libCEED/examples/nek5000/bp3.qf
/libCEED/examples/petsc/bp1.qf
/libCEED/examples/petsc/bp2.qf
/libCEED/examples/petsc/bp3.qf
/libCEED/examples/petsc/bp4.qf
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/common.qf
/libCEED/include/ceed-backend.h
/libCEED/include/ceed.h
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/tests/junit.py
/libCEED/tests/t209-elemrestriction-f.f90
/libCEED/tests/t209-elemrestriction.c
/libCEED/tests/t400-qfunction-f.qf
/libCEED/tests/t400-qfunction.qf
/libCEED/tests/t401-qfunction-f.qf
/libCEED/tests/t401-qfunction.qf
/libCEED/tests/t500-operator-f.qf
/libCEED/tests/t500-operator.qf
/libCEED/tests/t501-operator-f.qf
/libCEED/tests/t501-operator.qf
/libCEED/tests/t502-operator-f.qf
/libCEED/tests/t502-operator.qf
/libCEED/tests/t510-operator-f.qf
/libCEED/tests/t510-operator.qf
/libCEED/tests/t511-operator-f.qf
/libCEED/tests/t511-operator.qf
/libCEED/tests/t520-operator-f.qf
/libCEED/tests/t520-operator.qf
/libCEED/tests/t521-operator-f.qf
/libCEED/tests/t521-operator.qf
/libCEED/tests/tap.sh
f405f80604-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #289 from CEED/cuda-occa-copy-vals

Update CUDA/OCCA CEED_COPY_VALUES logic

ea03cb9503-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

Update CUDA/OCCA CEED_COPY_VALUES logic

1226057f27-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Merge branch 'master' into yohann/cuda-restr-opt

Conflicts:
backends/cuda-reg/ceed-cuda-reg-restriction.c
backends/cuda-shared/ceed-cuda-shared-basis.c

9d77422e26-Jun-2019 Jed Brown <jed@jedbrown.org>

Merge branch 'yohann/cuda-non-tensor' [PR #249]

* yohann/cuda-non-tensor:
ceed-cuda: resolve -Wsign-compare for CUresult (unsigned enum) in CeedError
make style.
namespace cuda backends functi

Merge branch 'yohann/cuda-non-tensor' [PR #249]

* yohann/cuda-non-tensor:
ceed-cuda: resolve -Wsign-compare for CUresult (unsigned enum) in CeedError
make style.
namespace cuda backends functions.
Minor: styling
Add CUDA_LIB_DIR_STUBS for systems that don't have CUDA drivers installed
make style
Remove useless function declaration.
Add a reference non-tensor BasisApply for cuda backends.

show more ...


/libCEED/.gitignore
/libCEED/.travis.yml
/libCEED/Doxyfile
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx-blocked.c
/libCEED/backends/avx/ceed-avx-serial.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-basis.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-reg/ceed-cuda-reg.c
/libCEED/backends/cuda-reg/ceed-cuda-reg.h
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
ceed-cuda-basis.c
ceed-cuda-operator.c
ceed-cuda-qfunction.c
ceed-cuda-restriction.c
ceed-cuda.c
ceed-cuda.h
cuda-vec.cu
/libCEED/backends/memcheck/ceed-memcheck.c
/libCEED/backends/occa/ceed-occa-basis.c
/libCEED/backends/occa/ceed-occa-okl.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/occa/ceed-occa-qfunction-noop.c
/libCEED/backends/occa/ceed-occa.c
/libCEED/backends/opt/ceed-opt-blocked.c
/libCEED/backends/opt/ceed-opt-serial.c
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/backends/template/ceed-tmpl-sub.c
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/benchmarks/README.md
/libCEED/benchmarks/benchmark.sh
/libCEED/benchmarks/petsc-bps.sh
/libCEED/benchmarks/postprocess-base.py
/libCEED/benchmarks/postprocess-plot.py
/libCEED/benchmarks/postprocess-table.py
/libCEED/doc/libCEEDdev.md
/libCEED/examples/Makefile
/libCEED/examples/README.md
/libCEED/examples/navier-stokes/README.md
/libCEED/examples/navier-stokes/densitycurrent.h
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/nek5000/generate-boxes.sh
/libCEED/examples/nek5000/make-nek-examples.sh
/libCEED/examples/nek5000/make-nek-tests.sh
/libCEED/examples/petsc/.gitignore
/libCEED/examples/petsc/Makefile
/libCEED/examples/petsc/bp1.cu
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp1.okl
/libCEED/examples/petsc/bp2.cu
/libCEED/examples/petsc/bp2.h
/libCEED/examples/petsc/bp2.okl
/libCEED/examples/petsc/bp3.cu
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp3.okl
/libCEED/examples/petsc/bp4.cu
/libCEED/examples/petsc/bp4.h
/libCEED/examples/petsc/bp4.okl
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/common.cu
/libCEED/examples/petsc/common.h
/libCEED/examples/petsc/common.okl
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-fortran.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-tensor.c
/libCEED/interface/ceed-vec.c
/libCEED/interface/ceed.c
/libCEED/tests/t306-basis-f.f90
/libCEED/tests/t306-basis.c
/libCEED/tests/t314-basis-f.f90
/libCEED/tests/t314-basis.c
/libCEED/tests/t510-operator-f.cu
/libCEED/tests/t510-operator.cu
/libCEED/tests/t511-operator-f.cu
/libCEED/tests/t511-operator.cu
/libCEED/tests/t520-operator-f.cu
/libCEED/tests/t520-operator.cu
/libCEED/tests/t521-operator-f.cu
/libCEED/tests/t521-operator.cu
/libCEED/tests/tap.sh
ab7ab56023-Jun-2019 Jed Brown <jed@jedbrown.org>

ceed-cuda: resolve -Wsign-compare for CUresult (unsigned enum) in CeedError

961116ec17-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

make style.

4a6d4bbd17-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

namespace cuda backends functions.

0109ba8604-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Minor: styling

074be16103-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Optimization of weight kernel and dynamic allocation of shared memory.

- First optimization of weight kernel, kernels are now coalesce but
might not be fully using SMs (need to batch elements per

Optimization of weight kernel and dynamic allocation of shared memory.

- First optimization of weight kernel, kernels are now coalesce but
might not be fully using SMs (need to batch elements per block)
- Switch to dynamic shared memory allocation in order to batch elements
for interpolation and gradient in cuda-shared backend.
- Add GetPreferedMemoryType for cuda-reg and cuda-shared backends.
(Can be removed in the future with delegation of this function)

show more ...


/libCEED/.gitignore
/libCEED/.travis.yml
/libCEED/Doxyfile
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx-blocked.c
/libCEED/backends/avx/ceed-avx-serial.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-reg/ceed-cuda-reg.c
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
ceed-cuda.c
ceed-cuda.h
/libCEED/backends/memcheck/ceed-memcheck.c
/libCEED/backends/occa/ceed-occa-basis.c
/libCEED/backends/occa/ceed-occa-okl.c
/libCEED/backends/occa/ceed-occa-operator.c
/libCEED/backends/occa/ceed-occa-qfunction-noop.c
/libCEED/backends/occa/ceed-occa.c
/libCEED/backends/opt/ceed-opt-blocked.c
/libCEED/backends/opt/ceed-opt-serial.c
/libCEED/backends/template/ceed-tmpl-sub.c
/libCEED/backends/template/ceed-tmpl.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/benchmarks/benchmark.sh
/libCEED/benchmarks/petsc-bps.sh
/libCEED/benchmarks/postprocess-base.py
/libCEED/benchmarks/postprocess-plot.py
/libCEED/doc/libCEEDdev.md
/libCEED/examples/Makefile
/libCEED/examples/README.md
/libCEED/examples/navier-stokes/README.md
/libCEED/examples/navier-stokes/densitycurrent.h
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/nek5000/generate-boxes.sh
/libCEED/examples/nek5000/make-nek-examples.sh
/libCEED/examples/nek5000/make-nek-tests.sh
/libCEED/examples/petsc/.gitignore
/libCEED/examples/petsc/Makefile
/libCEED/examples/petsc/bp1.cu
/libCEED/examples/petsc/bp1.h
/libCEED/examples/petsc/bp1.okl
/libCEED/examples/petsc/bp2.cu
/libCEED/examples/petsc/bp2.h
/libCEED/examples/petsc/bp2.okl
/libCEED/examples/petsc/bp3.cu
/libCEED/examples/petsc/bp3.h
/libCEED/examples/petsc/bp3.okl
/libCEED/examples/petsc/bp4.cu
/libCEED/examples/petsc/bp4.h
/libCEED/examples/petsc/bp4.okl
/libCEED/examples/petsc/bps.c
/libCEED/examples/petsc/common.cu
/libCEED/examples/petsc/common.h
/libCEED/examples/petsc/common.okl
/libCEED/include/ceed-backend.h
/libCEED/include/ceed-impl.h
/libCEED/include/ceed.h
/libCEED/interface/ceed-basis.c
/libCEED/interface/ceed-elemrestriction.c
/libCEED/interface/ceed-operator.c
/libCEED/interface/ceed-qfunction.c
/libCEED/interface/ceed-tensor.c
/libCEED/interface/ceed-vec.c
/libCEED/interface/ceed.c
/libCEED/tests/tap.sh
f8902d9e24-May-2019 jeremylt <jeremy.thompson@colorado.edu>

VecCreate -> VectorCreate

be9261b728-Mar-2019 jeremylt <jeremy.thompson@colorado.edu>

Add ElemRestrictionApplyBlock

abe33e5416-May-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

make style

9ad4535716-May-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Add a reference non-tensor BasisApply for cuda backends.

c532df6316-May-2019 Yohann <dudouit1@llnl.gov>

Cuda backend using shared memory (#247)

Add a GPU backend based on Cuda using shared memory.

* Draft of a shared memory backend

* New basis apply passes all tests.

* Add the possibility to

Cuda backend using shared memory (#247)

Add a GPU backend based on Cuda using shared memory.

* Draft of a shared memory backend

* New basis apply passes all tests.

* Add the possibility to treat several elements in one block of threads.

* Fix an error in 2D and 3D gradient.

* Put the cuda-shared backend in its own folder.

* Minor cleaning.

* Replace <ceed-impl.h> with <ceed-backend.h>

* make style

* Add a few CeedChk_Cu

show more ...


/libCEED/.travis.yml
/libCEED/Makefile
/libCEED/README.md
/libCEED/backends/avx/ceed-avx-blocked.c
/libCEED/backends/avx/ceed-avx-serial.c
/libCEED/backends/avx/ceed-avx-tensor.c
/libCEED/backends/avx/ceed-avx.h
/libCEED/backends/blocked/ceed-blocked-operator.c
/libCEED/backends/blocked/ceed-blocked.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-basis.c
/libCEED/backends/cuda-reg/ceed-cuda-reg-restriction.c
/libCEED/backends/cuda-reg/ceed-cuda-reg.c
/libCEED/backends/cuda-reg/ceed-cuda-reg.h
/libCEED/backends/cuda-reg/cuda-reg-basis.cu
/libCEED/backends/cuda-shared/ceed-cuda-shared-basis.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.c
/libCEED/backends/cuda-shared/ceed-cuda-shared.h
ceed-cuda-basis.c
ceed-cuda-operator.c
ceed-cuda-qfunction.c
ceed-cuda-restriction.c
ceed-cuda-vec.c
ceed-cuda.c
ceed-cuda.h
/libCEED/backends/memcheck/ceed-memcheck-qfunction.c
/libCEED/backends/memcheck/ceed-memcheck.c
/libCEED/backends/memcheck/ceed-memcheck.h
/libCEED/backends/ref/ceed-ref-basis.c
/libCEED/backends/ref/ceed-ref-operator.c
/libCEED/backends/ref/ceed-ref-qfunction.c
/libCEED/backends/ref/ceed-ref-restriction.c
/libCEED/backends/ref/ceed-ref-tensor.c
/libCEED/backends/ref/ceed-ref-vec.c
/libCEED/backends/ref/ceed-ref.c
/libCEED/backends/xsmm/ceed-xsmm-blocked.c
/libCEED/backends/xsmm/ceed-xsmm-serial.c
/libCEED/backends/xsmm/ceed-xsmm-tensor.c
/libCEED/backends/xsmm/ceed-xsmm.h
/libCEED/examples/navier-stokes/navierstokes.c
/libCEED/examples/nek5000/.gitignore
/libCEED/examples/nek5000/bp1.usr
/libCEED/examples/nek5000/bp3.usr
/libCEED/interface/ceed-fortran.c
/libCEED/tests/junit.py
/libCEED/tests/t100-vec-f.f90
/libCEED/tests/t101-vec-f.f90
/libCEED/tests/t102-vec-f.f90
/libCEED/tests/t103-vec-f.f90
/libCEED/tests/t105-vec-f.f90
/libCEED/tests/t106-vec-f.f90
/libCEED/tests/t108-vec-f.f90
/libCEED/tests/t109-vec-f.f90
/libCEED/tests/t109-vec.c
/libCEED/tests/t110-vec-f.f90
/libCEED/tests/t110-vec.c
/libCEED/tests/t200-elemrestriction-f.f90
/libCEED/tests/t201-elemrestriction-f.f90
/libCEED/tests/t202-elemrestriction-f.f90
/libCEED/tests/t203-elemrestriction-f.f90
/libCEED/tests/t204-elemrestriction-f.f90
/libCEED/tests/t205-elemrestriction-f.f90
/libCEED/tests/t206-elemrestriction-f.f90
/libCEED/tests/t207-elemrestriction-f.f90
/libCEED/tests/t301-basis-f.f90
/libCEED/tests/t302-basis-f.f90
/libCEED/tests/t303-basis-f.f90
/libCEED/tests/t304-basis-f.f90
/libCEED/tests/t305-basis-f.f90
/libCEED/tests/t311-basis-f.f90
/libCEED/tests/t312-basis-f.f90
/libCEED/tests/t313-basis-f.f90
/libCEED/tests/t400-qfunction-f.f90
/libCEED/tests/t401-qfunction-f.f90
/libCEED/tests/t500-operator-f.f90
/libCEED/tests/t501-operator-f.f90
/libCEED/tests/t502-operator-f.f90
/libCEED/tests/t510-operator-f.f90
/libCEED/tests/t511-operator-f.f90
/libCEED/tests/t520-operator-f.f90
/libCEED/tests/t521-operator-f.f90
/libCEED/tests/tap.sh
30ea05eb06-May-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Force Context existence with cudaFree(0).

5e9d07a706-May-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Modify the device initialization

974a6da529-Apr-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Fix CeedChk with CeedChk_Cu in the Cuda backend.

12345678