| /libCEED/backends/cuda-gen/ |
| H A D | ceed-cuda-gen-operator-build.h | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| H A D | ceed-cuda-gen.h | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| H A D | ceed-cuda-gen-qfunction.c | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| H A D | ceed-cuda-gen.c | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| H A D | ceed-cuda-gen-operator.c | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| H A D | ceed-cuda-gen-operator-build.cpp | 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| /libCEED/examples/petsc/ |
| H A D | bps.c | diff 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|
| /libCEED/ |
| H A D | Makefile | diff 241a4b83dc9c714eb4bcc90729a46be02322143a Thu Jul 25 22:33:45 UTC 2019 Yohann <yohann.dudouit@gmail.com> Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|