| #
d4cc1845
|
| 30-Dec-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1912 from CEED/jeremy/copyright
minor - update copyright to 2026
|
| #
9ba83ac0
|
| 19-Dec-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright to 2026
|
| #
6d997e5f
|
| 09-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1851 from CEED/jeremy/gen-qf-assemble
GPU Gen QFunction Assembly
|
| #
0816752e
|
| 02-Jul-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - QFunction assembly for gen
|
| #
0183ed61
|
| 17-Jun-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
GPU Assembly AtPoints (#1833)
* cuda - AtPoints diagonal assembly for gen
* hip - AtPoints diagonal assembly for gen
* pc - use subops for LinearAssemble[Add]Diagonal if composite
* gen - turn more numbers into named variables
* gen - fix alignment for assembly
* gen - check for only one active basis in/out
* HIP gen at points syntax error fixes
* hip - embarassing fix
* gen - add Tab helper to manage indentation
---------
Co-authored-by: Zach Atkins <Zach.Atkins@colorado.edu>
|
| #
20a16a5f
|
| 20-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1786 from CEED/jeremy/copy-headers
minor - upate copyright to 2025
|
| #
d275d636
|
| 19-Mar-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - upate copyright to 2025
|
| #
ea41f46c
|
| 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1740 from CEED/jeremy/gen-rollback
GPU - gen fallback to shared if error
|
| #
ddae5012
|
| 07-Feb-2025 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - gen fallback to shared if error
|
| #
509d4af6
|
| 28-Mar-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Drop JiT Guards in Most QF Source (#1540)
* backend - use pragma once
* gallery - drop source header guards
* ex - drop some qfunction source header guards
* fluids - drop guards on all singly included headers
* jit - drop most guards on backend JiT files
* sycl - drop extra header guards
* jit - enable #pragma once for QF source
* fluids - use #pragma once for util/helper qf source
* test - check different multiple includes
* fluids - fix odd include
* jit - update interface for building JiT string from multiple files
|
| #
a171b6ef
|
| 27-Mar-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1537 from CEED/jeremy/pragma-once
Use #pragma once for non-JiT headers
|
| #
5aed82e4
|
| 27-Mar-2024 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright headers
|
| #
31c137a9
|
| 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1320 from CEED/jeremy/jit-header-guards
style - fix header guards
|
| #
94b7b29b
|
| 01-Sep-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
style - fix header guards
|
| #
4b35598d
|
| 20-Jun-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #1231 from CEED/jeremy/consistency
Consistency fixes
|
| #
eb7e6caf
|
| 16-Jun-2023 |
Jeremy L Thompson <jeremy@jeremylt.org> |
gpu - naming consistency fixes
|
| #
2b730f8b
|
| 17-Nov-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Switch to clang-format (#1051)
* style - switch to clang-format
* ci - use newer libxsmm
* action - update format action
* format - consistent use of {} for multi-line if/for
* make - remove stray newline
* make - simpler 'make format' target
* ci - use newer libxsmm
* doc - minor release note claification
* minor - minor fix
* minor - minor fix
* minor - minor fix
* minor - minor fix
* make format
* format - less aggressive alignment rules
* tidy - check for argument name mismatches
* fix newline
* format - mirror Ratel update to .clang-format
* fix merge error
* fix merge conflict
* fix merge error
* drop style in .phony list
* Update .clang-format
Co-authored-by: Jed Brown <jed@jedbrown.org>
* apply updated format
Co-authored-by: Jed Brown <jed@jedbrown.org>
|
| #
ce18bed9
|
| 17-Mar-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #858 from CEED/jeremy/dump-copy-stuff
Strip redundant/outdated license info duplication
|
| #
3d8e8822
|
| 17-Mar-2022 |
Jeremy L Thompson <jeremy@jeremylt.org> |
minor - update copyright headers
|
| #
3d576824
|
| 29-Jan-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
headers - clearify includes to not rely on transitive includes (#701)
* headers - clearify includes to not rely on transitive includes
* style - add header recommendations from 'include-what-you-use'
* style - apply 'include-what-you-use' changes to CUDA backends
* style - 'include-what-you-use' for hip backends
* style - drop ceed.h includes in gallery qf source
* docs - add dev notes for header files
* style - header style and alphabetize
|
| #
241a4b83
|
| 25-Jul-2019 |
Yohann <yohann.dudouit@gmail.com> |
Full jit compiled operator: cuda-gen backend (#275)
* First steps toward cuda-gen backend!
* Closer to real code generation.
* Generated code should be ready for nvrtc.
* The code generation skeleton is ready.
* Hack with the qfunction to make the operator kernel compile.
* Some tweaks in the makefile + Input fields structure change.
* Remove using cout.
* 1d interp and grad device functions.
* 1d readDofs, readQuads, writeDofs, writeQuads.
* Remove dead code.
* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d
* 2d interp and grad
* 3d interp and grad
* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel
* Add the qFunction reader + inlining
* Add qf files for the tests.
* Add qf file for ceed/ex1
* Add qf file for mfem/bp1
* All tests pass.
* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.
* mfem/bp1 passes + remove dead code
* Fix a bug in n_quads_out for writeQuads
* mfem/bp3 passes.
* All tests all examples pass.
* Temporary tweaks for mfem benchmarking
* Add Context management.
* Modify .qf files to take into account the context.
* Enable optimizations.
* First set of optimization for 2D and 3D.
* Makefile tweaks and destructor code.
* make style.
* Add -MP flag.
* Fix linking issues with the tests.
* Update .qf files for the tests.
* Add .qf files for nek5000 examples.
* Use shared memory for B and G matrices.
* Fix bug introduced in previous commit.
|