minor - update copyright to 2026
minor - upate copyright to 2025
gpu - allow running shared kernels on stream
ceed - require *GetCeed ceed to be Destroyed
minor - fix CeedCall() vs CeedCallBackend() in backend code
Add CeedBasisApplyAdd (#1644)* basis - add CeedBasisApplyAdd + CPU impl * basis - add ref GPU ApplyAdd * basis - add shared GPU ApplyAdd * basis - add MAGMA ApplyAdd * basis - add CeedB
Add CeedBasisApplyAdd (#1644)* basis - add CeedBasisApplyAdd + CPU impl * basis - add ref GPU ApplyAdd * basis - add shared GPU ApplyAdd * basis - add MAGMA ApplyAdd * basis - add CeedBasisApplyAddAtPoints + default impl * basis - add GPU ApplyAddAtPoints * tidy - add extra assert to fix clang-tidy * Apply suggestions from code review style - consistently use indexing over pointer arithmatic Co-authored-by: Zach Atkins <zach.atkins@colorado.edu> * style - more pointer fixes --------- Co-authored-by: Zach Atkins <zach.atkins@colorado.edu>
show more ...
magma - explicitly exclude BasisApplyAtPoints
basis: CreateProjection set q_ref, q_weight to NULL
Drop JiT Guards in Most QF Source (#1540)* backend - use pragma once * gallery - drop source header guards * ex - drop some qfunction source header guards * fluids - drop guards on all sin
Drop JiT Guards in Most QF Source (#1540)* backend - use pragma once * gallery - drop source header guards * ex - drop some qfunction source header guards * fluids - drop guards on all singly included headers * jit - drop most guards on backend JiT files * sycl - drop extra header guards * jit - enable #pragma once for QF source * fluids - use #pragma once for util/helper qf source * test - check different multiple includes * fluids - fix odd include * jit - update interface for building JiT string from multiple files
minor - update copyright headers
internal - more updates for const
minor - reduce LCOV_EXCL markers
Remove need for separate CUmodule/hipModule_t for weights in non-tensor MAGMA backend
Add missing magma_finialize() call
Add updated MI100 tuning data; make style updates for tuning data
Add updated A100 tuning data
Add updated MI250X tuning data
Add RTC tuning data for V100 and H100
Increment NB by 2 or 4 to speed up tuning
Add 2*p quadrature rules to tuning data (previously was just 2*(p-1)
Add documentation for MAGMA autotuning
New RTC tuning data format
Fixes and improvements to autotuning code
Add Python program to automatically generate tuning data for MAGMA RTC kernels
Add simple test program to generate benchmark data for CeedBasis operations
12345678