History log of /libCEED/rust/libceed-sys/c-src/backends/magma/ceed-magma-basis.c (Results 26 – 50 of 81)
Revision Date Author Comments
# 90a4cd18 27-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Whitespace formatting consistency with libCEED style


# 9b83230a 27-Oct-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Merge pull request #1385 from CEED/sjg/magma-transpose-opt

Improve transpose basis performance for `magma` backend


# 833aa127 19-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Use shared-memory for transpose non-tensor basis kernel to accelerate load of A matrix


# 7132caa0 20-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

Remove need to zero out V vector before applying basis transpose for magma backends


# 9d15e85b 18-Oct-2023 Sebastian Grimberg <sjg@amazon.com>

H(div) and H(curl) basis support for magma backend


# db7ade31 17-Oct-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1382 from CEED/sjg/magma-formatting-dev

Lazy RTC compilation of MAGMA basis kernels for small P, Q


# 940a72f1 10-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Formatting consistency for magma backend with cuda-ref and hip-ref

Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-ten

Formatting consistency for magma backend with cuda-ref and hip-ref

Includes JiT upgrades for Magma non-tensor basis to only compile for N values which are used at runtime. Adds JiT for Magma non-tensor basis CEED_EVAL_WEIGHT mode.

show more ...


# f80f4a74 09-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Rename files in magma backend for consistency with other libCEED backends


# 21f16bf6 13-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1315 from CEED/jeremy/queep

Backend Consistency Fixes


# 38293ee6 31-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

style - fixes for Magma backends


# 1c66c397 30-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Interface consistency fixes (#1312)

* format - consistency fixes

* format - test name consistency

* format - drop comma operator

* format - drop git add . in favor coercion

* format - ad

Interface consistency fixes (#1312)

* format - consistency fixes

* format - test name consistency

* format - drop comma operator

* format - drop git add . in favor coercion

* format - add CeedScalarSwap macro

show more ...


# 53f7acb1 24-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1292 from CEED/sjg/gpu-rstr-unify

Unify `magma` backend restriction with `cuda-ref` and `hip-ref`


# 00fb7a04 17-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Refactor CeedInit_Magma and CeedInit_Magma_det similar to ceed-cuda-common.h and ceed-hip-common.h


# 63ca1801 08-Jul-2023 Jed Brown <jed@jedbrown.org>

Merge pull request #1249 from CEED/jeremy/memcheck-isnan

Memcheck - warn or error on NaN after write-only access


# 23d4529e 07-Jul-2023 Jeremy L Thompson <jeremy@jeremylt.org>

debug - add debug color enum


# 6e6704a8 19-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1198 from CEED/jeremy/CeedCheck

Add CeedCheck macro to reduce repetition


# 6574a04f 18-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

internal - add CeedCheck macro to reduce repetition


# 49aac155 24-Mar-2023 Jeremy L Thompson <jeremy@jeremylt.org>

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc heade

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc headers

* iwyu - ceed/ceed.h -> ceed.h

* iwyu - cuda/hip include fixes

show more ...


# 023b8a51 25-Jan-2023 abdelfattah83 <36712794+abdelfattah83@users.noreply.github.com>

magma: non-tensor rtc (#1141)

* some refactoring in magma's jit src

* fix path

* fix loading src

* refactor magma nontensor backend

* refactor magma nontensor backend

* [WIP]: new non

magma: non-tensor rtc (#1141)

* some refactoring in magma's jit src

* fix path

* fix loading src

* refactor magma nontensor backend

* refactor magma nontensor backend

* [WIP]: new nontensor basis kernels

* [WIP]: new nontensor basis kernels

* [WIP]: new nontensor basis kernels

* call the new nontensor kernels for low order problems

* multiple compilation for the same kernels but with different tuning parmaters

* magma: allow different nb's for different non-tensor kernels

* tuning data for the non-tensor rtc kernels

* remove no-longer used functions, add new one for tuning the nontensor kernels

* constants for tuning

* tuning functions

* use the tuning functions in compiling/running the new kernels

* bug fix

* fixes

* fixes

* minor

* switch tuning data

* fix name

* fix name

* add function to run cuda kernels with opt-in shared memory feature

* minor fix

* minor fix

* fix calls to batch api

* allow more kernel instances

* temporary timing function

* temporary timing function

* tuning data based on hiprtc

* rollback tuning parameters

* fixes

* fixes

* fix inconsistency in the parameters passed to nvrtc/hiprtc

* minor

* a fix to the nb selector

* cleanup

* merge the opt-in feature in CeedRunKernelDimSharedOptinCuda into CeedRunKernelDimSharedCuda

* fix paths for hip-magma backends

* style

* fixes

* running make format

* undo changes from the last commit

* change HIP_DIR to ROCM_DIR and adjust the paths for magma accordingly

* replace HIP_DIR with ROCM_DIR

show more ...


# 2b730f8b 17-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - re

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - remove stray newline

* make - simpler 'make format' target

* ci - use newer libxsmm

* doc - minor release note claification

* minor - minor fix

* minor - minor fix

* minor - minor fix

* minor - minor fix

* make format

* format - less aggressive alignment rules

* tidy - check for argument name mismatches

* fix newline

* format - mirror Ratel update to .clang-format

* fix merge error

* fix merge conflict

* fix merge error

* drop style in .phony list

* Update .clang-format

Co-authored-by: Jed Brown <jed@jedbrown.org>

* apply updated format

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# 01005eab 30-Aug-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1053 from CEED/natalie/fix-magma-jit-mem

Fix small memory leaks in JIT source code management


# 03f90b05 26-Aug-2022 nbeams <246972+nbeams@users.noreply.github.com>

magma: free memory used in loading jit kernel source


# cb03979f 28-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1002 from CEED/jeremy/format-ceed-types

Add CeedInt_FMT


# 13f886e9 28-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

debug - consistent use of CeedDebug vs CeedDebug256


# 990fdeb6 21-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

fmt - add CeedInt_FMT


1234