History log of /libCEED/backends/cuda-ref/ceed-cuda-ref.h (Results 51 – 75 of 87)
Revision Date Author Comments
# 78cbaa95 30-Nov-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Merge pull request #1300 from CEED/sjg/hcurl-hdiv-basis-cuda-hip

H(div) and H(curl) basis support for `cuda-ref` and `hip-ref` backends


# d075f50b 11-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

H(div) and H(curl) basis support for cuda-ref and hip-ref backends


# 506b1a0c 20-Sep-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Non-square operator full assembly (#1316)

* Consistency formatting for operator tests

* Add (failing) test for non-square operator full assembly

* Add support for CPU-based full assembly of no

Non-square operator full assembly (#1316)

* Consistency formatting for operator tests

* Add (failing) test for non-square operator full assembly

* Add support for CPU-based full assembly of non-square CeedOperators

* Fix full assembly of identity quadrature functions and operators with CEED_BASIS_NONE

Also adds a unit test for full assembly which fails prior to the changes in this commit.

* Minor refactor to improve coverage

show more ...


# 21f16bf6 13-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1315 from CEED/jeremy/queep

Backend Consistency Fixes


# ca735530 31-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

style - fixes for CUDA backends


# 31c137a9 01-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1320 from CEED/jeremy/jit-header-guards

style - fix header guards


# 94b7b29b 01-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

style - fix header guards


# 3c708d80 24-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1299 from CEED/sjg/cuda-hip-backends-consistency

Style fixes for `cuda` and `hip` backend consistency


# b2165e7a 11-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Whitespace, style, and formatting updates for consistency between CUDA and HIP backends

Adds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-sh

Whitespace, style, and formatting updates for consistency between CUDA and HIP backends

Adds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-shared and hip-shared as well as sycl.

show more ...


# 53f7acb1 24-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1292 from CEED/sjg/gpu-rstr-unify

Unify `magma` backend restriction with `cuda-ref` and `hip-ref`


# 58549094 15-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Unify magma backend restriction with cuda/hip-ref, keeping runtime option for deterministic and non-deterministic using magma/det

This also opens the opportunity to make cuda/hip-ref non-determinist

Unify magma backend restriction with cuda/hip-ref, keeping runtime option for deterministic and non-deterministic using magma/det

This also opens the opportunity to make cuda/hip-ref non-deterministic by default and add cuda/hip-ref/det variants.

show more ...


# 38432684 03-Aug-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Merge pull request #1265 from CEED/sjg/hcurl-restr-dev

CeedElemRestriction for H(curl)


# 0c73c039 22-Jun-2023 Sebastian Grimberg <sjg@amazon.com>

Improve element restriction for H(curl) spaces by reorganizing loops, including use of int8_t for tridiagonal matrix


# 0305e208 06-May-2023 Sebastian Grimberg <sjg@amazon.com>

Update backends for unified ElemRestrictionCreate variants for all restriction types (default, oriented, strided)


# 4b35598d 20-Jun-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1231 from CEED/jeremy/consistency

Consistency fixes


# eb7e6caf 16-Jun-2023 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - naming consistency fixes


# 30d6126f 25-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1202 from CEED/jeremy/const-fix

Clean up backend headers for const and argument names


# 472941f0 21-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

minor - fix static vs CEED_INTERN in backend file


# 51475c7c 20-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

minor - clean up backend headers for const and argument names


# 6e6704a8 19-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1198 from CEED/jeremy/CeedCheck

Add CeedCheck macro to reduce repetition


# 6574a04f 18-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

internal - add CeedCheck macro to reduce repetition


# 49aac155 24-Mar-2023 Jeremy L Thompson <jeremy@jeremylt.org>

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc heade

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc headers

* iwyu - ceed/ceed.h -> ceed.h

* iwyu - cuda/hip include fixes

show more ...


# 2b730f8b 17-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - re

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - remove stray newline

* make - simpler 'make format' target

* ci - use newer libxsmm

* doc - minor release note claification

* minor - minor fix

* minor - minor fix

* minor - minor fix

* minor - minor fix

* make format

* format - less aggressive alignment rules

* tidy - check for argument name mismatches

* fix newline

* format - mirror Ratel update to .clang-format

* fix merge error

* fix merge conflict

* fix merge error

* drop style in .phony list

* Update .clang-format

Co-authored-by: Jed Brown <jed@jedbrown.org>

* apply updated format

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# 9e201c85 23-Sep-2022 Yohann <dudouit1@llnl.gov>

Refactor `cuda-gen` and `hip-gen` backends. (#1050)

* Add TODO items.

* rough, but something like this?

* wip - cleaning up some warnings, but more remain

* wip - reorganize

* wip - miss

Refactor `cuda-gen` and `hip-gen` backends. (#1050)

* Add TODO items.

* rough, but something like this?

* wip - cleaning up some warnings, but more remain

* wip - reorganize

* wip - missing kernels

* wip - replace t1d

* fix some kernels

* another typo

* more

* another one

* closer

* define T_1D

* typosgit add .!

* WIP: changes to cuda-shared framework for new kernels

* fix output writing

* buffer fix

* buffer sizes

* WIP: fixes for 2 and 3D basis kernels

* minor

* fix weight kernel for 3d

* remove debugging output

* minor reorg

* fix includes

* enable collo grad for cuda-shared

* move quoted kernels

* renaming

* missed a rename

* small fix

* more naming consistency

* faster 'useCollograd=false' path in *-gen

* more style

* one last style fix

* clearer collograd condition

* Add gen basis kernels to hip-shared

* Try some changes to hip-shared basis block sizes for new kernels

* cuda - drop extra kernel arg

* cuda - fix collograd check logic

* update gen comment about parallelization

* tidy up fields struct definition

* tidy up structs even more

* Update hip-gen basis templates use and move other hip-gen device functions to jit-source

* Finish hip-gen basis template update; small style updates to match CUDA

* missing isStrided

* Update block size used in 3D weight for new shared kernels

* update release notes

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>

show more ...


# 428b7a12 06-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #977 from CEED/jeremy/fallback-yet-again

Add debugging output to fallback creation


1234