History log of /libCEED/rust/libceed-sys/c-src/backends/cuda-gen/ceed-cuda-gen.h (Results 26 – 50 of 55)
Revision Date Author Comments
# 94b7b29b 01-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

style - fix header guards


# 49aac155 24-Mar-2023 Jeremy L Thompson <jeremy@jeremylt.org>

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc heade

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc headers

* iwyu - ceed/ceed.h -> ceed.h

* iwyu - cuda/hip include fixes

show more ...


# 2b730f8b 17-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - re

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - remove stray newline

* make - simpler 'make format' target

* ci - use newer libxsmm

* doc - minor release note claification

* minor - minor fix

* minor - minor fix

* minor - minor fix

* minor - minor fix

* make format

* format - less aggressive alignment rules

* tidy - check for argument name mismatches

* fix newline

* format - mirror Ratel update to .clang-format

* fix merge error

* fix merge conflict

* fix merge error

* drop style in .phony list

* Update .clang-format

Co-authored-by: Jed Brown <jed@jedbrown.org>

* apply updated format

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# 9e201c85 23-Sep-2022 Yohann <dudouit1@llnl.gov>

Refactor `cuda-gen` and `hip-gen` backends. (#1050)

* Add TODO items.

* rough, but something like this?

* wip - cleaning up some warnings, but more remain

* wip - reorganize

* wip - miss

Refactor `cuda-gen` and `hip-gen` backends. (#1050)

* Add TODO items.

* rough, but something like this?

* wip - cleaning up some warnings, but more remain

* wip - reorganize

* wip - missing kernels

* wip - replace t1d

* fix some kernels

* another typo

* more

* another one

* closer

* define T_1D

* typosgit add .!

* WIP: changes to cuda-shared framework for new kernels

* fix output writing

* buffer fix

* buffer sizes

* WIP: fixes for 2 and 3D basis kernels

* minor

* fix weight kernel for 3d

* remove debugging output

* minor reorg

* fix includes

* enable collo grad for cuda-shared

* move quoted kernels

* renaming

* missed a rename

* small fix

* more naming consistency

* faster 'useCollograd=false' path in *-gen

* more style

* one last style fix

* clearer collograd condition

* Add gen basis kernels to hip-shared

* Try some changes to hip-shared basis block sizes for new kernels

* cuda - drop extra kernel arg

* cuda - fix collograd check logic

* update gen comment about parallelization

* tidy up fields struct definition

* tidy up structs even more

* Update hip-gen basis templates use and move other hip-gen device functions to jit-source

* Finish hip-gen basis template update; small style updates to match CUDA

* missing isStrided

* Update block size used in 3D weight for new shared kernels

* update release notes

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>

show more ...


# 428b7a12 06-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #977 from CEED/jeremy/fallback-yet-again

Add debugging output to fallback creation


# 6aa95790 06-Jun-2022 Jeremy L Thompson <jeremy@jeremylt.org>

pc - fix fallback for composite assembly


# ce18bed9 17-Mar-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #858 from CEED/jeremy/dump-copy-stuff

Strip redundant/outdated license info duplication


# 3d8e8822 17-Mar-2022 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright headers


# d92fedf5 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #863 from CEED/jeremy/gpu-jit-code

GPU - separate common code into separate folder


# 7fcac036 22-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - split common cuda/hip data into separate folder


# aa616312 10-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #855 from CEED/jeremy/drop-oper-e-data

Remove magic number 16


# bf4cb664 08-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

op - turn magic number 16 into constant CEED_FIELD_MAX


# ebc204c0 15-Apr-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #740 from CEED/natalie/device-id

Update device ID selection for HIP/CUDA/MAGMA backends


# 6dbfb411 05-Apr-2021 nbeams <246972+nbeams@users.noreply.github.com>

Update device ID selection for HIP/CUDA backends; add for MAGMA backends


# 874019bc 31-Mar-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #716 from CEED/jed/install-backend.h

Jed/install backend.h


# ec3da8bc 26-Mar-2021 Jed Brown <jed@jedbrown.org>

Install install backend headers under include/ceed/

This makes it possible to distribute source plugins that provide
additional backends. It's also used in MFEM, perhaps temporarily.

Deprecate ceed

Install install backend headers under include/ceed/

This makes it possible to distribute source plugins that provide
additional backends. It's also used in MFEM, perhaps temporarily.

Deprecate ceed-backend.h, which was not previously installed, but some
users accessed it from an in-place build.

Also install CUDA and HIP headers that allow users to provide CUfunction
and hipFunction_t.

Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu>
Requested-by: Andrew T. Barker <barker29@llnl.gov>

show more ...


# 3d576824 29-Jan-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

headers - clearify includes to not rely on transitive includes (#701)

* headers - clearify includes to not rely on transitive includes

* style - add header recommendations from 'include-what-you-

headers - clearify includes to not rely on transitive includes (#701)

* headers - clearify includes to not rely on transitive includes

* style - add header recommendations from 'include-what-you-use'

* style - apply 'include-what-you-use' changes to CUDA backends

* style - 'include-what-you-use' for hip backends

* style - drop ceed.h includes in gallery qf source

* docs - add dev notes for header files

* style - header style and alphabetize

show more ...


# 2a8ae808 17-Sep-2020 nbeams <246972+nbeams@users.noreply.github.com>

Merge branch 'main' into icl/hip-magma


# 18d499f1 17-Sep-2020 Yohann <dudouit1@llnl.gov>

Enable under-integration for cuda-shared and cuda-gen backends (#620)

* Support under integration in cuda-shared.

* Add under-integration to the cuda-gen backend.

* Fix bugs when under-integ i

Enable under-integration for cuda-shared and cuda-gen backends (#620)

* Support under integration in cuda-shared.

* Add under-integration to the cuda-gen backend.

* Fix bugs when under-integ in cuda-shared.

* Factor some code.

* Factor some code in cuda-gen.

* Guard more carefully.

* Introduce T1d.

* Fix a bug in readQuads3d

* Fix bugs in 3D.

* Fix a typo

* Safety init.

* Try something with ContractZ3d.

* Guard the add

* revert add.

* Add more thread guards

* Same as previous

* Fix a bug in add.

* style.

* Check that the bases are tensor in cuda-gen.

* move isTensor

* Add T1d to cuda-gen and guard contractions.

* Fix typos.

* add guards in 1d.

* Rewrite weight functions.

* typo

* CUDA - fix cuda-gen collocated check

* make style.

Co-authored-by: jeremylt <thompson.jeremy.luke@gmail.com>

show more ...


# 77ad9f29 15-Jul-2020 nbeams <246972+nbeams@users.noreply.github.com>

Merge branch 'main' into natalie/hip-ref-v2


# 4d36c801 24-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #568 from CEED/jeremy/cuda-destroy

Small CUDA Tidying


# 7df94212 23-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - clean up includes


# 621cd461 16-Mar-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #421 from SanderA/sanderarens/fix_ceed_cuda_subclasses

Add Ceed_Cuda struct to Ceed_Cuda_ref/shared/gen.


# d80fc06a 24-Feb-2020 jeremylt <jeremy.thompson@colorado.edu>

CUDA - use strides as template parameters for cuda/gen


# 920dcdc4 14-Feb-2020 jeremylt <jeremy.thompson@colorado.edu>

CUDA - initial impl of strided restrictions in cuda/gen


123