History log of /libCEED/backends/cuda/ceed-cuda-compile.cpp (Results 51 – 62 of 62)
Revision Date Author Comments
# 4b35598d 20-Jun-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1231 from CEED/jeremy/consistency

Consistency fixes


# eb7e6caf 16-Jun-2023 Jeremy L Thompson <jeremy@jeremylt.org>

gpu - naming consistency fixes


# c5215752 24-Apr-2023 Sebastian Grimberg <sebastiangrimb@gmail.com>

Merge pull request #1197 from sebastiangrimberg/sjg/style-whitespace-fix

Minor style consistency updates


# 6e6704a8 19-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1198 from CEED/jeremy/CeedCheck

Add CeedCheck macro to reduce repetition


# 6574a04f 18-Apr-2023 Jeremy L Thompson <jeremy@jeremylt.org>

internal - add CeedCheck macro to reduce repetition


# 2a86cc9d 04-Mar-2023 Sebastian Grimberg <sjg@amazon.com>

Fix file endings inconsistency


# 49aac155 24-Mar-2023 Jeremy L Thompson <jeremy@jeremylt.org>

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc heade

IWYU fixes (#1182)

* iwyu - include fixes

* iwyu - silence some iwyu output

* minor - clearer macro names

* iwyu - fix suggestion of "ceed/ceed.h" externally

* iwyu - lighter petsc headers

* iwyu - ceed/ceed.h -> ceed.h

* iwyu - cuda/hip include fixes

show more ...


# 023b8a51 25-Jan-2023 abdelfattah83 <36712794+abdelfattah83@users.noreply.github.com>

magma: non-tensor rtc (#1141)

* some refactoring in magma's jit src

* fix path

* fix loading src

* refactor magma nontensor backend

* refactor magma nontensor backend

* [WIP]: new non

magma: non-tensor rtc (#1141)

* some refactoring in magma's jit src

* fix path

* fix loading src

* refactor magma nontensor backend

* refactor magma nontensor backend

* [WIP]: new nontensor basis kernels

* [WIP]: new nontensor basis kernels

* [WIP]: new nontensor basis kernels

* call the new nontensor kernels for low order problems

* multiple compilation for the same kernels but with different tuning parmaters

* magma: allow different nb's for different non-tensor kernels

* tuning data for the non-tensor rtc kernels

* remove no-longer used functions, add new one for tuning the nontensor kernels

* constants for tuning

* tuning functions

* use the tuning functions in compiling/running the new kernels

* bug fix

* fixes

* fixes

* minor

* switch tuning data

* fix name

* fix name

* add function to run cuda kernels with opt-in shared memory feature

* minor fix

* minor fix

* fix calls to batch api

* allow more kernel instances

* temporary timing function

* temporary timing function

* tuning data based on hiprtc

* rollback tuning parameters

* fixes

* fixes

* fix inconsistency in the parameters passed to nvrtc/hiprtc

* minor

* a fix to the nb selector

* cleanup

* merge the opt-in feature in CeedRunKernelDimSharedOptinCuda into CeedRunKernelDimSharedCuda

* fix paths for hip-magma backends

* style

* fixes

* running make format

* undo changes from the last commit

* change HIP_DIR to ROCM_DIR and adjust the paths for magma accordingly

* replace HIP_DIR with ROCM_DIR

show more ...


# 78a97f55 30-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1104 from CEED/jeremy/formatting

Minor Formatting Fixes


# ea61e9ac 30-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

minor - assorted formatting fixes


# 2b730f8b 17-Nov-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - re

Switch to clang-format (#1051)

* style - switch to clang-format

* ci - use newer libxsmm

* action - update format action

* format - consistent use of {} for multi-line if/for

* make - remove stray newline

* make - simpler 'make format' target

* ci - use newer libxsmm

* doc - minor release note claification

* minor - minor fix

* minor - minor fix

* minor - minor fix

* minor - minor fix

* make format

* format - less aggressive alignment rules

* tidy - check for argument name mismatches

* fix newline

* format - mirror Ratel update to .clang-format

* fix merge error

* fix merge conflict

* fix merge error

* drop style in .phony list

* Update .clang-format

Co-authored-by: Jed Brown <jed@jedbrown.org>

* apply updated format

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# c9c2c079 05-Aug-2022 Jeremy L Thompson <jeremy@jeremylt.org>

QF headers for typedefs and macros (#1036)

* jit - qf headers for typedefs and macros

* jit - smaller list of permitted files

* ceed - only include ceed.h in QF source


123