History log of /libCEED/backends/cuda/ceed-cuda-compile.cpp (Results 26 – 50 of 62)
Revision Date Author Comments
# 8d12f40e 07-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

hip - gen fallback to shared if error


# ddae5012 07-Feb-2025 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - gen fallback to shared if error


# de84fe53 25-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1698 from CEED/jeremy/debug-cleanup

Truncate jit output


# 26ef7cda 22-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

debug - truncate jit output


# e036be40 22-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1699 from CEED/jeremy/set-jit-defines

Add CeedAddJitDefine


# 4753b775 21-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

jit - add CeedAddJitDefine


# 1dc8b1e6 21-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1696 from CEED/jeremy/jit-include

JiT include update


# 91adc9c8 16-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

cuda/hip - include *-jit.h via #include


# a491a57e 16-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

jit - redirect ceed.h to ceed/types.h over ceed/ceed.h for jit


# b13efd58 16-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

jit - I include JiT source dirs set


# daaf13a4 16-Oct-2024 Jeremy L Thompson <jeremy@jeremylt.org>

jit - allow <> includes other than math, std*


# 9e9230d4 23-May-2024 Jed Brown <jed@jedbrown.org>

Merge pull request #1588 from CEED/jed/nvrtc-cubin

backends/cuda: NVRTC compile to CUBIN when supported (resolve #1587)


# 29ec485e 22-May-2024 Jed Brown <jed@jedbrown.org>

backends/cuda: NVRTC compile to CUBIN when supported (resolve #1587)

This allows using a newer CUDA runtime with an older driver, and seems
to have no downsides.

NVRTC can generate cubins directl

backends/cuda: NVRTC compile to CUBIN when supported (resolve #1587)

This allows using a newer CUDA runtime with an older driver, and seems
to have no downsides.

NVRTC can generate cubins directly starting with CUDA 11.1. [...]
NVRTC used to support only virtual architectures through the option
-arch, since it was only emitting PTX. It will now support actual
architectures as well to emit SASS. The interface is augmented to
retrieve either the PTX or cubin if an actual architecture is
specified.

https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#dynamic-code-generation

show more ...


# a171b6ef 27-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1537 from CEED/jeremy/pragma-once

Use #pragma once for non-JiT headers


# 5aed82e4 27-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

minor - update copyright headers


# b81bcd5a 06-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1504 from CEED/jeremy/debug-on-error

Write source to debug when CUDA/HIP fails to compile


# f5a0ec79 06-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

fix - need C string for debug printing


# edaedbd9 06-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

fix - print actual source code, not the defs only


# a4bfdec2 06-Mar-2024 Jeremy L Thompson <jeremy@jeremylt.org>

jit - write debug info when CUDA/HIP fails to compile


# 1f70653f 23-Feb-2024 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1488 from CEED/jeremy/const-path

Const for CeedQFunctionGetSourcePath


# 22070f95 23-Feb-2024 Jeremy L Thompson <jeremy@jeremylt.org>

internal - more updates for const


# 21f16bf6 13-Sep-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1315 from CEED/jeremy/queep

Backend Consistency Fixes


# ca735530 31-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

style - fixes for CUDA backends


# 3c708d80 24-Aug-2023 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #1299 from CEED/sjg/cuda-hip-backends-consistency

Style fixes for `cuda` and `hip` backend consistency


# b2165e7a 11-Aug-2023 Sebastian Grimberg <sjg@amazon.com>

Whitespace, style, and formatting updates for consistency between CUDA and HIP backends

Adds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-sh

Whitespace, style, and formatting updates for consistency between CUDA and HIP backends

Adds include guards in JiT header files, even if not strictly necessary, to match the precedent set in cuda-shared and hip-shared as well as sycl.

show more ...


123