History log of /petsc/src/vec/is/sf/impls/basic/kokkos/sfkok.kokkos.cxx (Results 1 – 25 of 58)
Revision Date Author Comments
# 97fff7b2 07-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2025-03-05/revise-PetscGetKokkosExecutionSpace' into 'main'

Return execution space instead of reference to simplify the code

See merge request petsc/petsc!8182


# 4df4a32c 07-Mar-2025 Junchao Zhang <jczhang@mcs.anl.gov>

Kokkos: return execution space instead of reference to simplify the code

A Kokkos execution space is a shared pointer. We don't need to reference it


# b7b2c57c 05-Feb-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2025-01-30/feature-support-AMD-MI300A' into 'main'

Add support of AMD MI300A

Closes #1703

See merge request petsc/petsc!8110


# 45402d8a 30-Jan-2025 Junchao Zhang <jczhang@anl.gov>

Kokkos: add support of AMD MI300A

* Use HostMirrorMemorySpace instead of HostSpace to fix compile errors on MI300A

* Replace Kokkos::HostSpace with HostMirrorMemorySpace to fix compile errors on MI

Kokkos: add support of AMD MI300A

* Use HostMirrorMemorySpace instead of HostSpace to fix compile errors on MI300A

* Replace Kokkos::HostSpace with HostMirrorMemorySpace to fix compile errors on MI300A, since the latter is what Kokkos::DualView use for its host view

* Fix a subtle bug in KokkosDualViewSync() w.r.t MI300A. Suppose we want to sync a petsc VecKokkos v on host. On MI300A, the host copy v_h and the device copy v_d share the memory. So in the old code, we used if (v_dual.need_sync_host()) to skip the device to host memory copy. But I should not skip the exec.fence(). As the device might still have kernels writing v_d, we still need to sync the device/stream to make v_d ready for use on CPU (via v_h).

show more ...


# eb54662a 26-Nov-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2024-11-17/update-kokkos-4.5' into 'main'

Update to Kokkos to 4.5.00

See merge request petsc/petsc!8013


# ffc29c3a 26-Nov-2024 Junchao Zhang <jczhang@mcs.anl.gov>

Update to Kokkos to 4.5.00


# bfd69959 22-Sep-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2024-09-09/fix-sf-mpi-large-count-datatype' into 'main'

PetscSF: Support MPI derived data types created with large count

Closes #1625

See merge request petsc/petsc!7825


# d279a5e3 22-Sep-2024 Junchao Zhang <jczhang@mcs.anl.gov>

PetscSF: support MPI derived data types created with large count

When user created MPI derived data types with large count routines, e.g., MPI_Type_contiguous_c(4, MPI_DOUBLE, &newtype), and passed

PetscSF: support MPI derived data types created with large count

When user created MPI derived data types with large count routines, e.g., MPI_Type_contiguous_c(4, MPI_DOUBLE, &newtype), and passed newtype to PetscSF,
is it legal to decode the datatype with MPI_Type_get_envelope()? The MPI Forum debated but has yet to resolve this issue.

Our approach is to decode the datatype with MPI_Type_get_envelope_c() if available; otherwise with MPI_Type_get_envelope().

But currently we don't really support real big count values, say a type created by MPI_Type_contiguous_c(2^33, MPI_DOUBLE, &newtype).

We could support that, but just see no needs for now. We will error out if this kind of types are given to PetscSF.

show more ...


# 9ddca861 09-Sep-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'tisaac/bugfix-sfwindow' into 'main'

Add PetscSFRegisterPersistent() and fix PETSCSFWINDOW for empty ranks

See merge request petsc/petsc!7786


# e1187f0d 03-Sep-2024 Toby Isaac <toby.isaac@gmail.com>

PetscSF: add PetscSFGetDatatypeSize_Internal() for internal use


# 970231d2 07-Mar-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/clang-format-18' into 'main'

clang-format version 18

See merge request petsc/petsc!6902


# 4ad8454b 09-Feb-2024 Pierre Jolivet <pierre@joliv.et>

Remove some unneeded parentheses


# 4d86920d 10-Feb-2024 Pierre Jolivet <pierre@joliv.et>

checkbadSource: rules for PetscFunctionBegin and derivatives


# 48c57749 15-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2023-08-14/make-device2host-memcopy-async-safe' into 'main'

Make DualView sync_host() asynchronous by adding an execution space argument

See merge request petsc/petsc!6811


# e36ced11 14-Aug-2023 Junchao Zhang <jczhang@mcs.anl.gov>

Kokkos: make dual view sync_host() asynchronous by adding an execution space argument

sync_host() without the argument is synchronous (like cudaMemcpy D2H). With the argument,
it is asynchronous (l

Kokkos: make dual view sync_host() asynchronous by adding an execution space argument

sync_host() without the argument is synchronous (like cudaMemcpy D2H). With the argument,
it is asynchronous (like cudaMemcpyAsync D2H).

To make sure the host array contains valide data immediately after dualview.sync_host(),
we need to do exec.fence();

show more ...


# 89480967 14-Apr-2023 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release'


# 378155d4 14-Apr-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/2023-04-11/fix-sfkok-stream/release' into 'release'

Let SF Kokkos backend use an execution space made from PetscDefaultCuda/HipStream

See merge request petsc/petsc!6317


# 524fe776 14-Apr-2023 Junchao Zhang <jczhang@mcs.anl.gov>

Let SF Kokkos backend use an execution space made from PetscDefaultCuda/HipStream


# 31d78bcd 02-Feb-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2022-12-10/petscerrorcode-nodiscard' into 'main'

Feature: Non-discardable PetscErrorCode

See merge request petsc/petsc!5923


# 3ba16761 10-Dec-2022 Jacob Faibussowitsch <jacob.fai@gmail.com>

Make PetscErrorCode a non-discardable enum


# 061e922f 22-Sep-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2022-09-21/2-bike-2-shed' into 'main'

Feature: Bicycle Storage Facility 2

See merge request petsc/petsc!5661


# d71ae5a4 21-Sep-2022 Jacob Faibussowitsch <jacob.fai@gmail.com>

source code format changes due to .clang-format changes


# 58d68138 23-Aug-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2022-08-21/clang-format-source' into 'main'

format repository with clang-format

See merge request petsc/petsc!5541


# 9371c9d4 22-Aug-2022 Satish Balay <balay@mcs.anl.gov>

clang-format: convert PETSc sources to comply with clang-format


# b33f4bec 05-Apr-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/feature-less-checkfalse' into 'main'

Dividing by four the number of PetscCheckFalse()

See merge request petsc/petsc!5072


123