| #
97fff7b2
|
| 07-Mar-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2025-03-05/revise-PetscGetKokkosExecutionSpace' into 'main'
Return execution space instead of reference to simplify the code
See merge request petsc/petsc!8182
|
| #
4df4a32c
|
| 07-Mar-2025 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Kokkos: return execution space instead of reference to simplify the code
A Kokkos execution space is a shared pointer. We don't need to reference it
|
| #
b7b2c57c
|
| 05-Feb-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2025-01-30/feature-support-AMD-MI300A' into 'main'
Add support of AMD MI300A
Closes #1703
See merge request petsc/petsc!8110
|
| #
45402d8a
|
| 30-Jan-2025 |
Junchao Zhang <jczhang@anl.gov> |
Kokkos: add support of AMD MI300A
* Use HostMirrorMemorySpace instead of HostSpace to fix compile errors on MI300A
* Replace Kokkos::HostSpace with HostMirrorMemorySpace to fix compile errors on MI
Kokkos: add support of AMD MI300A
* Use HostMirrorMemorySpace instead of HostSpace to fix compile errors on MI300A
* Replace Kokkos::HostSpace with HostMirrorMemorySpace to fix compile errors on MI300A, since the latter is what Kokkos::DualView use for its host view
* Fix a subtle bug in KokkosDualViewSync() w.r.t MI300A. Suppose we want to sync a petsc VecKokkos v on host. On MI300A, the host copy v_h and the device copy v_d share the memory. So in the old code, we used if (v_dual.need_sync_host()) to skip the device to host memory copy. But I should not skip the exec.fence(). As the device might still have kernels writing v_d, we still need to sync the device/stream to make v_d ready for use on CPU (via v_h).
show more ...
|
| #
eb54662a
|
| 26-Nov-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-11-17/update-kokkos-4.5' into 'main'
Update to Kokkos to 4.5.00
See merge request petsc/petsc!8013
|
| #
ffc29c3a
|
| 26-Nov-2024 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Update to Kokkos to 4.5.00
|
| #
bfd69959
|
| 22-Sep-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-09-09/fix-sf-mpi-large-count-datatype' into 'main'
PetscSF: Support MPI derived data types created with large count
Closes #1625
See merge request petsc/petsc!7825
|
| #
d279a5e3
|
| 22-Sep-2024 |
Junchao Zhang <jczhang@mcs.anl.gov> |
PetscSF: support MPI derived data types created with large count
When user created MPI derived data types with large count routines, e.g., MPI_Type_contiguous_c(4, MPI_DOUBLE, &newtype), and passed
PetscSF: support MPI derived data types created with large count
When user created MPI derived data types with large count routines, e.g., MPI_Type_contiguous_c(4, MPI_DOUBLE, &newtype), and passed newtype to PetscSF, is it legal to decode the datatype with MPI_Type_get_envelope()? The MPI Forum debated but has yet to resolve this issue.
Our approach is to decode the datatype with MPI_Type_get_envelope_c() if available; otherwise with MPI_Type_get_envelope().
But currently we don't really support real big count values, say a type created by MPI_Type_contiguous_c(2^33, MPI_DOUBLE, &newtype).
We could support that, but just see no needs for now. We will error out if this kind of types are given to PetscSF.
show more ...
|
| #
9ddca861
|
| 09-Sep-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'tisaac/bugfix-sfwindow' into 'main'
Add PetscSFRegisterPersistent() and fix PETSCSFWINDOW for empty ranks
See merge request petsc/petsc!7786
|
| #
e1187f0d
|
| 03-Sep-2024 |
Toby Isaac <toby.isaac@gmail.com> |
PetscSF: add PetscSFGetDatatypeSize_Internal() for internal use
|
| #
970231d2
|
| 07-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/clang-format-18' into 'main'
clang-format version 18
See merge request petsc/petsc!6902
|
| #
4ad8454b
|
| 09-Feb-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove some unneeded parentheses
|
| #
4d86920d
|
| 10-Feb-2024 |
Pierre Jolivet <pierre@joliv.et> |
checkbadSource: rules for PetscFunctionBegin and derivatives
|
| #
48c57749
|
| 15-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2023-08-14/make-device2host-memcopy-async-safe' into 'main'
Make DualView sync_host() asynchronous by adding an execution space argument
See merge request petsc/petsc!6811
|
| #
e36ced11
|
| 14-Aug-2023 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Kokkos: make dual view sync_host() asynchronous by adding an execution space argument
sync_host() without the argument is synchronous (like cudaMemcpy D2H). With the argument, it is asynchronous (l
Kokkos: make dual view sync_host() asynchronous by adding an execution space argument
sync_host() without the argument is synchronous (like cudaMemcpy D2H). With the argument, it is asynchronous (like cudaMemcpyAsync D2H).
To make sure the host array contains valide data immediately after dualview.sync_host(), we need to do exec.fence();
show more ...
|
| #
89480967
|
| 14-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
378155d4
|
| 14-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2023-04-11/fix-sfkok-stream/release' into 'release'
Let SF Kokkos backend use an execution space made from PetscDefaultCuda/HipStream
See merge request petsc/petsc!6317
|
| #
524fe776
|
| 14-Apr-2023 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Let SF Kokkos backend use an execution space made from PetscDefaultCuda/HipStream
|
| #
31d78bcd
|
| 02-Feb-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2022-12-10/petscerrorcode-nodiscard' into 'main'
Feature: Non-discardable PetscErrorCode
See merge request petsc/petsc!5923
|
| #
3ba16761
|
| 10-Dec-2022 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Make PetscErrorCode a non-discardable enum
|
| #
061e922f
|
| 22-Sep-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2022-09-21/2-bike-2-shed' into 'main'
Feature: Bicycle Storage Facility 2
See merge request petsc/petsc!5661
|
| #
d71ae5a4
|
| 21-Sep-2022 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
source code format changes due to .clang-format changes
|
| #
58d68138
|
| 23-Aug-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2022-08-21/clang-format-source' into 'main'
format repository with clang-format
See merge request petsc/petsc!5541
|
| #
9371c9d4
|
| 22-Aug-2022 |
Satish Balay <balay@mcs.anl.gov> |
clang-format: convert PETSc sources to comply with clang-format
|
| #
b33f4bec
|
| 05-Apr-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/feature-less-checkfalse' into 'main'
Dividing by four the number of PetscCheckFalse()
See merge request petsc/petsc!5072
|