| #
cd871708
|
| 16-Sep-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'cuda' into 'main'
Fix CUDA 13 API incompatibilities
See merge request petsc/petsc!8632
|
| #
2695cf96
|
| 11-Sep-2025 |
Nuno Nobre <nuno.nobre@stfc.ac.uk> |
Gather CUDA lib includes in petscdevice_cuda.h + disable cuSPARSE warnings using DISABLE_CUSPARSE_DEPRECATED
Defining DISABLE_CUSPARSE_DEPRECATED is Nvidia's recommended way of disabling cuSPARSE wa
Gather CUDA lib includes in petscdevice_cuda.h + disable cuSPARSE warnings using DISABLE_CUSPARSE_DEPRECATED
Defining DISABLE_CUSPARSE_DEPRECATED is Nvidia's recommended way of disabling cuSPARSE warnings This avoids requiring -Wdeprecated-declarations - that can mask other (non-cuSPARSE) warnings
show more ...
|
| #
d7f32bc4
|
| 28-Mar-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2025-03-27/add-cusparsespsv-updatematrix' into 'main'
aijcusparse: leverage cusparseSpSV_updateMatrix in cuda-12.1.1
See merge request petsc/petsc!8249
|
| #
204a0e31
|
| 28-Mar-2025 |
Junchao Zhang <jczhang@anl.gov> |
aijcusparse: leverage cusparseSpSV_updateMatrix in cuda-12.1.1
When only matrix values have changed, calling cusparseSpSV_updateMatrix() is faster than calling cusparseSpSV_analysis() again.
|
| #
afb41d4c
|
| 28-Mar-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2025-03-18/revise-aijkokkos-matsolve' into 'main'
Add options to do factorization and solve on host for matseqaijkokkos
See merge request petsc/petsc!8209
|
| #
f82ac72c
|
| 18-Mar-2025 |
Junchao Zhang <jczhang@anl.gov> |
Mat: change -mat_factor_bind_factorization <host | device> to -pc_factor_mat_factor_on_host <bool>
Also add an option -pc_factor_mat_solve_on_host
|
| #
76d69608
|
| 19-Jan-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
1d017dde
|
| 19-Jan-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2024-11-26/improve-man-pages/release' into 'release'
Improve some manual pages in KSP/SNES
See merge request petsc/petsc!8071
|
| #
0b4b7b1c
|
| 26-Nov-2024 |
Barry Smith <bsmith@mcs.anl.gov> |
Improve some manual pages in KSP/SNES
|
| #
11c8a98f
|
| 28-Aug-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'release-revert-7765'
|
| #
8433bb00
|
| 28-Aug-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'balay/ci-cuda-126' into 'release'
CI: update test to use cuda-12.6
See merge request petsc/petsc!7765
|
| #
fe5544b9
|
| 26-Aug-2024 |
Junchao Zhang <jczhang@anl.gov> |
cupsarse: add workarounds for cusparse bugs
See known issues at https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cusparse-release-12-6
|
| #
3c6c2bd6
|
| 21-Aug-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/x_remove-cuda-warnings-ci' into 'main'
Remove -Wdeprecated-declarations warnings
See merge request petsc/petsc!7771
|
| #
b0c00012
|
| 17-Aug-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove -Wdeprecated-declarations warnings
|
| #
9dd11ecf
|
| 25-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-08-17/header-guard-check' into 'main'
Check header guards
See merge request petsc/petsc!6822
|
| #
a4963045
|
| 18-Aug-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Convert all header guards to pragma once
|
| #
cd8ad754
|
| 06-Jun-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-06-01/improve-cupm-compile-time' into 'main'
CUPM Compile-time Optimizations
See merge request petsc/petsc!6535
|
| #
47f8145d
|
| 01-Jun-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Use only forward declarations for VecSeq_CUPM and VecMPI_CUPM. This elides needing to transitively include all of thrust and improves compile time of vecmpicupm, aij[cu|hip]sparse, and any others tha
Use only forward declarations for VecSeq_CUPM and VecMPI_CUPM. This elides needing to transitively include all of thrust and improves compile time of vecmpicupm, aij[cu|hip]sparse, and any others that include vecseqcupm.hpp.
show more ...
|
| #
ead65ede
|
| 22-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/feature-matdup-coo' into 'main'
Duplicate COO info in MatDuplicate(), but use reference counting to save memory
See merge request petsc/petsc!6459
|
| #
2c4ab24a
|
| 11-May-2023 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Mat: duplicate COO info in MatDuplicate, but do reference counting to save memory
Thanks to Maxime Bouyges <maxime.bouyges@gmail.com> for feature requests.
|
| #
592ea0f6
|
| 02-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/mat-remove-matsetvaluesdevice' into 'main'
remove MatSetValuesDevice
See merge request petsc/petsc!6378
|
| #
4c55d725
|
| 02-May-2023 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
remove MatSetValuesDevice
|
| #
bb27329f
|
| 28-Mar-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/use-cuda11.4-cutoff' into 'main'
Increase the cutoff cuda version number from 11.3 to 11.4 for some new CUDA APIs to avoid bugs
See merge request petsc/petsc!6246
|
| #
b917901d
|
| 27-Mar-2023 |
Junchao Zhang <jczhang@mcs.anl.gov> |
mataijcusparse: increase the cut off cuda version number to avoid a bug for unknwon reasons.
Even some cusparse APIs were introduced in cuda-11.3.0, we now use them only after cuda-11.4.0, which was
mataijcusparse: increase the cut off cuda version number to avoid a bug for unknwon reasons.
Even some cusparse APIs were introduced in cuda-11.3.0, we now use them only after cuda-11.4.0, which was two months younger than 11.3.0.
We met an error (see below) with cuda-11.3.x, but not with cuda versions lower or higher than 11.3. This might be a petsc bug. Using ifdefs, aijcusparse code takes different paths, making the issue difficult to investigate. So we just increase the cuda version to get away.
------- snes_tests-ex13_cuda
mpirun -n 4 ./ex13 -petsc_ci -dm_plex_dim 2 -benchmark_it 10 -dm_plex_box_faces 4,4 -dm_refine 2 -petscpartitioner_simple_process_grid 2,2 -petscpartitioner_simple_node_grid 1,1 -potential_petscspace_degree 2 -petscpartitioner_type simple -dm_plex_simplex 0 -snes_type ksponly -dm_view -ksp_type cg -pc_type gamg -pc_gamg_process_eq_limit 400 -ksp_norm_type unpreconditioned -ksp_converged_reason -dm_mat_type aijcusparse -dm_vec_type cuda
[3]PETSC ERROR: --------------------- Error Message -------------------------------------------------------------- [3]PETSC ERROR: GPU error [3]PETSC ERROR: cuSPARSE errorcode 7 (CUSPARSE_STATUS_INTERNAL_ERROR) : internal error [3]PETSC ERROR: See https://petsc.org/release/faq/ for trouble shooting. [3]PETSC ERROR: Petsc Development GIT revision: v3.18.5-1152-g648263c9 GIT Date: 2023-03-26 20:01:43 +0000 [3]PETSC ERROR: ./ex13 on a arch-kokkos-dbg named hong-gce-workstation by jczhang Mon Mar 27 15:55:44 2023 [3]PETSC ERROR: Configure options --PETSC_ARCH=arch-kokkos-dbg --with-debugging=1 --with-cc=mpicc --with-cxx=mpicxx --with-fc=0 --COPTFLAGS="-g -O0" --FOPTFLAGS="-g -O0" --CXXOPTFLAGS="-g -O0" --with-cuda --with-cudac=nvcc --with-strict-petscerrorcode [3]PETSC ERROR: #1 MatProductNumeric_SeqAIJCUSPARSE_SeqAIJCUSPARSE() at /scratch/jczhang/petsc/src/mat/impls/aij/seq/seqcusparse/aijcusparse.cu:2968 [3]PETSC ERROR: #2 MatProductNumeric_ABC_Basic() at /scratch/jczhang/petsc/src/mat/interface/matproduct.c:1129 [3]PETSC ERROR: #3 MatProductNumeric_MPIAIJBACKEND() at /scratch/jczhang/petsc/src/mat/impls/aij/mpi/mpiaij.c:7024 [3]PETSC ERROR: #4 MatProductNumeric() at /scratch/jczhang/petsc/src/mat/interface/matproduct.c:685 [3]PETSC ERROR: #5 MatPtAP() at /scratch/jczhang/petsc/src/mat/interface/matrix.c:9907 [3]PETSC ERROR: #6 PCSetUp_GAMG() at /scratch/jczhang/petsc/src/ksp/pc/impls/gamg/gamg.c:558 [3]PETSC ERROR: #7 PCSetUp() at /scratch/jczhang/petsc/src/ksp/pc/interface/precon.c:994 [3]PETSC ERROR: #8 KSPSetUp() at /scratch/jczhang/petsc/src/ksp/ksp/interface/itfunc.c:405 [3]PETSC ERROR: #9 KSPSolve_Private() at /scratch/jczhang/petsc/src/ksp/ksp/interface/itfunc.c:823 [3]PETSC ERROR: #10 KSPSolve() at /scratch/jczhang/petsc/src/ksp/ksp/interface/itfunc.c:1069 [3]PETSC ERROR: #11 SNESSolve_KSPONLY() at /scratch/jczhang/petsc/src/snes/impls/ksponly/ksponly.c:48 [3]PETSC ERROR: #12 SNESSolve() at /scratch/jczhang/petsc/src/snes/interface/snes.c:4666 [3]PETSC ERROR: #13 main() at ex13.c:193
show more ...
|
| #
0623d960
|
| 26-Mar-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/support-cuda12' into 'main'
Update matsolve related routines of aijcusparse to support CUDA-12.
See merge request petsc/petsc!6225
|