| #
901f9382
|
| 02-Jan-2026 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'release'
|
| #
55e133c9
|
| 22-Dec-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/minor-docs' into 'release'
Minor documentation fixes
See merge request petsc/petsc!8908
|
| #
758d4732
|
| 22-Dec-2025 |
Pierre Jolivet <pierre@joliv.et> |
Minor documentation fixes
|
| #
cd871708
|
| 16-Sep-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'cuda' into 'main'
Fix CUDA 13 API incompatibilities
See merge request petsc/petsc!8632
|
| #
b547d71c
|
| 05-Aug-2025 |
Nuno Nobre <nuno.nobre@stfc.ac.uk> |
Fix CUDA 13 API incompatibilities
Co-authored-by: Satish Balay <balay@mcs.anl.gov>
|
| #
58bddbc0
|
| 11-Apr-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'tisaac/feature-lmvm-rework' into 'main'
MatLMVM rework
See merge request petsc/petsc!6401
|
| #
d016bdde
|
| 26-Mar-2025 |
Toby Isaac <toby.isaac@gmail.com> |
Mat: Fix and improve the performance of dense matrix multiplication
Mat: Add MATDENSEFROMVECTYPE constructor type
Now in a testset you can do
```
testset:
  args: -mat_type densefromvectype
  test: test_cuda
    requires: cuda
    args: -vec_type veccuda
  test: test_hip
    requires: hip
    args: -vec_type vechip
```
(This assumes that you call `MatSetVecType()` before you call `MatSetFromOptions()`)
Mat_MPIDense: Cache offsets of MatDenseGetSubMatrix() to avoid communication in more cases
Mat: Add missing implementations for internal "MatMultColumnRange()" interface
Mat_MPIDense: Fix the zeroing of buffers in multiplication routines
Mat_MPIDense: Add optimization of MatMatMult routines when all columns are owned by rank 0
The communication for intermediate buffers could be handled with allreduce / bcast operations, but we use the PetscSF matvec context instead of calling MPI routines directly so that GPU-aware MPI is used when possible.
|
| #
3853def2
|
| 26-Mar-2025 |
Toby Isaac <toby.isaac@gmail.com> |
CUPM: Fix some mult routines and make some small performance improvements
VecSeq_CUPM: Fix ::Dot, ::TDot, ::WAXPYAsync, and ::AXPBYAsync to work with non-device vectors
Mat_SeqDenseCUPM: fix ::SetRandom for hip
Mat_SeqDenseCUPM: implement conjugate on the device
Mat_SeqDenseCUPM: Fix MatMult() (and friends) when the vector is not on the device
This implementation assumes that moving the matrix to the host is more expensive than moving the vector to the device, so temporary device copies of the host vectors are used.
VecCUPM: Avoid device synchronization in some cases of ResetArray()
The documentation says that the PetscDeviceSynchronization() is only needed if CopyToDevice_() resulted in a HtoD memcpy, which would only happen if v->offloadmask == PETSC_OFFLOAD_CPU. If we test this condition, we can avoid the synchronization.
This improves the performance of MatDenseRestoreColumnVecWrite() in performance-critical loops.
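The ResetArray() reasoning above can be illustrated with a small host-only model. This is a sketch under stated assumptions, not PETSc code: `OffloadMask`, `VecModel`, `copy_to_device()` and `reset_array()` are hypothetical stand-ins, and the counters merely record where a real implementation would issue a host-to-device copy or a device synchronization.

```cpp
// Hypothetical, simplified stand-in for PETSc's offload mask values.
enum class OffloadMask { Unallocated, Cpu, Gpu, Both };

// Hypothetical vector model: only tracks where the up-to-date data lives
// and counts the expensive operations we want to avoid.
struct VecModel {
  OffloadMask offloadmask = OffloadMask::Unallocated;
  int htod_copies = 0; // number of simulated host-to-device copies
  int syncs       = 0; // number of simulated device synchronizations
};

// Simulated copy-to-device: a transfer happens only when the host holds
// the sole up-to-date copy. Returns true if a transfer was performed.
inline bool copy_to_device(VecModel &v)
{
  if (v.offloadmask == OffloadMask::Cpu) {
    ++v.htod_copies;
    v.offloadmask = OffloadMask::Both;
    return true;
  }
  return false;
}

// Simulated reset: test the mask *before* copying, and synchronize only
// when that test says a host-to-device copy was going to happen.
inline void reset_array(VecModel &v)
{
  const bool needs_sync = (v.offloadmask == OffloadMask::Cpu);
  copy_to_device(v);
  if (needs_sync) ++v.syncs;
}
```

In this model, a vector whose data is already on the device (`OffloadMask::Gpu` or `OffloadMask::Both`) passes through `reset_array()` without incrementing `syncs`, which is the case the commit message describes as avoidable.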
CUPM: use thrust::hip::par_nosync
|
| #
d0e6bf2a
|
| 25-Sep-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/clang-format-19' into 'main'
clang-format-19
See merge request petsc/petsc!7644
|
| #
57508ece
|
| 17-Sep-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove unneeded parentheses
|
| #
bb59972c
|
| 05-May-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jed/fix-cupm-visibility' into 'main'
cupm: fix visibility to build without warnings
See merge request petsc/petsc!7526
|
| #
85f25e71
|
| 04-May-2024 |
Jed Brown <jed@jedbrown.org> |
cupm: fix visibility to build without warnings
Thanks-to: Lawrence Mitchell
|
| #
8c77480b
|
| 04-Apr-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
74577c92
|
| 02-Apr-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jose/release/cupm-int-mult' into 'release'
CUPM: do not forbid allocations larger than PetscInt size
See merge request petsc/petsc!7427
|
| #
16130775
|
| 02-Apr-2024 |
Jose E. Roman <jroman@dsic.upv.es> |
CUPM: do not forbid allocations larger than PetscInt size
This fixes an error triggered in SLEPc when creating a BV with large sizes, i.e., a dense Mat where the product of rows and columns overflows PetscInt, for instance 134217728 x 17
Reported-by: DenverCoder9
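The arithmetic behind the 134217728 x 17 example can be checked with a short sketch. It assumes a build where PetscInt is a 32-bit signed integer; `Int32Index`, `product_overflows_index()` and `entry_count()` are hypothetical names used only for illustration, not PETSc functions.

```cpp
#include <cstdint>
#include <limits>

// Assumption: stand-in for PetscInt in a build with 32-bit indices.
using Int32Index = std::int32_t;

// Returns true when rows * cols cannot be represented in Int32Index.
// Both arguments are assumed non-negative and individually representable
// in 32 bits, so the 64-bit product itself cannot overflow.
inline bool product_overflows_index(std::int64_t rows, std::int64_t cols)
{
  const std::int64_t product = rows * cols;
  return product > std::numeric_limits<Int32Index>::max();
}

// Computes the number of entries in a wider unsigned type, which is the
// kind of quantity an allocation size needs even when each dimension
// fits in Int32Index. Arguments are assumed non-negative.
inline std::uint64_t entry_count(Int32Index rows, Int32Index cols)
{
  return static_cast<std::uint64_t>(rows) * static_cast<std::uint64_t>(cols);
}
```

Here 134217728 is 2^27, so 134217728 x 17 = 2281701376, which exceeds the 32-bit signed maximum of 2147483647 even though each dimension fits comfortably.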
|
| #
5fe01c21
|
| 22-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hsuh/dense-lmvm-squashed' into 'main'
Dense LMVM
See merge request petsc/petsc!7371
|
| #
0be0d8bd
|
| 14-Mar-2024 |
Hansol Suh <hsuh@anl.gov> |
MatMult-ColumnRanges support added
|
| #
3c4b05c5
|
| 10-Jan-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'blanca/mat-mult-hermitian-transpose' into 'main'
Support for MatMultHermitianTranspose with MATDENSE, MATNEST and MATSCALAPACK
See merge request petsc/petsc!7171
|
| #
95571869
|
| 05-Jan-2024 |
Blanca Mellado Pinto <bmelpin@dsic.upv.es> |
Added MatMultHermitianTranspose and MatMultHermitianTransposeAdd for MATDENSECUPM
|
| #
9dd11ecf
|
| 25-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-08-17/header-guard-check' into 'main'
Check header guards
See merge request petsc/petsc!6822
|
| #
a4963045
|
| 18-Aug-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Convert all header guards to pragma once
|
| #
eef0bda4
|
| 30-Jun-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-06-13/mat-vec-more-cupm-functions' into 'main'
Add Some Missing Vec/Mat CUPM Functions
Closes #1399
See merge request petsc/petsc!6596
|
| #
04d0cef7
|
| 27-Jun-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Collapse MatDense_CUPM::Shift() and MatDense_CUPM::GetDiagonal() down to base class
|
| #
14277c92
|
| 14-Jun-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Add MatGetDiagonal_CUPM
|
| #
06e36fcd
|
| 01-Jun-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-05-30/misc-cxx-fixes-2' into 'main'
Misc C++-Related Fixes For PetscDevice And Friends Round 2
See merge request petsc/petsc!6526
|