History log of /petsc/src/mat/impls/dense/seq/cupm/matseqdensecupm.hpp (Results 1 – 25 of 39)
Revision Date Author Comments
# 901f9382 02-Jan-2026 Satish Balay <balay@mcs.anl.gov>

Merge branch 'release'


# 55e133c9 22-Dec-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/minor-docs' into 'release'

Minor documentation fixes

See merge request petsc/petsc!8908


# 758d4732 22-Dec-2025 Pierre Jolivet <pierre@joliv.et>

Minor documentation fixes


# cd871708 16-Sep-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'cuda' into 'main'

Fix CUDA 13 API incompatibilities

See merge request petsc/petsc!8632


# b547d71c 05-Aug-2025 Nuno Nobre <nuno.nobre@stfc.ac.uk>

Fix CUDA 13 API incompatibilities

Co-authored-by: Satish Balay <balay@mcs.anl.gov>


# 58bddbc0 11-Apr-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'tisaac/feature-lmvm-rework' into 'main'

MatLMVM rework

See merge request petsc/petsc!6401


# d016bdde 26-Mar-2025 Toby Isaac <toby.isaac@gmail.com>

Mat: Fix and improve the performance of dense matrix multiplication

Mat: Add MATDENSEFROMVECTYPE constructor type

Now in a tests set you can do

```
testset:
args: -mat_type densefromvectype

Mat: Fix and improve the performance of dense matrix multiplication

Mat: Add MATDENSEFROMVECTYPE constructor type

Now in a tests set you can do

```
testset:
args: -mat_type densefromvectype
test: test_cuda
requires: cuda
args: -vec_type veccuda
test: test_hip
requires: hip
args: -vec_type vechip
```

(This assumes that you call `MatSetVecType()` before you call
`MatSetFromOptions()`)

Mat_MPIDense: Cache offsets of MatDenseGetSubMatrix() to avoid communication in more cases

Mat: Add missing implementations for internal "MatMultColumnRange()" interface

Mat_MPIDense: Fix the zeroing of buffers in multiplication routines

Mat_MPIDense: Add optimization of MatMatMult routines when all columns are owned by rank 0

The communication for intermediate buffers can be handled with allreduce / bcast operations,
but we use the PetscSF matvec context instead of MPI routines directly so that we will
use gpu-aware MPI if possible.

show more ...


# 3853def2 26-Mar-2025 Toby Isaac <toby.isaac@gmail.com>

CUPM: Fix some mult routines and make some small performance improvements

VecSeq_CUPM: Fix ::Dot, ::TDot, ::WAXPYAsync, and ::AXPBYAsync to work with non-device vectors

Mat_SeqDenseCUPM: fix ::SetR

CUPM: Fix some mult routines and make some small performance improvements

VecSeq_CUPM: Fix ::Dot, ::TDot, ::WAXPYAsync, and ::AXPBYAsync to work with non-device vectors

Mat_SeqDenseCUPM: fix ::SetRandom for hip

Mat_SeqDenseCUPM: implement conjugate on the device

Mat_SeqDenseCUPM: Fix MatMult() (and friends) when the vector is not on the device

This implementation assumes that the cost of moving the matrix to the
host is more expensive that moving the vector to the device, so temporary
device copies of the host vectors are used.

VecCUPM: Avoid device synchronization in some cases of ResetArray()

The documentation says that the PetscDeviceSynchronization() is
only needed if CopyToDevice_() resulted in a HtoD memcpy,
which would only happen if v->offloadmask == PETSC_OFFLOAD_CPU.
If we test this condition, we can avoid the synchronization.

This improves the peformance of MatDenseRestoreColumnVecWrite() in
performance critical loops.

CUPM: use thrust::hip::par_nosync

show more ...


# d0e6bf2a 25-Sep-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/clang-format-19' into 'main'

clang-format-19

See merge request petsc/petsc!7644


# 57508ece 17-Sep-2024 Pierre Jolivet <pierre@joliv.et>

Remove unneeded parentheses


# bb59972c 05-May-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jed/fix-cupm-visibility' into 'main'

cupm: fix visibility to build without warnings

See merge request petsc/petsc!7526


# 85f25e71 04-May-2024 Jed Brown <jed@jedbrown.org>

cupm: fix visibility to build without warnings

Thanks-to: Lawrence Mitchell


# 8c77480b 04-Apr-2024 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release'


# 74577c92 02-Apr-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jose/release/cupm-int-mult' into 'release'

CUPM: do not forbid allocations larger than PetscInt size

See merge request petsc/petsc!7427


# 16130775 02-Apr-2024 Jose E. Roman <jroman@dsic.upv.es>

CUPM: do not forbid allocations larger than PetscInt size

This fixes an error triggered in SLEPc when creating a BV with large
sizes, i.e., a dense Mat where the product of rows and columns overflow

CUPM: do not forbid allocations larger than PetscInt size

This fixes an error triggered in SLEPc when creating a BV with large
sizes, i.e., a dense Mat where the product of rows and columns overflows
PetscInt, for instance 134217728 x 17

Reported-by: DenverCoder9

show more ...


# 5fe01c21 22-Mar-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'hsuh/dense-lmvm-squashed' into 'main'

Dense LMVM

See merge request petsc/petsc!7371


# 0be0d8bd 14-Mar-2024 Hansol Suh <hsuh@anl.gov>

MatMult-ColumnRanges support added


# 3c4b05c5 10-Jan-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'blanca/mat-mult-hermitian-transpose' into 'main'

Support for MatMultHermitianTranspose with MATDENSE, MATNEST and MATSCALAPACK

See merge request petsc/petsc!7171


# 95571869 05-Jan-2024 Blanca Mellado Pinto <bmelpin@dsic.upv.es>

Added MatMultHermitianTranspose and MatMultHermitianTransposeAdd for MATDENSECUPM


# 9dd11ecf 25-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-08-17/header-guard-check' into 'main'

Check header guards

See merge request petsc/petsc!6822


# a4963045 18-Aug-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Convert all header guards to pragma once


# eef0bda4 30-Jun-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-06-13/mat-vec-more-cupm-functions' into 'main'

Add Some Missing Vec/Mat CUPM Functions

Closes #1399

See merge request petsc/petsc!6596


# 04d0cef7 27-Jun-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Collapse MatDense_CUPM::Shift() and MatDense_CUPM::GetDiagonal() down to base class


# 14277c92 14-Jun-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Add MatGetDiagonal_CUPM


# 06e36fcd 01-Jun-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-05-30/misc-cxx-fixes-2' into 'main'

Misc C++-Related Fixes For PetscDevice And Friends Round 2

See merge request petsc/petsc!6526


12