History log of /petsc/src/mat/impls/dense/seq/dense.c (Results 26 – 50 of 1341)
Revision Date Author Comments
# 6bfab512 28-Apr-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2025-04-20/docs-linear-regressor-in-ksp' into 'main'

Mention linear regressor in KSP docs and KSP usage from linear regressor

See merge request petsc/petsc!8334


# 789736e1 20-Apr-2025 Barry Smith <bsmith@mcs.anl.gov>

Mention linear regressor in KSP docs and KSP usage from linear regressor


# 58bddbc0 11-Apr-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'tisaac/feature-lmvm-rework' into 'main'

MatLMVM rework

See merge request petsc/petsc!6401


# d016bdde 26-Mar-2025 Toby Isaac <toby.isaac@gmail.com>

Mat: Fix and improve the performance of dense matrix multiplication

Mat: Add MATDENSEFROMVECTYPE constructor type

Now in a tests set you can do

```
testset:
args: -mat_type densefromvectype

Mat: Fix and improve the performance of dense matrix multiplication

Mat: Add MATDENSEFROMVECTYPE constructor type

Now in a tests set you can do

```
testset:
args: -mat_type densefromvectype
test: test_cuda
requires: cuda
args: -vec_type veccuda
test: test_hip
requires: hip
args: -vec_type vechip
```

(This assumes that you call `MatSetVecType()` before you call
`MatSetFromOptions()`)

Mat_MPIDense: Cache offsets of MatDenseGetSubMatrix() to avoid communication in more cases

Mat: Add missing implementations for internal "MatMultColumnRange()" interface

Mat_MPIDense: Fix the zeroing of buffers in multiplication routines

Mat_MPIDense: Add optimization of MatMatMult routines when all columns are owned by rank 0

The communication for intermediate buffers can be handled with allreduce / bcast operations,
but we use the PetscSF matvec context instead of MPI routines directly so that we will
use gpu-aware MPI if possible.

show more ...


# 3853def2 26-Mar-2025 Toby Isaac <toby.isaac@gmail.com>

CUPM: Fix some mult routines and make some small performance improvements

VecSeq_CUPM: Fix ::Dot, ::TDot, ::WAXPYAsync, and ::AXPBYAsync to work with non-device vectors

Mat_SeqDenseCUPM: fix ::SetR

CUPM: Fix some mult routines and make some small performance improvements

VecSeq_CUPM: Fix ::Dot, ::TDot, ::WAXPYAsync, and ::AXPBYAsync to work with non-device vectors

Mat_SeqDenseCUPM: fix ::SetRandom for hip

Mat_SeqDenseCUPM: implement conjugate on the device

Mat_SeqDenseCUPM: Fix MatMult() (and friends) when the vector is not on the device

This implementation assumes that the cost of moving the matrix to the
host is more expensive that moving the vector to the device, so temporary
device copies of the host vectors are used.

VecCUPM: Avoid device synchronization in some cases of ResetArray()

The documentation says that the PetscDeviceSynchronization() is
only needed if CopyToDevice_() resulted in a HtoD memcpy,
which would only happen if v->offloadmask == PETSC_OFFLOAD_CPU.
If we test this condition, we can avoid the synchronization.

This improves the peformance of MatDenseRestoreColumnVecWrite() in
performance critical loops.

CUPM: use thrust::hip::par_nosync

show more ...


# 595506c3 09-Apr-2025 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release'


# 9962b8f6 09-Apr-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/fix-whigher-precision' into 'release'

Fix -Whigher-precision-for-complex-division with Intel oneAPI 2025.1.0

See merge request petsc/petsc!8282


# 883424ca 06-Apr-2025 Pierre Jolivet <pierre@joliv.et>

Fix -Whigher-precision-for-complex-division with Intel oneAPI 2025.1.0

src/mat/impls/mffd/mffddef.c:78:31: warning: excess precision is requested but the target does not support excess precision whi

Fix -Whigher-precision-for-complex-division with Intel oneAPI 2025.1.0

src/mat/impls/mffd/mffddef.c:78:31: warning: excess precision is requested but the target does not support excess precision which may result in observable differences in complex division behavior [-Whigher-precision-for-complex-division]
78 | *h = ctx->error_rel * dot / (nrm * nrm);
| ^
1 warning generated.
src/dm/interface/dmgeommodel.c:42:77: warning: excess precision is requested but the target does not support excess precision which may result in observable differences in complex division behavior [-Whigher-precision-for-complex-division]
42 | for (PetscInt d = 0; d < dE - 1; ++d) gcoords[d] = gmax[0] * mcoords[d] / norm;
| ^
src/ksp/ksp/impls/bcgs/qmrcgs/qmrcgs.c:117:32: warning: excess precision is requested but the target does not support excess precision which may result in observable differences in complex division behavior, additional uses where the requested higher precision cannot be honored were found but not diagnosed [-Whigher-precision-for-complex-division]
117 | cf = theta * theta * eta / alpha;
| ^
src/mat/impls/dense/seq/dense.c:1043:73: warning: excess precision is requested but the target does not support excess precision which may result in observable differences in complex division behavior, additional uses where the requested higher precision cannot be honored were found but not diagnosed [-Whigher-precision-for-complex-division]
1043 | x[i] = (1. - omega) * x[i] + omega * (xt + v[i + i * m] * x[i]) / (v[i + i * m] + shift);
| ^

show more ...


# 6d8694c4 20-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2024-04-29/add-fortran-enums' into 'main'

Major updates to Fortran interface

See merge request petsc/petsc!7517


# ce78bad3 20-Feb-2025 Barry Smith <bsmith@petsc.dev>

Fortran 90: fully embrace After 34 years!

- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous ve

Fortran 90: fully embrace After 34 years!

- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous versions!
- also clean up inconsistent PETSc code detected by new Fortran generation tools
- drop use of bfort
- automatically generate all the Fortran PETSc objects, enums etc from the include files
- generate most of the Fortran interface definitions and functions from the source code
- simplify the number and organization of Fortran modules

Co-authored-by: Jose E. Roman <jroman@dsic.upv.es>

show more ...


# 7e4f6f19 07-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/matoption-no-error' into 'main'

MatSetOption: ignore unused options instead of erroring out

See merge request petsc/petsc!8179


# 888c827c 26-Feb-2025 Stefano Zampini <stefano.zampini@gmail.com>

MatSetOption: ignore unused options instead of erroring out


# 49c2f904 01-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'release'


# 24af29f3 01-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/fix-matsetoption' into 'release'

Mat: fix regression from c8ca1fbc97989d7f13a4db197e36b5d2efa188ca

See merge request petsc/petsc!8171


# 9725ddff 28-Feb-2025 Pierre Jolivet <pierre@joliv.et>

Mat: fix regression from c8ca1fbc97989d7f13a4db197e36b5d2efa188ca


# 24676f2a 28-Jan-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/rm-useless-code' into 'main'

Remove useless functions

See merge request petsc/petsc!8101


# b7103cf4 27-Jan-2025 Pierre Jolivet <pierre@joliv.et>

Remove useless functions


# 2355682a 16-Nov-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/fix-overflow-seqdense' into 'main'

Mat: avoid overflow when loading a MatSeqDense

See merge request petsc/petsc!8008


# 3e1d7bce 14-Nov-2024 Pierre Jolivet <pierre@joliv.et>

Mat: avoid overflow when loading a MatSeqDense


# a81650ca 29-Oct-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'lindad/new-aij-from-hash' into 'main'

Add MatCopyHashToXAIJ() to copy hash table entries into an XAIJ matrix type

See merge request petsc/petsc!7939


# fe1fc275 29-Oct-2024 Alexander <alexander.lindsay@inl.gov>

Add MatCopyHashToXAIJ() to copy hash table entries into an XAIJ matrix type


# be37439e 21-Oct-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/useless-cast' into 'main'

Remove useless cast

See merge request petsc/petsc!7894


# 835f2295 05-Oct-2024 Stefano Zampini <stefano.zampini@gmail.com>

Brain dead fixes for useless casts


# e4a561a4 07-Oct-2024 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release'


# fd0575bf 07-Oct-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jose/fix-compile-warnings' into 'release'

Fix compiler warnings

See merge request petsc/petsc!7923


12345678910>>...54