| #
e8c0849a
|
| 20-Nov-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2025-10-18/optimize-aij-ops' into 'main'
Refactor handling of diagonal marking in *AIJ and SELL matrices
See merge request petsc/petsc!8797
|
| #
07425a8d
|
| 18-Oct-2025 |
Barry Smith <bsmith@mcs.anl.gov> |
This MR optimize some MAT[SB]AIJ operations related to the diagonal
The longer term goal is to refactor some Mat_SeqXXX non-numeric code to make it easier to also implement it on GPUs. For example,
This MR optimize some MAT[SB]AIJ operations related to the diagonal
The longer term goal is to refactor some Mat_SeqXXX non-numeric code to make it easier to also implement it on GPUs. For example, MatGetDiagonal() and friends
This commit - MatMarkDiagonal_SeqAIJ/ELL() - now use change in mat->nonzerostate to determine if it needs to recheck the diagonal locations - sets diagDense flag for a complete diagonal - MatInvertDiagonal_SeqAIJ/ELL() - now uses change in mat->state to determine if diagonal entries and inverses need to be recomputed - name changed to MatInvertDiagonalForSOR_SeqAIJ/ELL() for code clarity - Added MatInvertDiagonal_SeqAIJ_Inode() for use by MatSOR_SeqAIJ_Inode() - now uses change in mat->state to determine if block diagonal entries and inverses need to be recomputed
show more ...
|
| #
cd871708
|
| 16-Sep-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'cuda' into 'main'
Fix CUDA 13 API incompatibilities
See merge request petsc/petsc!8632
|
| #
2695cf96
|
| 11-Sep-2025 |
Nuno Nobre <nuno.nobre@stfc.ac.uk> |
Gather CUDA lib includes in petscdevice_cuda.h + disable cuSPARSE warnings using DISABLE_CUSPARSE_DEPRECATED
Defining DISABLE_CUSPARSE_DEPRECATED is Nvidia's recommended way of disabling cuSPARSE wa
Gather CUDA lib includes in petscdevice_cuda.h + disable cuSPARSE warnings using DISABLE_CUSPARSE_DEPRECATED
Defining DISABLE_CUSPARSE_DEPRECATED is Nvidia's recommended way of disabling cuSPARSE warnings This avoids requiring -Wdeprecated-declarations - that can mask other (non-cuSPARSE) warnings
show more ...
|
| #
834855d6
|
| 27-Aug-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/clang-format-21' into 'main'
clang-format-21
See merge request petsc/petsc!8202
|
| #
ac530a7e
|
| 03-Jun-2025 |
Pierre Jolivet <pierre@joliv.et> |
Remove unnecessary braces around one-liners
git grep -lE "[ ]*(if|for|while) \(.*\) {[^;]*;[^;]*}$" -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | xargs sed -i '' -E 's#([ ]*)(if|for|while) \((.*)\
Remove unnecessary braces around one-liners
git grep -lE "[ ]*(if|for|while) \(.*\) {[^;]*;[^;]*}$" -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | xargs sed -i '' -E 's#([ ]*)(if|for|while) \((.*)\) {([^;]*);([^;]*)}$#\1\2 \(\3\)\4;\5#'
show more ...
|
| #
3a7d0413
|
| 12-May-2025 |
Pierre Jolivet <pierre@joliv.et> |
One-liners from petsc/petsc!5344 and petsc/petsc!5557
Slightly reworked regular expression
git ls-files -z -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | while IFS= read -r -d '' file; do cat
One-liners from petsc/petsc!5344 and petsc/petsc!5557
Slightly reworked regular expression
git ls-files -z -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | while IFS= read -r -d '' file; do cat $file | tr '\n' '\r' | sed -E 's/\r([ ]*)(for|if|while|else) ([^\r]*)\{\r[ ]*Petsc([a-zA-Z]*)\(([^\r]*)\);\r[ ]*\}\r/\r\1\2 \3Petsc\4(\5);\r/g' | tr '\r' '\n' > ${file}.joe; mv ${file}.joe ${file} done
show more ...
|
| #
6d8694c4
|
| 20-Mar-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2024-04-29/add-fortran-enums' into 'main'
Major updates to Fortran interface
See merge request petsc/petsc!7517
|
| #
ce78bad3
|
| 20-Feb-2025 |
Barry Smith <bsmith@petsc.dev> |
Fortran 90: fully embrace After 34 years!
- deprecate use of 'F90' in Fortran function names - use Fortran pointers when appropriate - the new Fortran API is not backward compatible with previous ve
Fortran 90: fully embrace After 34 years!
- deprecate use of 'F90' in Fortran function names - use Fortran pointers when appropriate - the new Fortran API is not backward compatible with previous versions! - also clean up inconsistent PETSc code detected by new Fortran generation tools - drop use of bfort - automatically generate all the Fortran PETSc objects, enums etc from the include files - generate most of the Fortran interface definitions and functions from the source code - simplify the number and organization of Fortran modules
Co-authored-by: Jose E. Roman <jroman@dsic.upv.es>
show more ...
|
| #
be37439e
|
| 21-Oct-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'stefanozampini/useless-cast' into 'main'
Remove useless cast
See merge request petsc/petsc!7894
|
| #
b7c0efca
|
| 01-Oct-2024 |
Stefano Zampini <stefano.zampini@gmail.com> |
PetscCeilInt: inline method
|
| #
3b91a372
|
| 26-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/sell-hip' into 'main'
Add SELLHIP
See merge request petsc/petsc!7338
|
| #
773bf0f6
|
| 05-Mar-2024 |
Hong Zhang <hongzhang@anl.gov> |
Add SELLHIP
- The HIP kernels are converted directly from their CUDA version - AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to faciliate comp
Add SELLHIP
- The HIP kernels are converted directly from their CUDA version - AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to faciliate compile-time code optimization
show more ...
|
| #
970231d2
|
| 07-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/clang-format-18' into 'main'
clang-format version 18
See merge request petsc/petsc!6902
|
| #
f4f49eea
|
| 11-Feb-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove multiple parentheses and extra semicolon
git grep -l -E "\(\([a-zA-Z0-9_]+\)\(\*[a-zA-Z0-9_]+\)[, )]" | xargs sed -r -i'' 's#\(\(([a-zA-Z0-9_]+)\)\(\*([a-zA-Z0-9_]+)\)([, )])#\(\(\1\)\*\2\3#g
Remove multiple parentheses and extra semicolon
git grep -l -E "\(\([a-zA-Z0-9_]+\)\(\*[a-zA-Z0-9_]+\)[, )]" | xargs sed -r -i'' 's#\(\(([a-zA-Z0-9_]+)\)\(\*([a-zA-Z0-9_]+)\)([, )])#\(\(\1\)\*\2\3#g' git grep -l -E "Petsc(.)*\(\(\*[a-zA-Z0-9_]*\)," | xargs sed -r -i'' 's#Petsc([a-zA-Z0-9_]*)\(\(\*([a-zA-Z0-9_]*)\), #Petsc\1(*\2, #g' git grep -l -E "([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)" | xargs sed -r -i'' 's#([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)#\1\2#g'
show more ...
|
| #
2131b729
|
| 30-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jose/fix-warn-cuda-complex' into 'main'
Fix compile warnings in CUDA code wih complex scalars
See merge request petsc/petsc!6845
|
| #
16a9b8de
|
| 30-Aug-2023 |
Jose E. Roman <jroman@dsic.upv.es> |
Fix compile warnings in CUDA code wih complex scalars
|
| #
9c5460f9
|
| 17-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-08-09/linter-detect-static-function-candidates' into 'main'
Linter detect possible static functions
See merge request petsc/petsc!6796
|
| #
66976f2f
|
| 14-Aug-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Add static to internal functions
|
| #
9cc24bc9
|
| 15-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'stefanozampini/fix-cuda-builds-host' into 'main'
MATSEQSELLCUDA: fix for older cuda archs
See merge request petsc/petsc!6445
|
| #
6eb97ccc
|
| 10-May-2023 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQSELLCUDA: fix for older cuda versions
move atomicAdd support from SF to private CPP header
|
| #
dd874c20
|
| 10-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/sell-cuda' into 'main'
SELL-based SpMV
See merge request petsc/petsc!3428
|
| #
8df136f9
|
| 08-Apr-2023 |
Hong Zhang <hongzhang@anl.gov> |
Finish MPICUDASELL
|
| #
8711c661
|
| 01-Apr-2023 |
Hong Zhang <hongzhang@anl.gov> |
Guard against complex build for unsupported kernels
|
| #
b921024e
|
| 06-Apr-2021 |
Hong Zhang <hongzhang@anl.gov> |
Add MatSeqSELLGetAvgSliceSize
It returns the variance of the slice sizes.
|