History log of /petsc/src/mat/impls/sell/seq/seqcuda/sellcuda.cu (Results 1 – 25 of 32)
Revision Date Author Comments
# e8c0849a 20-Nov-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2025-10-18/optimize-aij-ops' into 'main'

Refactor handling of diagonal marking in *AIJ and SELL matrices

See merge request petsc/petsc!8797


# 07425a8d 18-Oct-2025 Barry Smith <bsmith@mcs.anl.gov>

This MR optimizes some MAT[SB]AIJ operations related to the diagonal

The longer term goal is to refactor some Mat_SeqXXX non-numeric code to make it easier to also implement it on GPUs. For example, MatGetDiagonal() and friends

This commit
- MatMarkDiagonal_SeqAIJ/ELL()
  - now uses the change in mat->nonzerostate to determine if it needs to recheck the diagonal locations
  - sets the diagDense flag for a complete diagonal
- MatInvertDiagonal_SeqAIJ/ELL()
  - now uses the change in mat->state to determine if diagonal entries and inverses need to be recomputed
  - name changed to MatInvertDiagonalForSOR_SeqAIJ/ELL() for code clarity
- Added MatInvertDiagonal_SeqAIJ_Inode() for use by MatSOR_SeqAIJ_Inode()
  - now uses the change in mat->state to determine if block diagonal entries and inverses need to be recomputed
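
The state-based recheck described in this commit can be sketched as follows. This is a minimal illustration, not PETSc code: the ToyMat struct, its fields, and toy_mark_diagonal() are hypothetical names invented here, and the CSR layout is an assumption. The idea is only that a cached result is recomputed when a stored state counter no longer matches the matrix's current one.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a sparse matrix; this is NOT PETSc's Mat. */
typedef struct {
  long       nonzerostate; /* bumped whenever the nonzero pattern changes */
  size_t     n;            /* number of rows */
  const int *rowptr;       /* CSR row offsets, length n + 1 */
  const int *colidx;       /* CSR column indices */
  long       diag_state;   /* nonzerostate when diag[] was last computed, -1 if never */
  int       *diag;         /* diag[i] = position of entry (i,i) in colidx, or -1 */
  bool       diag_dense;   /* true when every row has a diagonal entry */
  int        recomputed;   /* demo counter: how many times the scan actually ran */
} ToyMat;

/* Locate the diagonal entries, skipping the scan if the pattern is unchanged. */
static void toy_mark_diagonal(ToyMat *A)
{
  if (A->diag_state == A->nonzerostate) return; /* cached result still valid */

  A->diag_dense = true;
  for (size_t i = 0; i < A->n; i++) {
    A->diag[i] = -1;
    for (int k = A->rowptr[i]; k < A->rowptr[i + 1]; k++) {
      if ((size_t)A->colidx[k] == i) {
        A->diag[i] = k;
        break;
      }
    }
    if (A->diag[i] < 0) A->diag_dense = false; /* row i has no diagonal entry */
  }
  A->diag_state = A->nonzerostate; /* remember which pattern this result belongs to */
  A->recomputed++;
}
```

Calling toy_mark_diagonal() twice in a row does the scan once; the second scan happens only after nonzerostate is incremented.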


# cd871708 16-Sep-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'cuda' into 'main'

Fix CUDA 13 API incompatibilities

See merge request petsc/petsc!8632


# 2695cf96 11-Sep-2025 Nuno Nobre <nuno.nobre@stfc.ac.uk>

Gather CUDA lib includes in petscdevice_cuda.h + disable cuSPARSE warnings using DISABLE_CUSPARSE_DEPRECATED

Defining DISABLE_CUSPARSE_DEPRECATED is Nvidia's recommended way of disabling cuSPARSE warnings
This avoids requiring -Wdeprecated-declarations - that can mask other (non-cuSPARSE) warnings


# 834855d6 27-Aug-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/clang-format-21' into 'main'

clang-format-21

See merge request petsc/petsc!8202


# ac530a7e 03-Jun-2025 Pierre Jolivet <pierre@joliv.et>

Remove unnecessary braces around one-liners

git grep -lE "[ ]*(if|for|while) \(.*\) {[^;]*;[^;]*}$" -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | xargs sed -i '' -E 's#([ ]*)(if|for|while) \((.*)\) {([^;]*);([^;]*)}$#\1\2 \(\3\)\4;\5#'


# 3a7d0413 12-May-2025 Pierre Jolivet <pierre@joliv.et>

One-liners from petsc/petsc!5344 and petsc/petsc!5557

Slightly reworked regular expression

git ls-files -z -- '*.c' '*.cxx' '*.cu' '*.h' '*.hpp' '*.cpp' | while IFS= read -r -d '' file; do
cat $file | tr '\n' '\r' | sed -E 's/\r([ ]*)(for|if|while|else) ([^\r]*)\{\r[ ]*Petsc([a-zA-Z]*)\(([^\r]*)\);\r[ ]*\}\r/\r\1\2 \3Petsc\4(\5);\r/g' | tr '\r' '\n' > ${file}.joe; mv ${file}.joe ${file}
done


# 6d8694c4 20-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2024-04-29/add-fortran-enums' into 'main'

Major updates to Fortran interface

See merge request petsc/petsc!7517


# ce78bad3 20-Feb-2025 Barry Smith <bsmith@petsc.dev>

Fortran 90: fully embrace After 34 years!

- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous versions!
- also clean up inconsistent PETSc code detected by new Fortran generation tools
- drop use of bfort
- automatically generate all the Fortran PETSc objects, enums etc from the include files
- generate most of the Fortran interface definitions and functions from the source code
- simplify the number and organization of Fortran modules

Co-authored-by: Jose E. Roman <jroman@dsic.upv.es>


# be37439e 21-Oct-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/useless-cast' into 'main'

Remove useless cast

See merge request petsc/petsc!7894


# b7c0efca 01-Oct-2024 Stefano Zampini <stefano.zampini@gmail.com>

PetscCeilInt: inline method


# 3b91a372 26-Mar-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'hongzh/sell-hip' into 'main'

Add SELLHIP

See merge request petsc/petsc!7338


# 773bf0f6 05-Mar-2024 Hong Zhang <hongzhang@anl.gov>

Add SELLHIP

- The HIP kernels are converted directly from their CUDA version
- AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to facilitate compile-time code optimization
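
As a rough illustration of fixing the warp size at compile time, the sketch below selects a constant with the preprocessor. The names TOY_WARP_SIZE and toy_round_up_to_warp() are hypothetical and are not taken from the PETSc sources; only __HIP_PLATFORM_AMD__ is a macro defined by the HIP toolchain when targeting AMD hardware.

```c
/* Hypothetical macro name; the real PETSc sources may use a different one. */
#if defined(__HIP_PLATFORM_AMD__)
  #define TOY_WARP_SIZE 64 /* default assumed for AMD GPUs, as stated in the commit */
#else
  #define TOY_WARP_SIZE 32 /* warp size on NVIDIA GPUs */
#endif

/* Round a thread count up to a whole number of warps.
   Because TOY_WARP_SIZE is a compile-time constant, the compiler can
   reduce the division and multiplication to shifts and masks. */
static int toy_round_up_to_warp(int nthreads)
{
  return ((nthreads + TOY_WARP_SIZE - 1) / TOY_WARP_SIZE) * TOY_WARP_SIZE;
}
```

A runtime query of the warp size would also work, but it would prevent this kind of constant folding and loop unrolling inside kernels.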


# 970231d2 07-Mar-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/clang-format-18' into 'main'

clang-format version 18

See merge request petsc/petsc!6902


# f4f49eea 11-Feb-2024 Pierre Jolivet <pierre@joliv.et>

Remove multiple parentheses and extra semicolon

git grep -l -E "\(\([a-zA-Z0-9_]+\)\(\*[a-zA-Z0-9_]+\)[, )]" | xargs sed -r -i'' 's#\(\(([a-zA-Z0-9_]+)\)\(\*([a-zA-Z0-9_]+)\)([, )])#\(\(\1\)\*\2\3#g'
git grep -l -E "Petsc(.)*\(\(\*[a-zA-Z0-9_]*\)," | xargs sed -r -i'' 's#Petsc([a-zA-Z0-9_]*)\(\(\*([a-zA-Z0-9_]*)\), #Petsc\1(*\2, #g'
git grep -l -E "([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)" | xargs sed -r -i'' 's#([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)#\1\2#g'


# 2131b729 30-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jose/fix-warn-cuda-complex' into 'main'

Fix compile warnings in CUDA code with complex scalars

See merge request petsc/petsc!6845


# 16a9b8de 30-Aug-2023 Jose E. Roman <jroman@dsic.upv.es>

Fix compile warnings in CUDA code with complex scalars


# 9c5460f9 17-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-08-09/linter-detect-static-function-candidates' into 'main'

Linter detect possible static functions

See merge request petsc/petsc!6796


# 66976f2f 14-Aug-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Add static to internal functions


# 9cc24bc9 15-May-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/fix-cuda-builds-host' into 'main'

MATSEQSELLCUDA: fix for older cuda archs

See merge request petsc/petsc!6445


# 6eb97ccc 10-May-2023 Stefano Zampini <stefano.zampini@gmail.com>

MATSEQSELLCUDA: fix for older cuda versions

move atomicAdd support from SF to private CPP header


# dd874c20 10-Apr-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'hongzh/sell-cuda' into 'main'

SELL-based SpMV

See merge request petsc/petsc!3428


# 8df136f9 08-Apr-2023 Hong Zhang <hongzhang@anl.gov>

Finish MPICUDASELL


# 8711c661 01-Apr-2023 Hong Zhang <hongzhang@anl.gov>

Guard against complex build for unsupported kernels


# b921024e 06-Apr-2021 Hong Zhang <hongzhang@anl.gov>

Add MatSeqSELLGetVarSliceSize

It returns the variance of the slice sizes.
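
For reference, the variance of a set of slice sizes can be computed as in the sketch below. This is an assumption-laden illustration, not the PETSc routine: toy_slice_size_variance() is a hypothetical helper, it takes a plain array of sizes, and it uses the population variance (dividing by the number of slices), which may differ from what the library computes.

```c
#include <stddef.h>

/* Hypothetical helper: population variance of the given slice sizes.
   Returns 0.0 for an empty input. */
static double toy_slice_size_variance(const int *sizes, size_t nslices)
{
  if (nslices == 0) return 0.0;

  double mean = 0.0;
  for (size_t i = 0; i < nslices; i++) mean += sizes[i];
  mean /= (double)nslices;

  double sumsq = 0.0;
  for (size_t i = 0; i < nslices; i++) {
    double d = sizes[i] - mean; /* deviation of slice i from the mean size */
    sumsq += d * d;
  }
  return sumsq / (double)nslices;
}
```

A variance near zero indicates slices of similar size, which is the case where a sliced ELLPACK layout wastes the least padding.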

