History log of /petsc/src/mat/impls/sell/mpi/mpicuda/mpisellcuda.cu (Results 1 – 14 of 14)
Revision Date Author Comments
# 6d8694c4 20-Mar-2025 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2024-04-29/add-fortran-enums' into 'main'

Major updates to Fortran interface

See merge request petsc/petsc!7517


# ce78bad3 20-Feb-2025 Barry Smith <bsmith@petsc.dev>

Fortran 90: fully embrace After 34 years!

- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous ve

Fortran 90: fully embrace After 34 years!

- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous versions!
- also clean up inconsistent PETSc code detected by new Fortran generation tools
- drop use of bfort
- automatically generate all the Fortran PETSc objects, enums etc from the include files
- generate most of the Fortran interface definitions and functions from the source code
- simplify the number and organization of Fortran modules

Co-authored-by: Jose E. Roman <jroman@dsic.upv.es>

show more ...


# 3b91a372 26-Mar-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'hongzh/sell-hip' into 'main'

Add SELLHIP

See merge request petsc/petsc!7338


# 887e4694 11-Mar-2024 Hong Zhang <hongzhang@anl.gov>

Reuse MPISELL operations for SELLCUDA and SELLHIP


# bf10c3a9 16-Nov-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jolivet/fix-seealso' into 'main'

Ensure no leading white spaces in front of .seealso:

See merge request petsc/petsc!7027


# f1e39ca5 15-Nov-2023 Pierre Jolivet <pierre@joliv.et>

Ensure no leading white spaces in front of .seealso:


# 9c5460f9 17-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-08-09/linter-detect-static-function-candidates' into 'main'

Linter detect possible static functions

See merge request petsc/petsc!6796


# 66976f2f 14-Aug-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Add static to internal functions


# 02477ebb 15-Jul-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-06-30/enable-doc-lint' into 'main'

Enable docs linting

See merge request petsc/petsc!6667


# fe59aa6d 12-Jul-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Lint apply: mat


# be50c303 03-Jun-2023 Satish Balay <balay@mcs.anl.gov>

docs: additional chapter_ -> ch_ change in main after merge of release changes from !6520


# dd874c20 10-Apr-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'hongzh/sell-cuda' into 'main'

SELL-based SpMV

See merge request petsc/petsc!3428


# 8df136f9 08-Apr-2023 Hong Zhang <hongzhang@anl.gov>

Finish MPICUDASELL


# 2d1451d4 09-Jan-2020 Hong Zhang <hongzhang@anl.gov>

Initial commit for porting SELL to GPU

- Add tiled SPMV and basic SpMVfor SeqSELL
- Tested in serial
- Offloadmask is used to determine when the matrix should be copied to GPU
- Use different slice

Initial commit for porting SELL to GPU

- Add tiled SPMV and basic SpMVfor SeqSELL
- Tested in serial
- Offloadmask is used to determine when the matrix should be copied to GPU
- Use different slice height for CUDA version
- By checking the nonzerostate, PETSc can decide if the whole matrix need to be copied or just the values need to be copied
- Make the convert function public so that the very slow MatConvert_Basic can be avoided sometimes. E.g. one can use a two-step convert method: AIJ->SELL,SELL->SELLCUDA instead of the direct convert AIJ->SELLCUDA
- Make the FLOPS count for SELL same as that for AIJCUSPARSE.
- MatDisAssemble is not needed.
- Change slice height from 32 to 16 for GPU
- To overlap communication with MatMult, VecScatterBegin() should be called before MatMult() for the diagonal part.
- SLICE_HEIGHT is defined to be 32 to match the warp size of GPU. For other cases, it is still 8.

Funded-by:
Project: PETSc for GPU
Time: 42 hours
Reported-by:
Thanks-to:

show more ...