| #
6d8694c4
|
| 20-Mar-2025 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2024-04-29/add-fortran-enums' into 'main'
Major updates to Fortran interface
See merge request petsc/petsc!7517
|
| #
ce78bad3
|
| 20-Feb-2025 |
Barry Smith <bsmith@petsc.dev> |
Fortran 90: fully embrace After 34 years!
- deprecate use of 'F90' in Fortran function names
- use Fortran pointers when appropriate
- the new Fortran API is not backward compatible with previous versions!
- also clean up inconsistent PETSc code detected by new Fortran generation tools
- drop use of bfort
- automatically generate all the Fortran PETSc objects, enums etc from the include files
- generate most of the Fortran interface definitions and functions from the source code
- simplify the number and organization of Fortran modules
Co-authored-by: Jose E. Roman <jroman@dsic.upv.es>
|
| #
3b91a372
|
| 26-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/sell-hip' into 'main'
Add SELLHIP
See merge request petsc/petsc!7338
|
| #
887e4694
|
| 11-Mar-2024 |
Hong Zhang <hongzhang@anl.gov> |
Reuse MPISELL operations for SELLCUDA and SELLHIP
|
| #
bf10c3a9
|
| 16-Nov-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/fix-seealso' into 'main'
Ensure no leading white spaces in front of .seealso:
See merge request petsc/petsc!7027
|
| #
f1e39ca5
|
| 15-Nov-2023 |
Pierre Jolivet <pierre@joliv.et> |
Ensure no leading white spaces in front of .seealso:
|
| #
9c5460f9
|
| 17-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-08-09/linter-detect-static-function-candidates' into 'main'
Linter detect possible static functions
See merge request petsc/petsc!6796
|
| #
66976f2f
|
| 14-Aug-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Add static to internal functions
|
| #
02477ebb
|
| 15-Jul-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-06-30/enable-doc-lint' into 'main'
Enable docs linting
See merge request petsc/petsc!6667
|
| #
fe59aa6d
|
| 12-Jul-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Lint apply: mat
|
| #
be50c303
|
| 03-Jun-2023 |
Satish Balay <balay@mcs.anl.gov> |
docs: additional chapter_ -> ch_ change in main after merge of release changes from !6520
|
| #
dd874c20
|
| 10-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/sell-cuda' into 'main'
SELL-based SpMV
See merge request petsc/petsc!3428
|
| #
8df136f9
|
| 08-Apr-2023 |
Hong Zhang <hongzhang@anl.gov> |
Finish MPICUDASELL
|
| #
2d1451d4
|
| 09-Jan-2020 |
Hong Zhang <hongzhang@anl.gov> |
Initial commit for porting SELL to GPU
- Add tiled SpMV and basic SpMV for SeqSELL
- Tested in serial
- Offloadmask is used to determine when the matrix should be copied to GPU
- Use different slice height for CUDA version
- By checking the nonzerostate, PETSc can decide whether the whole matrix needs to be copied or just the values need to be copied
- Make the convert function public so that the very slow MatConvert_Basic can be avoided sometimes. E.g. one can use a two-step convert method: AIJ->SELL, SELL->SELLCUDA instead of the direct convert AIJ->SELLCUDA
- Make the FLOPS count for SELL the same as that for AIJCUSPARSE.
- MatDisAssemble is not needed.
- Change slice height from 32 to 16 for GPU
- To overlap communication with MatMult, VecScatterBegin() should be called before MatMult() for the diagonal part.
- SLICE_HEIGHT is defined to be 32 to match the warp size of GPU. For other cases, it is still 8.
Funded-by:
Project: PETSc for GPU
Time: 42 hours
Reported-by:
Thanks-to:
|