| 20291eb5 | 02-Mar-2020 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Fix bugs and simplify MatMultTransposeAdd_SeqAIJCUSPARSE
MatT always uses non-compressed row format on GPU. We should not use cusparsestruct->workVector, which is created for Mat instead of MatT, an
Fix bugs and simplify MatMultTransposeAdd_SeqAIJCUSPARSE
MatT always uses non-compressed row format on GPU. We should not use cusparsestruct->workVector, which is created for Mat instead of MatT, and might be less than A->cmap->n (when Mat has compressed rows)
show more ...
|
| 81902715 | 02-Mar-2020 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Save rowoffsets_gpu and use A->was_assembled to easily update Mat and MatT when matrix was assembled
Old code used if (A->assembled ..), which is wrong and could not take advantage of the fact that
Save rowoffsets_gpu and use A->was_assembled to easily update Mat and MatT when matrix was assembled
Old code used if (A->assembled ..), which is wrong and could not take advantage of the fact that A's non-zero pattern is not changed
show more ...
|
| a3fdcf43 | 05-Aug-2019 |
Karl Rupp <me@karlrupp.net> |
CUDA: Fixed issues in MatSeqAIJCUSPARSEGenerateTransposeForMult and MatMultTransposeAdd_SeqAIJCUSPARSE
This is a cherry-pick of commits dde4751, 435e334, 1d884b8, 4e32a5a Thanks-to: Mark Adams <ma23
CUDA: Fixed issues in MatSeqAIJCUSPARSEGenerateTransposeForMult and MatMultTransposeAdd_SeqAIJCUSPARSE
This is a cherry-pick of commits dde4751, 435e334, 1d884b8, 4e32a5a Thanks-to: Mark Adams <ma2325@columbia.edu>
show more ...
|