| 7d8d0e25 | 02-Dec-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
add initial ported hip-shared and hip-gen backends |
| f67c8563 | 29-Oct-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
change HIP blocksize to 256 |
| 461525f5 | 17-Sep-2020 |
Natalie Beams <246972+nbeams@users.noreply.github.com> |
Consolidate CUDA backends (#623)
* Travis - allow icc failure for now
* move cuda-reg basis init kernels to cuda-shared
* move cuda-reg restrictions to cuda-ref
* change delegate ceeds for
Consolidate CUDA backends (#623)
* Travis - allow icc failure for now
* move cuda-reg basis init kernels to cuda-shared
* move cuda-reg restrictions to cuda-ref
* change delegate ceeds for previous uses of cuda-reg
* remove cuda-reg backend
* update hip restrictions to match cuda
* update backends list in README
* make style
* update release notes for removal of cuda-reg
Co-authored-by: jeremylt <thompson.jeremy.luke@gmail.com>
show more ...
|
| 3069e47f | 20-Aug-2020 |
jeremylt <thompson.jeremy.luke@gmail.com> |
Hip - shorten up resource strncmp to remove requirement for trailing slash with /gpu/hip |
| 777ff853 | 14-Aug-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
QFunction Context Data Object (#596)
* Ctx - create context object for QFunction context data
* Context - rename UserContext -> QFunctionContext
* Ctx - add lcov markers
* Ctx - fix leak in
QFunction Context Data Object (#596)
* Ctx - create context object for QFunction context data
* Context - rename UserContext -> QFunctionContext
* Ctx - add lcov markers
* Ctx - fix leak in identity QFunctions
* Hip/Cuda - rename sync functions for vector/context
* Tests - lcov marker update
* QFunction - drop unused function
* Python - fix copy-paste errors
* Ctx - update notes for Fortran usage
* Fortran - drop unneeded cast
Co-authored-by: Jed Brown <jed@jedbrown.org>
* Interface - use void* for SetData interfaces
* Make - use call quiet for NVCC
* Interface - use void* for GetData interfaces
* Make - add quiet call option for examples
* Makefile - create common makefile to reduce duplication/complexity in example makefiles
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| e299b378 | 10-Aug-2020 |
jeremylt <thompson.jeremy.luke@gmail.com> |
Hip - add missing ierr |
| 0f09838f | 29-Jul-2020 |
jeremylt <thompson.jeremy.luke@gmail.com> |
Cuda/Hip - name QFunctions for easier profiling |
| 29b67289 | 29-Jul-2020 |
jeremylt <thompson.jeremy.luke@gmail.com> |
Hip - fix warning about snprintf |
| 752c3701 | 28-Jul-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Fix CodeCov Reports (#597)
* Tests - use qfunction headers for Fortran tests to improve bypass gcov issue
* Gitlab - use latest gcc on Noether
* Cuda/Hip - add case in reciprocal for completen
Fix CodeCov Reports (#597)
* Tests - use qfunction headers for Fortran tests to improve bypass gcov issue
* Gitlab - use latest gcc on Noether
* Cuda/Hip - add case in reciprocal for completeness
* Cuda - remove duplicate case
* Makefile - exclude fortran test headers from make style
* Travis - update to Focal
* Cov - adjust style to be consistent and avoid false misses
* Travis - update comments and style
show more ...
|
| d99fa3c5 | 28-Jul-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Op - add interface for multigrid level creation (#579)
* Op - add interface for multigrid level creation
* Op - add implementation for OperatorMultigridLevelCreate
* make style
* make tidy
Op - add interface for multigrid level creation (#579)
* Op - add interface for multigrid level creation
* Op - add implementation for OperatorMultigridLevelCreate
* make style
* make tidy
* Op - add test t550, fix errors
* Tests - add Fortran version of t550
* Tests - add t511 for testing tensor basis multigrid level setup
* make style and tidy
* Tests - fix t55* memory leaks
* Tests - add t552 for non-tensor basis multigrid levels
* CUDA - use CeedIntMax in shared CUDA backend
* Tests - add OCCA test exception for t55*
* Op - add lvector global prolongation multiplicity, simplifies user interface
* Solids - convert example to new interface
* make style
* Tests - convert t550 to multicomponent
* Solids - drop unused ceed_fine
* Python - add new multigrid level interface
* Python - fix operator wrap, use ceed python obj rather than ceed pointer
* Gallery - update comment slightly
* Tests - remove accidental duplicate test
* Multigrid - add size=2 case as well
* Operator - drop unneeded inline
* QFunction - simplify context ownership to match vector
* make style
* Python - update multigrid function signature
* Operator - refactor prolong/restrict qfunctions as scaling qfunctions
* Vector - add testing for reciprocal and add to Fortran/Python interfaces
* CUDA - add VectorReciprocal on device
* Gallery - drop specalized versions for 'Scale', wil fix performance hit later
* Hip - add vector reciprocal
* Operator - add more flexible prologation basis creation interface
* Vec - make sure data is set for VectorReciprocal
* Tests - drop ncomp for t550/1 so kernel is not too large for Magma backend
* Tests - add missing lcov markers
* make style
* Travis - allow ARM job to fail
* Travis - fix intel install
* Travis - try different install dir name for inteloneapi
* Travis - add ifort, ipp packages
* Tests - add missing lcov marker
show more ...
|
| 6bbcfef4 | 27-Jul-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Hip/Cuda - expand QFunction LCOV exception for failing to open qf source file |
| b2573fe1 | 27-Jul-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Device - put device kernels in separate 'kernels' folder in backends |
| e9f4dca0 | 27-Jul-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Cuda - add LCOV markers |
| 34f6cd3f | 27-Jul-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Hip - add LCOV markers |
| 4e0e931f | 24-Jul-2020 |
Jed Brown <jed@jedbrown.org> |
backends/hip: exclude testing of unimplemented code |
| cfb6011c | 22-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
change *.hip to *.hip.cpp |
| 9b75b0e4 | 21-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
make style |
| 3d05ce95 | 21-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
add hipblas norm to hip-ref |
| f56b74c7 | 21-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
fix spacing |
| 9550c042 | 21-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Add _Cuda or _Hip to Ceed[Host/Device]SetValue functions |
| cf7477b8 | 16-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
make style |
| 4e905529 | 16-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
update LinearAssemble* names to match PR #586 |
| 17fed040 | 15-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
update with changes from cuda-ref |
| 0df135b4 | 22-Jun-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
comment out the hiprtc-related calls causing memory errors |
| 81a63d6f | 17-Jun-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
fix size of options list for hiprtcCompile |