| 9d77422e | 26-Jun-2019 |
Jed Brown <jed@jedbrown.org> |
Merge branch 'yohann/cuda-non-tensor' [PR #249]
* yohann/cuda-non-tensor: ceed-cuda: resolve -Wsign-compare for CUresult (unsigned enum) in CeedError make style. namespace cuda backends functi
Merge branch 'yohann/cuda-non-tensor' [PR #249]
* yohann/cuda-non-tensor: ceed-cuda: resolve -Wsign-compare for CUresult (unsigned enum) in CeedError make style. namespace cuda backends functions. Minor: styling Add CUDA_LIB_DIR_STUBS for systems that don't have CUDA drivers installed make style Remove useless function declaration. Add a reference non-tensor BasisApply for cuda backends.
show more ...
|
| a7bd39da | 10-Jun-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Fix underinterpolation mode for /cpu/self backends |
| 686de4ac | 27-May-2019 |
Jed Brown <jed@jedbrown.org> |
Makefile: add search for testing
The search argument works with make test, prove, and junit targets.
Remove "eval" code that was breaking parallel testing with -libceed_test. |
| 9fbf56ac | 14-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Update t208-f to use offset |
| be9261b7 | 28-Mar-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add ElemRestrictionApplyBlock |
| 9ad45357 | 16-May-2019 |
Yohann Dudouit <yohann.dudouit@gmail.com> |
Add a reference non-tensor BasisApply for cuda backends. |
| 88760429 | 30-Apr-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add t110-vec, set vec from another vec |
| c8b9fe72 | 30-Apr-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add offset parameter in Fortran VecSetArray |
| cbac262c | 27-Mar-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add juint exception |
| 7568297b | 22-Mar-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add t109-vec-f |
| 8cb26d4f | 03-Dec-2018 |
Yohann Dudouit <yohann.dudouit@gmail.com> |
Adds a Vector unit test using CEED_MEM_DEVICE. |
| d57f42f9 | 29-Apr-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Restore t308-f |
| 27d44ff5 | 29-Mar-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Modify t001 to remove clang warning |
| c907536f | 27-Mar-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add CeedGetPreferredMemType |
| bdb0bdbb | 22-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
junit.py: update test logic for skipped and intended-fail tests |
| 73132ccb | 21-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
junit.py: subprocess.run compatibility for python3.5 |
| 9bcbe8bd | 21-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
junit.py: avoid f-strings to support python3.5 |
| 8ec9d54b | 16-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
Add junit target and report test results to Azure |
| 82c7dee4 | 16-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
junit-xml: avoid "six", which may not be installed |
| dfafb49c | 16-Mar-2019 |
Jed Brown <jed@jedbrown.org> |
Merge commit '71c263ca3a3ce57731d1f7a04e93e4c31116f80b' as 'tests/junit-xml' |
| 55ae60f9 | 14-Mar-2019 |
Yohann <yohann.dudouit@gmail.com> |
Simple Cuda backend using one thread per element (#195)
Thanks-to: Jeremy Thompson
* Take into account the compute capability of the GPU
* Add the cuda/reg backend and rename cuda to cuda/ref.
Simple Cuda backend using one thread per element (#195)
Thanks-to: Jeremy Thompson
* Take into account the compute capability of the GPU
* Add the cuda/reg backend and rename cuda to cuda/ref.
- cuda/reg uses a simple approach where each element is
processed by one thread. This approach is expected to be
efficient for 1D and 2D problems, but very ineficient
as soon as the kernels start to spill, which should arise
around Q1D=4 for 3D problems.
* Compilation takes into account the deviceId
* Make style
* Remove dead code in cuda qFunctions.
* Cuda-reg specialized Restriction.
* Split the Prolongation operator into Identity/not Identity.
* Remove "#pragma unroll" until further perf investigation.
* README update
* Add a description of cuda/reg.
* Add CompositeOperator msg to CUDA backends
show more ...
|
| 52d6035f | 13-Mar-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Operator Composition (#197)
* Composite Operator for cpu/self family of backends
* Remove small leak
* Improve C tests
* Add composite operator to Fortran interface and tests
* Fix Fortr
Operator Composition (#197)
* Composite Operator for cpu/self family of backends
* Remove small leak
* Improve C tests
* Add composite operator to Fortran interface and tests
* Fix Fortran test missing destroys
* Fortran test okl files, currently not used
* fix error in composite ' add' flag logic
* Switch composite op tests to f90
* Check for operator type on utility functions
* Documentation and test cleanup
* Make Style
show more ...
|
| b502e64c | 08-Mar-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
CeedBasisApply check (#208)
* Add check to verify input/ouput vector lengths compatibility with basis
* Added test for CeedBasisApply and updated tap.sh to pass failure
* Updated check and tap
CeedBasisApply check (#208)
* Add check to verify input/ouput vector lengths compatibility with basis
* Added test for CeedBasisApply and updated tap.sh to pass failure
* Updated check and tap.sh
show more ...
|
| 4d1cd9fc | 06-Feb-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Add Nek to Travis (#169)
* Add test mode to Nek BP1 and BP3, improve Nek BPs
* Fix OCCA identity rst for multifield, minor NekBP1 fix
* Improve Nek run script
* Add Nek5K to prove-all
*
Add Nek to Travis (#169)
* Add test mode to Nek BP1 and BP3, improve Nek BPs
* Fix OCCA identity rst for multifield, minor NekBP1 fix
* Improve Nek run script
* Add Nek5K to prove-all
* Update travis yml for Nek5K
* Make style
* Adjust Travis yml
* Combine Nek run bash scripts
* Minor Nek script improvements
* Update to Nek 18.0 and reduce number of Nek compiler warnings
* Document required Nek5k version
* Remove stray command
* Remove extra file
* Adapt Nek for CUDA backend
* Fix Nek script string comparison
* Modify Nek script for better exit codes
* typo fix
* Modify the CU function names in nek/bp1.cu and nek/bp3.cu
* .cu file consistency
* Tidy Travis
* Tidy Travis
* Operator fixes
show more ...
|
| 8980d4a7 | 01-Feb-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Switch tests to .f90 extension |