| #
874019bc
|
| 31-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #716 from CEED/jed/install-backend.h
Jed/install backend.h
|
| #
ec3da8bc
|
| 26-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Install backend headers under include/ceed/
This makes it possible to distribute source plugins that provide additional backends. It's also used in MFEM, perhaps temporarily.
Deprecate ceed-backend.h, which was not previously installed, but some users accessed it from an in-place build.
Also install CUDA and HIP headers that allow users to provide CUfunction and hipFunction_t.
Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu> Requested-by: Andrew T. Barker <barker29@llnl.gov>
|
| #
3d576824
|
| 29-Jan-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
headers - clarify includes to not rely on transitive includes (#701)
* headers - clarify includes to not rely on transitive includes
* style - add header recommendations from 'include-what-you-use'
* style - apply 'include-what-you-use' changes to CUDA backends
* style - 'include-what-you-use' for hip backends
* style - drop ceed.h includes in gallery qf source
* docs - add dev notes for header files
* style - header style and alphabetize
|
| #
77ad9f29
|
| 15-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Merge branch 'main' into natalie/hip-ref-v2
|
| #
db3f49c5
|
| 30-Jun-2020 |
Yohann <dudouit1@llnl.gov> |
Merge branch 'master' into yohann/nameKernels
|
| #
4753c0fa
|
| 28-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #570 from CEED/jeremy/debug-mode
Add Debug Mode
|
| #
b21e5731
|
| 26-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
MAGMA - remove backend specific debug mode in favor of library debug mode
|
| #
e0582403
|
| 15-May-2020 |
abdelfattah83 <36712794+abdelfattah83@users.noreply.github.com> |
Icl/magma queue (#524)
Update the MAGMA backend:
* add new specialized tensor basis kernels
* add batched DGEMM wrapper for non-tensor basis kernels
* switch backend to use MAGMA v2 interface
Co-authored-by: nbeams <246972+nbeams@users.noreply.github.com>
Co-authored-by: Stan Tomov <tomov@eecs.utk.edu>
|
| #
65275b31
|
| 13-May-2020 |
valeriabarra <valeriabarra21@gmail.com> |
Merge branch 'master' into valeria/NSfixes
|
| #
a8c028e3
|
| 07-May-2020 |
Natalie Beams <246972+nbeams@users.noreply.github.com> |
CEED_STRIDES_BACKEND optimization for cuda-ref operator apply (#528)
* add check for backend stride status for input vectors
* add backend strides check for output vectors
* replace output copy with elem restriction for none emode
* move input skip_restrict check to setup and never allocate E-vec if not needed
* add boolean variable for E/Q vector layout for further optimization of output and add wrapper function in magma backend to create a cuda-ref operator and change this state variable
* Add missing CeedChks
* style changes to better match cuda backends
* missed style change for evec check
* add CeedChk from PR #525 (merge conflict)
* make style changes
* adjust size of nqpts for non-tensor basis
|
| #
d655899a
|
| 22-Apr-2020 |
Natalie Beams <246972+nbeams@users.noreply.github.com> |
Update names in MAGMA restrictions (#517)
* update comments in restriction cuda file
* renamed indices --> offsets for restrictions
* fix comments for L-Vector size
|
| #
68d8d928
|
| 22-Apr-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #513 from CEED/jeremy/restriction-no-interlace
Convert restriction to offsets
|
| #
8dc8d968
|
| 17-Apr-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
MAGMA - remove use of imode
|
| #
38d0029a
|
| 05-Mar-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #456 from CEED/jeremy/strided-restriction
Strided Restrictions
|
| #
266dd7ab
|
| 07-Feb-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
add strided restriction for magma backend
|
| #
868539c2
|
| 04-Feb-2020 |
Natalie Beams <246972+nbeams@users.noreply.github.com> |
Enable MAGMA non-tensor basis (#424)
* update magma backend files from magma-dev to new branch
* add skeleton for elem restrictions
* start putting code and files for nontensor case
* more framework for adding magma elem restrictions
* typo/old code error fixes for interface and header file, etc
* add the writedofs kernels
* fix nonconstants - template them for now
* fix bugs in non-tensor basis apply (interp and weight)
* update magma backend files from magma-dev to new branch
* add skeleton for elem restrictions
* start putting code and files for nontensor case
* more framework for adding magma elem restrictions
* typo/old code error fixes for interface and header file, etc
* add the writedofs kernels
* fix nonconstants - template them for now
* fix bugs in non-tensor basis apply (interp and weight)
* fix incorrect merge conflict resolution of header file
* fix bugs in lmode=notranspose elem restrictions and in copying of indices to device
* test simpler kernels for lmode=transpose elem restrict
* swap element and component ordering in tensor basis actions
* update comments in restriction kernels to match swapped ordering
* fix if statement to work with CEED_VECTOR_NONE instead of NULL
* minor code cleanup
* skip t204-7 for magma after changing E-vector layout
* remove commented old calls for now
* use magma set/get vectors instead of cuda memcopies
* remove dead code
* make style changes
Co-authored-by: Stan Tomov <tomov@eecs.utk.edu>
Co-authored-by: abdelfattah83 <36712794+abdelfattah83@users.noreply.github.com>
|
| #
7c903a07
|
| 06-Nov-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #405 from CEED/jeremy/astyle
Fix Style Indentation Errors
|
| #
d4f68153
|
| 30-Oct-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
QFunction - use CeedQFunctionUser for CeedQFunctionGetUserFunction
|
| #
ccf0fe6f
|
| 30-Oct-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
make style
|
| #
e6a04bf5
|
| 16-Oct-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #273 from CEED/p-multigrid
P Multigrid Example
|
| #
7f823360
|
| 16-Oct-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Make style
|
| #
fa15ec8f
|
| 02-Oct-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #371 from CEED/jeremy/magma-basis-switch
Minor MAGMA Updates
|
| #
bfd2ac16
|
| 02-Oct-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
MAGMA - drop unused dtensor_contract
|
| #
3513a710
|
| 02-Oct-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
MAGMA - style updates and switch for basis
|
| #
7f5b9731
|
| 02-Oct-2019 |
Stan Tomov <tomov@eecs.utk.edu> |
Magma dev optimizations (#111)
* makefile changes
* update magma backend
* magma qfunctions updated to new interface
* in the MAGMA backend we manage where pointers are - if on CPU, in some cases we still need and may call the CPU code
* update the reflect changes in the API
* update the reflect changes in the API
* add the q functions for ex1.c
* Switch to CeedIntPow
* Fix merge errors
* Clean up Magma operator loops
* Move zeroing lvec
* fix bug in the rebase and add some qfunctions. This passes the tests now
* adding new files, changing -O to -O3
* new faster way of checking CPU vs. GPU pointers
* core magma device functions for basis apply
* new kernels for basis apply
* use the new magma_isdevptr function
* minor cleanup
* new headers and defs
* calling the new magma functions for basis apply
* undo O3, and change default magma directory
* use static
* use static
* modify the generator to add before __global__
* remove unnecessary header
* silence some warnings
* Makefile: restore NVCC and NVCCFLAGS to match master
* first pass as updating new Magma work, untested
* Use CUDA backend to dispatch
* Device memory for MAGMA
* Add copyright messages and tidy
* WIP: starting fresh on magma-dev-rebase. Add magma_is_devptr
* WIP: starting fresh on magma-dev-rebase. Fix build issue
* WIP: starting fresh on magma-dev-rebase. Fix build issue
* WIP: starting fresh on magma-dev-rebase. Fix build issue
* WIP: starting fresh on magma-dev-rebase. Fix this include
* WIP: starting fresh on magma-dev-rebase. Fix build issue.
* WIP: starting fresh on magma-dev-rebase. Mostly fixing compilation errors
* WIP: starting fresh on magma-dev-rebase. Disable magma-basis for now
* WIP: starting fresh on magma-dev-rebase. Edit the required magma src files
* move CeedVector_Magma functions from magma-dev branch
* add includes
* fix build errors
* disable magma vector logic for now
* Remove reference in CeedDelegate
* add RestoreArray calls to CeedBasisApply_Magma
* add basis for magma
* magma batched operator
* Change batch calls to match Q-vector ordering
* minor cleanup of unused variable
* update magma portion of Makefile
* remove magma vectors
* remove unused contract variable
* change extern to CEED_INTERN
* merge in updates from master branch
* MAGMA - add lcov markers
* remove stray example script
|