| 60f9e2d6 | 25-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Ceed - add debugging mode for backends, lifted to library level from OCCA backend |
| 9525855c | 17-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Ceed - add IsDeterministic |
| 9e9210b8 | 17-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Op - add AssembleAdd version of diagonal assembly functions, will be helpful for MFEM integration |
| 2bba3ffa | 17-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Op - change LinearAssemble* to accept CeedVector instead of pointer to CeedVector, allows for less memory movement and interfaces with parent code better |
| 80ac2e43 | 15-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
style - rename AssembleLinear* to LinearAssemble* |
| fd364f38 | 15-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
style - change Get*Status to Is* |
| c04a41a7 | 15-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Composite Operator support for AssembleLinearDiag/PBDiag (#552)
* ref - add composite operator support for building diagonal/pb diagonal
* ref - add error for non-composite mult-field operator di
Composite Operator support for AssembleLinearDiag/PBDiag (#552)
* ref - add composite operator support for building diagonal/pb diagonal
* ref - add error for non-composite mult-field operator diagonal/pb diagonal assembly
* tap - add t538 exclusion because OCCA does not support galleries
* tests - adjust test cases for ceed examples for test coverage
* Op - fix documentation
show more ...
|
| 49fd234c | 12-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Convert CUDA ref/reg/shared E-Layout (#554)
* tests - update tests for multiple e-layouts
* CUDA - convert ref and reg restrictions to Q-layout
* CUDA - ref/reg/shared use gen/magma E-Layout f
Convert CUDA ref/reg/shared E-Layout (#554)
* tests - update tests for multiple e-layouts
* CUDA - convert ref and reg restrictions to Q-layout
* CUDA - ref/reg/shared use gen/magma E-Layout for multi elememnt basis apply and operator apply
* CUDA/MAGMA - drop eandqdiffer and separate MAGMA operator code
* CUDA - update operator comment
* reg - clarify read/write dofs/quads
* CUDA - drop dead code
show more ...
|
| d965c7a7 | 06-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
CPU Impl of AssemblePointBlockDiagonal (#503)
* Operator - add AssemblePointBlockDiagonal for CPU backends, with test
* CUDA - add point block diagonal not supported message
* make style
*
CPU Impl of AssemblePointBlockDiagonal (#503)
* Operator - add AssemblePointBlockDiagonal for CPU backends, with test
* CUDA - add point block diagonal not supported message
* make style
* Operator - improve point block description
* OCCA - explicitly remove OCCA fallback to CPU assembly functions, will update after new OCCA backend
* Op - remove gap removal in point block diagonal
* Op - update diagonal assembly documentation
* Update backends/ref/ceed-ref-operator.c
Co-authored-by: Jed Brown <jed@jedbrown.org>
* style - fix extra space in * with nopad
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| b349fba6 | 14-May-2020 |
Jed Brown <jed@jedbrown.org> |
fortran: use standard z prefix instead of deprecated x prefix for hexadecimal |
| 20aaa365 | 08-May-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
XSMM - fix q=1 index computation (#531)
* XSMM - use hash table for kernel index lookup, add khash to libCEED
* Hash - add CeedHashIJKLM to simplify xsmm tensor hash table code
* simplify use
XSMM - fix q=1 index computation (#531)
* XSMM - use hash table for kernel index lookup, add khash to libCEED
* Hash - add CeedHashIJKLM to simplify xsmm tensor hash table code
* simplify use of hash table, use kernels as values
* XSMM: more idiomatic use of khash
* make style
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| fc5b9ac6 | 01-May-2020 |
Jed Brown <jed@jedbrown.org> |
doc: remove obsolete references to CeedInterlaceMode |
| 3ac43b2c | 24-Apr-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
ElemRestriction - add check for strided status |
| 430758c8 | 24-Apr-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
ElemRestriction - add state counter for reading offsets array |
| bd33150a | 24-Apr-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
ElemRestriction - add ElemRestrictionGetOffsets |
| 68d8d928 | 22-Apr-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #513 from CEED/jeremy/restriction-no-interlace
Convert restriction to offsets |
| 6c58de82 | 17-Apr-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Basis - better protect basis data from user meddling
Co-Authored-By: Jed Brown <jed@jedbrown.org> |
| 9d007619 | 17-Apr-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
Basis - move some functions to public API |
| d979a051 | 14-Apr-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
Restriction - convert restiction to use offsets instead of interlace mode. CPU and Python impl only, CUDA and MAGMA left to do. |
| 245d2530 | 26-Mar-2020 |
Jed Brown <jed@jedbrown.org> |
pragma vector for Intel; pragma simd is deprecated
Pragma simd is documented without deprecation notice, but the icc-19.05.281 warns of its deprecation. https://software.intel.com/en-us/cpp-compiler
pragma vector for Intel; pragma simd is deprecated
Pragma simd is documented without deprecation notice, but the icc-19.05.281 warns of its deprecation. https://software.intel.com/en-us/cpp-compiler-developer-guide-and-reference-simdvector
One would think Intel's pragma ivdep would be similar to GCC ivdep, but icc evidently has different semantics/miscompiles, as demonstrated in the example, which is left in the source to protect an enterprising future committer from adding ivdep.
show more ...
|
| 0a0da059 | 20-Mar-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
Ceed - add CeedView() |
| 1da99368 | 17-Mar-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Add VLA support for CUDA backends (#481)
* CUDA - add VLA support for CUDA backends
* Navier-Stokes - guard #include math.h for CUDA
* Navier-Stokes - fix struct definitions for code generatio
Add VLA support for CUDA backends (#481)
* CUDA - add VLA support for CUDA backends
* Navier-Stokes - guard #include math.h for CUDA
* Navier-Stokes - fix struct definitions for code generation
* Navier-Stokes - fix file location of mass QFunction
* Navier-Stokes - minor style consistency
* Navier-Stokes - more context sturct fixing
* Navier-Stokes - cast context to stop CUDA errors
* Navier-Stokes - change helper functions to inline to silence CUDA errors
* CUDA - clean up CEED_Q_VLA definition for cuda/gen
* Navier-Stokes - populate TODOs for undocumenced functions
* Doc - document libCEED macros
* CUDA - use -default-device for NVRTC
* Tests - add t507 to test VLA support
* CUDA - fix my bug in dim, Q1d for cuda/gen
* Tests - update tap/junit to reflect current CUDA capabilities
* make style
* CUDA - check output bases for dim, Q1d as well (this fixes multigrid example support)
* CUDA - fix readSliceQuads, not all arguments were defined
* Navier-Stokes - document all functions in Navier-Stokes QFunction files
* make style
show more ...
|
| cec346d6 | 10-Mar-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
change restrict to __restrict__ for t |
| d2b960ed | 10-Mar-2020 |
jeremylt <jeremy.thompson@colorado.edu> |
Interface - use __restrict__ instead of restrict for C++ compatability |
| 4c4400c7 | 08-Mar-2020 |
Valeria Barra <valeriabarra21@gmail.com> |
doc: convert to Sphinx documentation at libceed.readthedocs.io
Some technical notes about the expanded docs/process:
* Doxygen is now restricted to API documentation. Sphinx docs are incomplete
doc: convert to Sphinx documentation at libceed.readthedocs.io
Some technical notes about the expanded docs/process:
* Doxygen is now restricted to API documentation. Sphinx docs are incomplete surrogates due to not hyperlinking code listings, though we hope the Doxygen site can be removed eventually.
* PDF output uses XeTeX (unicode and OTF fonts) with TeX Gyre Pagella and DejaVu Sans Mono. You may need to install these fonts in order to build the PDF users manual.
* Obtaining the author list from Git has not worked on ReadTheDocs (`git shortlog` is empty even though `git log` is not), so we automatically generate it on developer machines with `make doc`. If the author information is incorrect/mangled, edit .mailmap to fix the mapping.
* Math semantics are based on the description here https://github.com/sphinx-doc/sphinx/issues/3785
Sphinx chooses one of two setups for the maths.
* If there is a single equation, then we wrap the maths in \begin{equation*} (or equation).
* If there are multiple equations (as determined by the presence of an empty line), then we wrap the maths first in \begin{aligned} and then in \begin{align*} (or align).
Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu> Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|