| #
8d75ea1b
|
| 18-Apr-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Fix include statements
|
| #
84a01de5
|
| 12-Mar-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Serial and Blocked AVX Backends (#198)
* Add serial AVX backend
* Style and README changes
* Simplify AVX serial tensor loop
* Minor performance improvement
* C=1 AVX scalar case
* In
Serial and Blocked AVX Backends (#198)
* Add serial AVX backend
* Style and README changes
* Simplify AVX serial tensor loop
* Minor performance improvement
* C=1 AVX scalar case
* Increase use of AVX commands for edge cases
* Prep for eventual Tensor Object
* Comment updates
* Readme update
* Update README
* Refactor to reduce code
* Increase vectorization in remainder of columns
* Vectorize column remainder on C=1 case
* Switch to static inlining for AVX tensor contract
* Tidying for merge
* make style
* Style cleanup
* Full register use for columns
* Make style
show more ...
|
| #
0a1d75a0
|
| 06-Feb-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Merge pull request #206 from CEED/wording
Readability changes
|
| #
856142e1
|
| 06-Feb-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Backend naming adjustment
|
| #
ae228676
|
| 11-Jan-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #182 from CEED/avx
AVX Backend
|
| #
48fffa06
|
| 17-Dec-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
avx vectorized backend
Edge cases for AVX BasisApply
Priority adjustment to match libXSMM branch
Remove scalar/simd mix for Intel
Check for CC AVX support
AVX: proposed doc and makefile detectio
avx vectorized backend
Edge cases for AVX BasisApply
Priority adjustment to match libXSMM branch
Remove scalar/simd mix for Intel
Check for CC AVX support
AVX: proposed doc and makefile detection update
show more ...
|
| #
5c32accb
|
| 18-Dec-2018 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #172 from CEED/setters
Setters
|
| #
fe2413ff
|
| 14-Nov-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
Add setters, remove impl header from CPU, OCCA backends
|
| #
0a5a520a
|
| 06-Nov-2018 |
Jed Brown <jed@jedbrown.org> |
Merge branch 'getters' of github:ceed/libceed [PR #167]
* 'getters' of github:ceed/libceed: Improved documentation Add Operator/QFunction field getters Update documentation Separate to 3 hea
Merge branch 'getters' of github:ceed/libceed [PR #167]
* 'getters' of github:ceed/libceed: Improved documentation Add Operator/QFunction field getters Update documentation Separate to 3 header files First round of getters
[Remove unnecessary ceed-impl.h in merge.]
show more ...
|
| #
d863ab9b
|
| 19-Oct-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
Separate to 3 header files
|
| #
4ce2993f
|
| 17-Oct-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
First round of getters
Use Getters in ref backend
Add Getters to blocked backend
Convert OCCA backend to use Getters
Add getters for backend data
|
| #
9e1c8ed3
|
| 12-Sep-2018 |
Jed Brown <jed@jedbrown.org> |
Merge branch 'remove-extra-ceed-args' [PR #148]
* remove-extra-ceed-args: Restore ceed argument in TensorContractRef/Opt Switch held ref ceed to delegate ceed recursively checked for Refactor
Merge branch 'remove-extra-ceed-args' [PR #148]
* remove-extra-ceed-args: Restore ceed argument in TensorContractRef/Opt Switch held ref ceed to delegate ceed recursively checked for Refactor to standardize backend create functions
show more ...
|
| #
5fe0d4fa
|
| 29-Aug-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
Switch held ref ceed to delegate ceed recursively checked for
|
| #
dba52a49
|
| 04-Sep-2018 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #147 from CEED/opt-to-vec
Rename /cpu/self/opt to /cpu/self/blocked
|
| #
4a2e7687
|
| 04-Sep-2018 |
jeremylt <jeremy.thompson@colorado.edu> |
Rename /cpu/self/opt to /cpu/self/blocked
|