| 0b31f4ea | 17-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
.mailmap: more updates for Valeria and Yohann |
| 938abcb3 | 17-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Update .mailmap (#347) |
| 1a0857f9 | 17-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #346 from CEED/jeremy/style-revert
MFEM - revert make style error |
| cf1ddfe8 | 17-Sep-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
MFEM - revert make style error |
| ac421f39 | 17-Sep-2019 |
Yohann <dudouit1@llnl.gov> |
Improved performance of cuda-gen backend (#341)
Thanks-to: Tim Warburton
Some of these optimizations are the results of the knowledge and experience gathered by Tim Warburton and his team in libPar
Improved performance of cuda-gen backend (#341)
Thanks-to: Tim Warburton
Some of these optimizations are the results of the knowledge and experience gathered by Tim Warburton and his team in libParanumal and then ported to libCEED.
* Add colocated gradient in 3D.
* Treat the qFunction by slice in 3d to avoid using too many registers.
* Minor fix
* Minor fix.
* Minor fix
* Compute the colocated gradient slice by slice.
* Add synchthreads after initialization of the matrices.
* Remove code print.
* Add a critical #pragma unroll
* Fix typo on "collocated".
* Remove dead code.
* Use ColloGrad3d functions.
* Fix cuda-gen backend when collocated gradient is not available.
* make style
* make style
* Add some comments.
* Replace int by CeedInt.
show more ...
|
| e5261ca0 | 17-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Update benchmarks/README.md (#345) |
| 7cf4cb3b | 16-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #344 from CEED/jeremy/ifort-fix
Fortran - Use preprocessor directive to provide abs path to true qfuntion source
resolves Issue #71 |
| 2d50dd3d | 16-Sep-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Fortran - Use preprocessor directive to provide abs path to true qfunction source |
| 1176cc3a | 14-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
QFunction - document absolute path requirement (#343) |
| a2fa7910 | 13-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Make examples consistent (#342)
* Make examples consistent |
| 52bfb9bb | 13-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Symmetric Schur and Simultaneous Diagonalization (#299)
This PR adds simultaneous diagonalization and reorders the t3* tests.
* Interface: add Schur Decomp and interface for Simultaneous Diagonal
Symmetric Schur and Simultaneous Diagonalization (#299)
This PR adds simultaneous diagonalization and reorders the t3* tests.
* Interface: add Schur Decomp and interface for Simultaneous Diagonalization, implement Schur Decomp
* Tests: add Symmetric Schur Fortran test
* Interface: implement Simult Diag
* Tighten Givens tolerance in symmetric Schur
* Tests: Reorder t3* tests
* Basis - Check bounds for symmetric Schur
* Simult Diag - Add LAPACK function reference
* Typo fix
* Basis - 'static' CeedMatrixMultiply
* Merge - fix small issues
* Basis - typo fix
* Basis - Documentation fixes
show more ...
|
| 288c0443 | 13-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
QFunction Create by Name (#311)
This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.
Closes issue #37, issue #340
* Add QFunction gallery, rename focca
QFunction Create by Name (#311)
This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.
Closes issue #37, issue #340
* Add QFunction gallery, rename focca
* Gallery - add initial QFunctions
* Add a test for using the QF gallery
* Modify ex1 to use gallery
* Add multiple test configs to tap
* Move output to test directory
* Update junit
* Add OCCA galley exception
* Add ex2
* Update ex2 for dim->ncompx
* Gallery - modify to work for CUDA as is
* Update Documentation
* Gallery - typo fix
* Gallery - convention change, postappend qfunction family variant
* Gallery - update template with new name checking convention
* Gallery - condense diff3DBuild QFunction
* Gallery - rename diff -> poisson
* Gallery - clarify poisson3DBuild comment
* Gallery - use Pragma SIMD, store Qdata in Voigt convention
* Examples - Convert BP3-6 to Voigt convention
* Examples - add cl option to switch between header and gallery qfs in CEED examples
* Examples - clean up construction of QF name
* Gallery - Switch to PascalCase for gallery names
* Doc - fix function type page
* Interface - Make sure strncpy result is null terminated
* Gallery - Update Poisson 2/3D Apply to new QF body
* make style
* make style - fix worst style problems
* make style - add gallery to make style
* Doc - update documentation errors and inconsistencies
* Examples - test ex1 ex2 with and without gallary
* Examples - reduce testing of ex1/ex2 without gallery, clean up non-gallery qfunctions
* MFEM - revert another make style mistake
* Manual make style updates
* Doc - update function documentation page
* Style updates, document test numbering conventions
* doc: resolve ambiguous image location warning, allow more Dot nodes
* Tests - style and cast cleanup
* Tests - fix README indentation
show more ...
|
| ee07ded2 | 11-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Add CeedPragmaOMP to bps (#338)
* Convert petsc BP3&4 to loops
* Update petsc/bp4.h looping
* Switch to CeedPragmaSIMD and make examples/petsc/bp3.h consistent with bp4.h
Remove CeedPragm
Add CeedPragmaOMP to bps (#338)
* Convert petsc BP3&4 to loops
* Update petsc/bp4.h looping
* Switch to CeedPragmaSIMD and make examples/petsc/bp3.h consistent with bp4.h
Remove CeedPragmaOMP directive in Nek example and update documentation
* Remove restric qualifier in petsc/bp3.h and update documentation
show more ...
|
| c172e0a6 | 11-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Fix postprocessing script (#339)
* Fix postprocessing scripts after PR #314 |
| 4f043703 | 05-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #336 from CEED/jeremy/cuda-mem-enum
CUDA - Rename SyncState -> CeedCudaSyncState |
| dc729fe2 | 05-Sep-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
CUDA - Rename SyncState -> CeedCudaSyncState |
| 34d77899 | 04-Sep-2019 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Homogenize examples by using ncomp and dim (#335)
* Homogenize examples by using ncomp and dim |
| 2ec799e5 | 04-Sep-2019 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #333 from CEED/jed/ns-periodic
Navier-Stokes: add support for periodic boundary conditions |
| 52cf619c | 03-Sep-2019 |
Jed Brown <jed@jedbrown.org> |
navier-stokes: remove unused variables |
| 794da26a | 28-Aug-2019 |
Jed Brown <jed@jedbrown.org> |
navier-stokes: add -process_grid and make resolution mean something
Formerly, the process grid was always automatically determined by the number of processes. Now, use that as default, but allow it
navier-stokes: add -process_grid and make resolution mean something
Formerly, the process grid was always automatically determined by the number of processes. Now, use that as default, but allow it to be fully specified. For example, one can now run a quasi-2D simulation by adding periodicity in z. For example,
mpiexec -n 32 ./navierstokes -process_grid 8,4,1 -resx 100 -resy 100 -resz 100 -lz 100 -periodic 0,0,1
show more ...
|
| 83bd60e3 | 28-Aug-2019 |
Jed Brown <jed@jedbrown.org> |
navier-stokes: add support for periodic boundary conditions
Change mapping to DM Vec from global-to-global to local-to-global to support periodicity without using DMDA periodic (for which coordinate
navier-stokes: add support for periodic boundary conditions
Change mapping to DM Vec from global-to-global to local-to-global to support periodicity without using DMDA periodic (for which coordinates are ill-defined relative to what VTK expects).
Coordinates are now written with physical, rather than nondimensional, units.
show more ...
|
| 1341e1bf | 04-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #334 from CEED/jeremy/travis-debug
Work around LIBXSMM bug |
| 7c9c4dbf | 03-Sep-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Work around LIBXSMM bug; reset LIBXSMM to 07e360a |
| c6c5551b | 03-Sep-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #332 from CEED/valeria/NS-update-documentation
Update NS example documentation |
| 217aa3ef | 03-Sep-2019 |
valeria <valeriabarra21@gmail.com> |
Update documentation after PR #323 |