History log of /libCEED/ (Results 4576 – 4600 of 5646)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
241a4b8325-Jul-2019 Yohann <yohann.dudouit@gmail.com>

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generatio

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generation skeleton is ready.

* Hack with the qfunction to make the operator kernel compile.

* Some tweaks in the makefile + Input fields structure change.

* Remove using cout.

* 1d interp and grad device functions.

* 1d readDofs, readQuads, writeDofs, writeQuads.

* Remove dead code.

* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d

* 2d interp and grad

* 3d interp and grad

* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel

* Add the qFunction reader + inlining

* Add qf files for the tests.

* Add qf file for ceed/ex1

* Add qf file for mfem/bp1

* All tests pass.

* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.

* mfem/bp1 passes + remove dead code

* Fix a bug in n_quads_out for writeQuads

* mfem/bp3 passes.

* All tests all examples pass.

* Temporary tweaks for mfem benchmarking

* Add Context management.

* Modify .qf files to take into account the context.

* Enable optimizations.

* First set of optimization for 2D and 3D.

* Makefile tweaks and destructor code.

* make style.

* Add -MP flag.

* Fix linking issues with the tests.

* Update .qf files for the tests.

* Add .qf files for nek5000 examples.

* Use shared memory for B and G matrices.

* Fix bug introduced in previous commit.

show more ...

6f67eec923-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #300 from CEED/jeremy/doc-fix

README: Update corectness for examples

01a7252b22-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

README: Update corectness for examples

86a4271f22-Jul-2019 Thilina Rathnayake <thilinarmtb@users.noreply.github.com>

Update nek5000 example scripts (#241)

* Update nek5000 examples to use NEKTOOLS_DIR

* Remove +x

* Nek: Fix a bug in generate-boxes.sh script

* Use one script for nek exaamples

* Update m

Update nek5000 example scripts (#241)

* Update nek5000 examples to use NEKTOOLS_DIR

* Remove +x

* Nek: Fix a bug in generate-boxes.sh script

* Use one script for nek exaamples

* Update make-nek-tests.sh

* Update nek-examples.sh

* Change permissions of make-nek-tests.sh

* Update examples Makefile to use new nek script

* Update nek5000 examples build process

* Trying to make travis happy

* Polishing the script

* This should make Travis happy

* Polish nek-examples.sh

* Update examples/Makefile

* Update top-level makefile

* Delete make-nek-tests.sh & update nek-examples.sh

* Update Makefile and examples/Makefile

* Update nek-examples.sh

* Update TESTARGS in bp .usr files

* Update Makefile

* Update .travis.yml

* Update nek-examples.sh

* Add CEED_DIR to tap.sh

* Uncomment OCCA_DIR in Makefile

* Fix make prove and update tap.sh

* Pass NEK5K_DIR and CEED_DIR in prove-all

* Set MPI=0 default for Nek examples/tests

* Nek5K make updates

* Make prove update

* Update make-prove to make all ex except Nek5K in parallel

* Set Nek5K examples to use MPI by default

* Makefile: Improved Nek5k handling

* Makefile: build Nek bps in separate directories

* Nek: rename dir for f77 file length

* debugging

* Nek: use single source

* Makefile: fix Nek mpif.h issue

* Nek: Clean up build proces, improve documentation

* Makefile: enable building single Nek example in parallel with other examples

* Nek: Fix CC/FC and TESTARGS

show more ...

bd2096da18-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #298 from CEED/jeremy/ref-priority-fix

Fix ref backend priorities

3a6bb9fc18-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #297 from CEED/jeremy/209-f-fix

Fix typo in t209-f

706bc5e618-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

backends: fix ref backend priorities

deafd6bc18-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

tests: fix typo in t209-f

01b3627d15-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #296 from CEED/jed/fix-error-reduction

examples/petsc: fix max reduction type and address

00742fa814-Jul-2019 Jed Brown <jed@jedbrown.org>

examples/petsc: fix max reduction type and address

e24ee97614-Jul-2019 Jed Brown <jed@jedbrown.org>

Merge branch 'valeria/fix-ns-derivatives' [PR #292]

* valeria/fix-ns-derivatives:
examples/navier-stokes: formatting cleanup
navier-stokes: use du[3][3] to clean up indexing
Fix derivatives or

Merge branch 'valeria/fix-ns-derivatives' [PR #292]

* valeria/fix-ns-derivatives:
examples/navier-stokes: formatting cleanup
navier-stokes: use du[3][3] to clean up indexing
Fix derivatives order in du

show more ...

6278642311-Jul-2019 Valeria Barra <valeria.barra@colorado.edu>

examples/navier-stokes: formatting cleanup

[Jed] Commit revised to leave derivatives intact.

04ad072912-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #295 from CEED/cpu_priority_fix

BugFix - Resource strcmp in LIBXSMM backends

6f7d248d12-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

Update CPU backends to give default for /cpu/self/***

e0fc044712-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

Fix resource strcmp in xsmm backends

56cf2fbb10-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #291 from CEED/rstr-mult

Add ElemRestrGetMultiplicity and Tests

1469ee4d10-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

Add ElemRestrGetMult and tests

f405f80604-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #289 from CEED/cuda-occa-copy-vals

Update CUDA/OCCA CEED_COPY_VALUES logic

ea03cb9503-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

Update CUDA/OCCA CEED_COPY_VALUES logic

1f6d0b3602-Jul-2019 Jed Brown <jed@jedbrown.org>

Merge branch 'yohann/cuda-restr-opt' [PR #269]

* yohann/cuda-restr-opt:
Remove dead or unnecessary code.
Remove dead code. Cuda-reg restriction optimization.
Optimization of 3D kernels for cud

Merge branch 'yohann/cuda-restr-opt' [PR #269]

* yohann/cuda-restr-opt:
Remove dead or unnecessary code.
Remove dead code. Cuda-reg restriction optimization.
Optimization of 3D kernels for cuda-shared backend.
Optimization of 1D kernels for cuda-shared backend.
Optimization of 2D kernels for cuda-shared backend.
Minor bug fix
Optimization of weight kernel and dynamic allocation of shared memory.
Optimization of cuda-reg restriction.
Start the optimization of the Cuda restriction operator.

show more ...

f51b1ba228-Jun-2019 Jed Brown <jed@jedbrown.org>

navier-stokes: use du[3][3] to clean up indexing

Reported-by: Leila Ghaffari <Leila.Ghaffari@colorado.edu>

0f9724d527-Jun-2019 Valeria Barra <valeria.barra@colorado.edu>

Fix derivatives order in du

1226057f27-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Merge branch 'master' into yohann/cuda-restr-opt

Conflicts:
backends/cuda-reg/ceed-cuda-reg-restriction.c
backends/cuda-shared/ceed-cuda-shared-basis.c

ece74ea827-Jun-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #284 from CEED/jed/blas-lib

Makefile: MKL link flags for non-Intel compilers

c0ea104826-Jun-2019 Jed Brown <jed@jedbrown.org>

Makefile: MKL link flags for non-Intel compilers

One can always run

make BLAS_LIB=-mkl

when using the Intel compiler, but it's typically important for us to
link the sequential version and we'd

Makefile: MKL link flags for non-Intel compilers

One can always run

make BLAS_LIB=-mkl

when using the Intel compiler, but it's typically important for us to
link the sequential version and we'd like a system that works with
non-Intel compilers, where the incantation is so hard to remember that
Intel made a web app:

https://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/

show more ...

1...<<181182183184185186187188189190>>...226