History log of /libCEED/README.md (Results 76 – 100 of 158)
Revision Date Author Comments
# 5462ed45 17-Sep-2019 jeremylt <jeremy.thompson@colorado.edu>

Makefile - Detect MKLROOT for LIBXSMM linking


# 4d537eea 02-Sep-2019 Yohann <dudouit1@llnl.gov>

Single Source QFunction (#304)

Introduce a new macro CEED_QFUNCTION that allows to define qFunctions in a single source code independently of the targeted backend.

Thanks-to: Jeremy Thompson
Tha

Single Source QFunction (#304)

Introduce a new macro CEED_QFUNCTION that allows to define qFunctions in a single source code independently of the targeted backend.

Thanks-to: Jeremy Thompson
Thanks-to: Jed Brown
This work is the result of a fruitful discussion between Jed Brown, Jeremy Thompson and Yohann Dudouit. Jeremy Thompson also implemented important features in this commit and was very active and helpful all along the progress of this work.

[NEWS] Breaking change: QFunctionField parameter 'ncomp' changed to 'size'. This change requires setting the previous value of 'ncomp' to 'ncomp*dim' when adding a QFunctionField with eval mode 'CEED_EVAL_GRAD'.

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generation skeleton is ready.

* Hack with the qfunction to make the operator kernel compile.

* Some tweaks in the makefile + Input fields structure change.

* Remove using cout.

* 1d interp and grad device functions.

* 1d readDofs, readQuads, writeDofs, writeQuads.

* Remove dead code.

* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d

* 2d interp and grad

* 3d interp and grad

* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel

* Add the qFunction reader + inlining

* Add qf files for the tests.

* Add qf file for ceed/ex1

* Add qf file for mfem/bp1

* All tests pass.

* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.

* mfem/bp1 passes + remove dead code

* Fix a bug in n_quads_out for writeQuads

* mfem/bp3 passes.

* All tests all examples pass.

* Temporary tweaks for mfem benchmarking

* Add Context management.

* Modify .qf files to take into account the context.

* Enable optimizations.

* First set of optimization for 2D and 3D.

* double pointer format for the qFunction.

* Change the .qf files to have the same code as the C functions.

* Make previous Cuda backends use .qf files.

* Add a return value to qFunctions.

* Make cpu backends use .qf files.

* Minor: clean commented code.

* Add guarded math.h for petsc examples.

* Remove previous nek qf files.

* Remove .cu files.

* Remove .qf files.

* Remove dead code in the tests.

* make style

* Make style fix.

* more make style fixes.

* CEED_QFUNCTION - improve macro for CPU filenames

* Add CEED_QFUNCTION macro to navierstokes.c

* Fix PETSc gitignore

* Change default NS problemtype to density_current (#307) in navierstokes.c

* Fix petsc bp1.h

* Real Fix for petsc bp1.h...

* fix

* README - Add /gpu/cuda/gen

* PETSc - Update dmplex example to use *_loc

* cuda/reg - fix typo

* Revert a couple of small changes

* Fix a bug in mfem bp3 similar to the previous bug in petsc bp3.

* Make PETSc qfunctions look closer to master, and minor style for debugging.

* More uniformity changes

* Fix a strange CUDA_OUT_OF_RESSOURCE bug.

* NS - fix fname variables

* Use a different convention for qFunction ncomp.

* update cuda-gen backend and bpsdmplex.

* PETSc - style update

* update mfem bp1 and bp3.

* Interface - Use size instead of ncomp for QFunction fields

* update ceed example and tests.

* Tests - Update ncomp to size

* CPU Backends - Update ncomp to size

* CPU Backends - style

* Nek - Update ncomp to size

* Opt - fix style

* CUDA - update ncomp to size

* Doc - Update API documentation for QFunction \ncomp->size

* OCCA - Patch QFunction ncomp -> size, work but revamp will be better

* OCCA - assert dim>0 for clang-tidy

* CUDA - Change GetNumComp to GetSize

* Basis - Shift check for dim > 0 to interface

* Doc update

* Update NS field size

* NS - Fix problem options

show more ...


# 6f67eec9 23-Jul-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #300 from CEED/jeremy/doc-fix

README: Update corectness for examples


# 01a7252b 22-Jul-2019 jeremylt <jeremy.thompson@colorado.edu>

README: Update corectness for examples


# 1226057f 27-Jun-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Merge branch 'master' into yohann/cuda-restr-opt

Conflicts:
backends/cuda-reg/ceed-cuda-reg-restriction.c
backends/cuda-shared/ceed-cuda-shared-basis.c


# 650cb9e7 17-Jun-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #278 from CEED/petsc-bp-docfix

PETSc BP Docfix


# 59d7a4ce 17-Jun-2019 jeremylt <jeremy.thompson@colorado.edu>

Update Benchmark documentation


# 55542c59 17-Jun-2019 jeremylt <jeremy.thompson@colorado.edu>

Update README for PETSc BPs


# 1856ee7c 29-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #239 from CEED/decorator

Add delegates for specific objects


# 68ba6316 28-May-2019 jeremylt <jeremy.thompson@colorado.edu>

README: Drop tmpl backend from list


# 7cfafa9d 24-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #258 from CEED/valgrind-fix

Check for devel Valgrind


# bdc3149d 24-May-2019 jeremylt <jeremy.thompson@colorado.edu>

Check for devel Valgrind


# 854b0396 24-May-2019 Yohann <dudouit1@llnl.gov>

Add description of cuda-shared backend in README.md


# 8ffbcc63 24-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Revert "Brief description of the cuda-shared backend in README.md" (#256)


# db777ff9 24-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #255 from CEED/yohann/cuda-shared-readme

Brief description of the cuda-shared backend in README.md


# a17ac2e9 24-May-2019 Yohann Dudouit <yohann.dudouit@gmail.com>

Brief description of the cuda-shared backend in README.md


# d4fd2798 18-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #228 from CEED/rstr-block

Restriction Block


# 89c6efa4 03-May-2019 jeremylt <jeremy.thompson@colorado.edu>

Use blocking in optimized serial backends


# 045b9c47 29-Mar-2019 jeremylt <jeremy.thompson@colorado.edu>

Include full evec blocked backend


# 486febe6 15-May-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #224 from CEED/mem-zero

Check QFunction Output Vecs


# fc7cf9a0 18-Apr-2019 jeremylt <jeremy.thompson@colorado.edu>

Set QFunction outputs undefined before apply in new memcheck backend


# b8c86f25 28-Mar-2019 Valeria Barra <39932030+valeriabarra@users.noreply.github.com>

Ns fix alignment (#226)

* Fix alignment

* remove #include <ceed-impl.h> from tests/t306-basis.c

* Update examples/pets/Makefile


# e17541c0 28-Mar-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #222 from CEED/blas-vs-mkl

Add MKL optional flag


# 724a7164 27-Mar-2019 jeremylt <jeremy.thompson@colorado.edu>

Add MKL optional flag


# 330b45c2 24-Mar-2019 Valeria Barra <39932030+valeriabarra@users.noreply.github.com>

Merge pull request #220 from CEED/README-typo-fix

README typo fix


1234567