History log of /libCEED/rust/libceed-sys/c-src/backends/cuda-gen/ceed-cuda-gen.c (Results 51 – 69 of 69)
Revision Date Author Comments
# db3f49c5 30-Jun-2020 Yohann <dudouit1@llnl.gov>

Merge branch 'master' into yohann/nameKernels


# a970f638 30-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #566 from CEED/jeremy/solid-cuda

Solids - enable use of GPU vectors


# af17f337 29-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - reorder backend priority, lower is better


# 3f1466f8 26-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #561 from CEED/jeremy/is-deterministic

Ceed - add IsDeterministic


# 52d8ac88 25-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - add missing codecov exceptions


# 4d36c801 24-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #568 from CEED/jeremy/cuda-destroy

Small CUDA Tidying


# 7df94212 23-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - clean up includes


# 73b3ccaf 23-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - clean up minor memory leak


# b1d74153 12-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

CUDA/MAGMA - add AssembleLinearQFunction (#553)

* CUDA - refactor operator apply for CUDA assemble linear QF impl

* CUDA/MAGMA - add AssembleLinearQFunction

* make style

* CUDA - clean up a

CUDA/MAGMA - add AssembleLinearQFunction (#553)

* CUDA - refactor operator apply for CUDA assemble linear QF impl

* CUDA/MAGMA - add AssembleLinearQFunction

* make style

* CUDA - clean up assembleLinearQF after q/e layout refactor

* CUDA - fallback operator for cuda/gen to cuda/ref

* CUDA - use delegation for cuda/gen prefered memtype

show more ...


# 65275b31 13-May-2020 valeriabarra <valeriabarra21@gmail.com>

Merge branch 'master' into valeria/NSfixes


# ab213215 23-Apr-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

More comments in CUDA backends (#518)

* CUDA - adding comments as I work to understand these backends

* PETSc - remove extra include, breaks single source

* make style


# b250684e 26-Mar-2020 Valeria Barra <39932030+valeriabarra@users.noreply.github.com>

Merge branch 'master' into valeria/jupyter-notebook-tutorials


# ccaff030 26-Mar-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Solid Mechanics Example (#474)

libCEED + PETSc Solid Mechanics example

Co-authored-by: Jeremy L Thompson <jeremy.thompson@colorado.edu>
Co-authored-by: Arash Mehraban <arashm81@gmail.com>
Co-au

Solid Mechanics Example (#474)

libCEED + PETSc Solid Mechanics example

Co-authored-by: Jeremy L Thompson <jeremy.thompson@colorado.edu>
Co-authored-by: Arash Mehraban <arashm81@gmail.com>
Co-authored-by: Jed Brown <jed@jedbrown.org>
Co-authored-by: Valeria Barra <valeriabarra21@gmail.com>


* Solids - initial commit of elasticity example

Co-authored-by: Jeremy L Thompson <jeremy.thompson@colorado.edu>
Co-authored-by: Arash Mehraban <arashm81@gmail.com>

* Solids - add VLA macro

* Solids - fix test for mesh file output string

* Solids - update documentation

Co-authored-by: Jeremy L Thompson <jeremy.thompson@colorado.edu>
Co-authored-by: Arash Mehraban <arashm81@gmail.com>

* Solids - update README

* Solids - update ctx struct and casting for CUDA

* Tests - clean up test conditions for GPU backends

* GPU - fix falbacks identified

* CUDA - whitelist FDM Inverse

* make style

* doc: solids editing/correctness

* Solids - separate FE boilerplate into its own function

* Solids - clean up double includes

* doc: add more complete explanation of hyperelasticity and formulation

* Solids - Use ISGetSize for global number of face sets (had no effect on PETSc bug in MR 2615)

* edited docs for hyperFS

* Improve doc of solids example with some constitent notation and fonts

* HyperSS docs modified

* edited typo

* linearization added

* fixed a typo

* Improved doc

* doc: more writing on hyperelasticity and implementation tradeoffs

* Solids - fix typo

* Solids - compute and store C, Cinv, and S as symmetric

* Solids - use new Newton linearization, but its not converging

* Solids - update BC handling

* Solids - update README with new BC options

* removed old notation

* Solids - whitespace changes only in documentation

* doc: adjust differential notation and derive dP(dF)

* Solids - small documentation flow edits

* Solids - simplify setting BCs

* Solids - Symmetritize e in SS

* Solids - set more reasonable default number of steps for nonlinear

* Solids - fix documentation typo

* Solids - clean up computation of deltaS

* Solids - fix sample runs

* Solids - fix hyperSS jacobian

* Solids - fix hyperSS documentation

* Solids - clean up code for hyperSS

* Solids - add file and function docstrings

* doc: commutative diagram of linearization and stable stress expression

* Solids - update hyperSS documentation with linearization

* doc: further tips on stable computation

* Solids - fix typo with trace in hyperSS

* Solids - intro and hyperSS documentation updates

* Solids - update hyperFS for numerical stability with small E and nearly incompressible

* examples/solids: simpler C = I + 2E

* Solids - more compact/readable E computation

* Solids - minor style consistency change

* Solids - replace log1p with series approximation, log1p is not vectorized in libc

* Solids - refactor common components between hyperFSF and hyperFSdF

* examples/solids: use PETSc dependency flags for incremental rebuild

* examples/solids: extend range of log1p_series via shift/scale

* Solids - add bc_clamp_max option to specify maximum displacement for clamped boundary

* make style

* Solids - update with sample run informantion and spellchecking

* Doc - typo fixes

* Solids - homogonize presentation of command line options

* examples/solids: fix -Wmisleading-indentation

* examples/fluids: PETSc 3.13 compatibility

* examples/solids: use log1p for numerical stability in hyperSS

* doc: copy editing in examples/solids

* Solids - update documentation on running the code

* Solids - convert README to rst and include in documentation

* Solids - clean dependency file

* Solids - use log1p approximation in hyperSS as well

* Solids - update README to explain mms

* Solids - switch range on hyperSS log1p series approximation

* Solids - update README to clarify mandatory options

* Solids - add link to sample meshes

* Solids - simplify listing of mesh options

* Travis - use newer PETSc

* Solids - update sample run to list [.exo file] instead of name of sample mesh

* Solids - update make clean to remove src/*.d

* examples/solids: makefile cleanup

* make style

* doc examples/solids: clean up and explain nondimensionalization

* examples/solids: document algebraic solver

* examples/solids: use CG for Chebyshev eig estimation

* doc: fix references to examples/fluids

* Solids - remove mesh folder

* Solids - drop expicit ExodusII requirement, PETSc supports many types of meshes

* doc: mention PETSc --download-exodusii for Exodus

* doc: fix latexpdf

* Fix \longrightarrow rendering in PDF

* Solids - fix toctree link to README for pdflatex

* Examples and BPs Docs: Homogenize math with dS and dV and other tiny esthetics

* Solids - fix unshifted series for hyperSS log1p

* Improve doc of bpssphere: Add differential for consistency with all other examples and explain domain for clarity

* Solids - include sample mesh image

* Move mesh figure after note and slightly rename inclusion marker

Co-authored-by: Arash Mehraban <arashm81@gmail.com>
Co-authored-by: Jed Brown <jed@jedbrown.org>
Co-authored-by: valeriabarra <valeriabarra21@gmail.com>

show more ...


# 621cd461 16-Mar-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #421 from SanderA/sanderarens/fix_ceed_cuda_subclasses

Add Ceed_Cuda struct to Ceed_Cuda_ref/shared/gen.


# c00ee0d7 23-Nov-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #413 from CEED/jeremy/op-add

Add CeedOperatorApplyAdd


# 5afe0718 23-Nov-2019 jeremylt <jeremy.thompson@colorado.edu>

CUDA - fix up composite operator delegation


# abfaacbb 17-Nov-2019 Sander Arens <sanderarens@gmail.com>

Add Ceed_Cuda struct to Ceed_Cuda_ref/shared/gen.

Now Ceed_Cuda_ref/shared/gen act like subclasses and can be properly cast to Ceed_Cuda.


# 288c0443 13-Sep-2019 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

QFunction Create by Name (#311)

This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.

Closes issue #37, issue #340

* Add QFunction gallery, rename focca

QFunction Create by Name (#311)

This PR adds a QFunction gallery to libCEED with 1D, 2D, and 3D mass and Poisson operators.

Closes issue #37, issue #340

* Add QFunction gallery, rename focca

* Gallery - add initial QFunctions

* Add a test for using the QF gallery

* Modify ex1 to use gallery

* Add multiple test configs to tap

* Move output to test directory

* Update junit

* Add OCCA galley exception

* Add ex2

* Update ex2 for dim->ncompx

* Gallery - modify to work for CUDA as is

* Update Documentation

* Gallery - typo fix

* Gallery - convention change, postappend qfunction family variant

* Gallery - update template with new name checking convention

* Gallery - condense diff3DBuild QFunction

* Gallery - rename diff -> poisson

* Gallery - clarify poisson3DBuild comment

* Gallery - use Pragma SIMD, store Qdata in Voigt convention

* Examples - Convert BP3-6 to Voigt convention

* Examples - add cl option to switch between header and gallery qfs in CEED examples

* Examples - clean up construction of QF name

* Gallery - Switch to PascalCase for gallery names

* Doc - fix function type page

* Interface - Make sure strncpy result is null terminated

* Gallery - Update Poisson 2/3D Apply to new QF body

* make style

* make style - fix worst style problems

* make style - add gallery to make style

* Doc - update documentation errors and inconsistencies

* Examples - test ex1 ex2 with and without gallary

* Examples - reduce testing of ex1/ex2 without gallery, clean up non-gallery qfunctions

* MFEM - revert another make style mistake

* Manual make style updates

* Doc - update function documentation page

* Style updates, document test numbering conventions

* doc: resolve ambiguous image location warning, allow more Dot nodes

* Tests - style and cast cleanup

* Tests - fix README indentation

show more ...


# 241a4b83 25-Jul-2019 Yohann <yohann.dudouit@gmail.com>

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generatio

Full jit compiled operator: cuda-gen backend (#275)

* First steps toward cuda-gen backend!

* Closer to real code generation.

* Generated code should be ready for nvrtc.

* The code generation skeleton is ready.

* Hack with the qfunction to make the operator kernel compile.

* Some tweaks in the makefile + Input fields structure change.

* Remove using cout.

* 1d interp and grad device functions.

* 1d readDofs, readQuads, writeDofs, writeQuads.

* Remove dead code.

* readDofs, readQuads, writeDofs, writeQuads for 2d and 3d

* 2d interp and grad

* 3d interp and grad

* - weight functions for 1d,2d,3d
- link the indices to the kernel
- link the fields to the kernel
- link the basis to the kernel

* Add the qFunction reader + inlining

* Add qf files for the tests.

* Add qf file for ceed/ex1

* Add qf file for mfem/bp1

* All tests pass.

* Add qFunction for mfem/bp3, petsc/bp1, and petsc/bp3.

* mfem/bp1 passes + remove dead code

* Fix a bug in n_quads_out for writeQuads

* mfem/bp3 passes.

* All tests all examples pass.

* Temporary tweaks for mfem benchmarking

* Add Context management.

* Modify .qf files to take into account the context.

* Enable optimizations.

* First set of optimization for 2D and 3D.

* Makefile tweaks and destructor code.

* make style.

* Add -MP flag.

* Fix linking issues with the tests.

* Update .qf files for the tests.

* Add .qf files for nek5000 examples.

* Use shared memory for B and G matrices.

* Fix bug introduced in previous commit.

show more ...


123