| /petsc/src/sys/objects/device/tests/output/ |
| H A D | ex7_with_info_sycl.out | 3 [0] <device> MarkFromID_IncompatibleModes(): dctx 2 (global root) - obj 8 (memory allocation): new … 4 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 2 (global root) - obj 8 (memory all… 5 [0] <device> MarkFromID_IncompatibleModes(): dctx 2 (global root) - obj 8 (memory deallocation): ne… 6 [0] <device> MarkFromID_IncompatibleModes_UpdateLastWrite(): dctx 2 (global root) - obj 8 (memory d… 7 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 2 (global root) - obj 8 (memory dea… 8 [0] <device> MarkFromID_IncompatibleModes(): dctx 2 (global root) - obj 12 (memory allocation): new… 9 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 2 (global root) - obj 12 (memory al… 10 [0] <device> MarkFromID_IncompatibleModes(): dctx 2 (global root) - obj 13 (memory allocation): new… 11 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 2 (global root) - obj 13 (memory al… 12 [0] <device> MarkFromID_IncompatibleModes(): dctx 2 (global root) - obj 14 (memory allocation): new… [all …]
|
| H A D | ex7_with_info_cuda.out | 3 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 58 (memory allocation): ne… 4 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 58 (memory a… 5 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 58 (memory deallocation): … 6 [0] <device> MarkFromID_IncompatibleModes_UpdateLastWrite(): dctx 53 (global root) - obj 58 (memory… 7 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 58 (memory d… 8 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 63 (memory allocation): ne… 9 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 63 (memory a… 10 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 64 (memory allocation): ne… 11 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 64 (memory a… 12 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 65 (memory allocation): ne… [all …]
|
| H A D | ex7_with_info_hip.out | 3 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 58 (memory allocation): ne… 4 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 58 (memory a… 5 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 58 (memory deallocation): … 6 [0] <device> MarkFromID_IncompatibleModes_UpdateLastWrite(): dctx 53 (global root) - obj 58 (memory… 7 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 58 (memory d… 8 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 63 (memory allocation): ne… 9 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 63 (memory a… 10 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 64 (memory allocation): ne… 11 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 64 (memory a… 12 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 65 (memory allocation): ne… [all …]
|
| H A D | ex7_with_info_host_no_device.out | 3 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 56 (memory allocation): ne… 4 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 56 (memory a… 5 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 56 (memory deallocation): … 6 [0] <device> MarkFromID_IncompatibleModes_UpdateLastWrite(): dctx 53 (global root) - obj 56 (memory… 7 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 56 (memory d… 8 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 60 (memory allocation): ne… 9 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 60 (memory a… 10 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 61 (memory allocation): ne… 11 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 61 (memory a… 12 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 62 (memory allocation): ne… [all …]
|
| H A D | ex7_with_info_host_with_device.out | 3 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 56 (memory allocation): ne… 4 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 56 (memory a… 5 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 56 (memory deallocation): … 6 [0] <device> MarkFromID_IncompatibleModes_UpdateLastWrite(): dctx 53 (global root) - obj 56 (memory… 7 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 56 (memory d… 8 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 60 (memory allocation): ne… 9 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 60 (memory a… 10 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 61 (memory allocation): ne… 11 [0] <device> PetscDeviceContextMarkIntentFromID_Private(): dctx 53 (global root) - obj 61 (memory a… 12 [0] <device> MarkFromID_IncompatibleModes(): dctx 53 (global root) - obj 62 (memory allocation): ne… [all …]
|
| /petsc/src/tao/linesearch/impls/armijo/ |
| H A D | armijo.c | 16 PetscCall(PetscFree(armP->memory)); in TaoLineSearchDestroy_Armijo() 28 PetscCall(PetscFree(armP->memory)); in TaoLineSearchReset_Armijo() 145 if (!armP->memory) PetscCall(PetscMalloc1(armP->memorySize, &armP->memory)); in TaoLineSearchApply_Armijo() 148 for (i = 0; i < armP->memorySize; i++) armP->memory[i] = armP->alpha * (*f); in TaoLineSearchApply_Armijo() 151 armP->lastReference = armP->memory[0]; in TaoLineSearchApply_Armijo() 159 if (armP->memory[i] > ref) { in TaoLineSearchApply_Armijo() 160 ref = armP->memory[i]; in TaoLineSearchApply_Armijo() 167 for (i = 0; i < armP->memorySize; i++) ref += armP->memory[i]; in TaoLineSearchApply_Armijo() 169 ref = PetscMax(ref, armP->memory[armP->current]); in TaoLineSearchApply_Armijo() 171 ref = PetscMin(ref, 0.5 * (armP->lastReference + armP->memory[armP->current])); in TaoLineSearchApply_Armijo() [all …]
|
| /petsc/src/tao/linesearch/impls/owarmijo/ |
| H A D | owarmijo.c | 42 PetscCall(PetscFree(armP->memory)); in TaoLineSearchDestroy_OWArmijo() 175 if (!armP->memory) PetscCall(PetscMalloc1(armP->memorySize, &armP->memory)); in TaoLineSearchApply_OWArmijo() 178 for (i = 0; i < armP->memorySize; i++) armP->memory[i] = armP->alpha * (*f); in TaoLineSearchApply_OWArmijo() 180 armP->lastReference = armP->memory[0]; in TaoLineSearchApply_OWArmijo() 185 ref = armP->memory[0]; in TaoLineSearchApply_OWArmijo() 189 if (armP->memory[i] > ref) { in TaoLineSearchApply_OWArmijo() 190 ref = armP->memory[i]; in TaoLineSearchApply_OWArmijo() 197 for (i = 0; i < armP->memorySize; i++) ref += armP->memory[i]; in TaoLineSearchApply_OWArmijo() 199 ref = PetscMax(ref, armP->memory[armP->current]); in TaoLineSearchApply_OWArmijo() 201 ref = PetscMin(ref, 0.5 * (armP->lastReference + armP->memory[armP->current])); in TaoLineSearchApply_OWArmijo() [all …]
|
| /petsc/src/benchmarks/results/ |
| H A D | performance_cfd_2_10.m | 10 text(1,22,'IBM SP Superchip, 4 memory cards','Rotation',90) 11 text(2,22,'IBM SP Superchip, 2 memory cards','Rotation',90) 30 text(1,22,'IBM SP Superchip, 4 memory cards','Rotation',90) 31 text(2,22,'IBM SP Superchip, 2 memory cards','Rotation',90)
|
| H A D | performance_arco1.m | 12 text(3,22,'IBM SP Superchip, 4 memory cards','Rotation',90) 13 text(4,22,'IBM SP Superchip, 2 memory cards','Rotation',90) 38 text(3,22,'IBM SP Superchip, 4 memory cards','Rotation',90) 39 text(4,22,'IBM SP Superchip, 2 memory cards','Rotation',90)
|
| /petsc/doc/manual/ |
| H A D | performance.md | 6 with PETSc, particularly on distributed-memory machines with multiple 15 each byte loaded or stored from global memory. Therefore, the 22 (*memory bandwidth limited*) rather than by the rate of floating point 25 This section discusses ways to maximize the memory bandwidth achieved by 43 : number of processes used. One can get close to peak memory bandwidth with only a 49 over the number of processes used. One can get close to peak memory 56 cores is required to saturate the memory channels. For example, a 58 than 80 percent of achievable peak memory bandwidth with only four 64 PETSc provides a simple way to measure memory bandwidth for different 67 one can obtain on the given machine (not necessarily a shared memory [all …]
|
| H A D | streams.md | 9 Most algorithms in PETSc are memory 10 …a simulation depends more on the total achievable [^achievable-footnote] memory bandwidth of the c… 12 … for gaining insights into parallel performance (scaling) by measuring achievable memory bandwidth. 22 …REAMS measures the total memory bandwidth achievable when running `n` independent threads or proce… 23 `N` on a shared memory node. 26 Though real simulations have more complex memory access patterns, most computations for PDEs have l… 27 independent non-overlapping memory STREAMS model still provides useful information. 30 …ficiency) obtained on a given system indicates the likely performance of memory bandwidth-limited … 32 Fig. {any}`fig_gcc_streams` plots the total memory bandwidth achieved and the speedup for runs on a… 43 There are three important concepts needed to understand memory bandwidth-limited computing. [all …]
|
| /petsc/src/ksp/ksp/tests/output/ |
| H A D | ex81_1.out | 27 …INFOG(21) (size in MB of memory effectively used during factorization - value on the most memory c… 28 …INFOG(22) (size in MB of memory effectively used during factorization - sum over all processors): 0 34 … INFOG(30, 31) (after solution: size in Mbytes of memory used during solution phase): 0, 0 39 …e of all MUMPS internal data for running BLR in-core - value on the most memory consuming processo… 41 … all MUMPS internal data for running BLR out-of-core - value on the most memory consuming processo… 76 …INFOG(21) (size in MB of memory effectively used during factorization - value on the most memory c… 77 …INFOG(22) (size in MB of memory effectively used during factorization - sum over all processors): 0 83 … INFOG(30, 31) (after solution: size in Mbytes of memory used during solution phase): 0, 0 88 …e of all MUMPS internal data for running BLR in-core - value on the most memory consuming processo… 90 … all MUMPS internal data for running BLR out-of-core - value on the most memory consuming processo…
|
| H A D | ex81_2.out | 39 …INFOG(21) (size in MB of memory effectively used during factorization - value on the most memory c… 40 …INFOG(22) (size in MB of memory effectively used during factorization - sum over all processors): 0 46 … INFOG(30, 31) (after solution: size in Mbytes of memory used during solution phase): 0, 0 51 …e of all MUMPS internal data for running BLR in-core - value on the most memory consuming processo… 53 … all MUMPS internal data for running BLR out-of-core - value on the most memory consuming processo… 107 …INFOG(21) (size in MB of memory effectively used during factorization - value on the most memory c… 108 …INFOG(22) (size in MB of memory effectively used during factorization - sum over all processors): 0 114 … INFOG(30, 31) (after solution: size in Mbytes of memory used during solution phase): 0, 0 119 …e of all MUMPS internal data for running BLR in-core - value on the most memory consuming processo… 121 … all MUMPS internal data for running BLR out-of-core - value on the most memory consuming processo…
|
| /petsc/doc/manualpages/doctext/ |
| H A D | doctextcommon.txt | 9 Operations using the _same_ `PetscDeviceContext` which access objects or memory regions 13 memory region are strongly write-ordered. That is, the following operations\: 21 _Given an operation `A-B` (e.g. `A` = `write`, `B` = `read`) on an object or memory 31 memory regions may execute in an arbitrary order and offer no guarantee of sequential 46 which asynchronously allocates memory and returns a pointer to the memory, the value
|
| /petsc/src/ksp/ksp/tutorials/output/ |
| H A D | ex52_4.out | 2 MUMPS INFOG(16) :: Max memory in MB = 2 3 MUMPS INFOG(17) :: Sum memory in MB = 6
|
| /petsc/doc/developers/ |
| H A D | kernel.md | 10 - memory management, 131 the error code `PETSCERRMEM` is used whenever a requested memory 168 `petscsys.h`, while the implementation code is in `src/sys/memory`. 183 ```{literalinclude} /../src/sys/memory/mal.c 190 ```{literalinclude} /../src/sys/memory/mal.c 202 support requests for zero bytes of memory correctly. Freeing memory 204 from accidentally using memory that has been freed. All PETSc memory 205 allocation calls are memory aligned on at least double-precision 210 `PetscMallocA()` supports a request for up to 7 distinct memory 213 increasing performance, and it clarifies in the code related memory [all …]
|
| /petsc/include/petsc/private/cpp/ |
| H A D | object_pool.hpp | 17 namespace memory namespace 31 struct hash<::Petsc::memory::align_val_t> { 33 using argument_type = ::Petsc::memory::align_val_t; 37 …constexpr size_t operator()(const ::Petsc::memory::align_val_t &x) const noexcept { return static_… in operator ()() 45 namespace memory namespace 339 using allocator_type = memory::PoolAllocator;
|
| /petsc/src/sys/objects/device/impls/cupm/ |
| H A D | cupmallocator.hpp | 31 class PETSC_SINGLE_LIBRARY_VISIBILITY_INTERNAL HostAllocator : public memory::impl::SegmentedMemory… 34 using base_type = memory::impl::SegmentedMemoryPoolAllocatorBase<PetscType>; 82 class PETSC_SINGLE_LIBRARY_VISIBILITY_INTERNAL DeviceAllocator : public memory::impl::SegmentedMemo… 85 using base_type = memory::impl::SegmentedMemoryPoolAllocatorBase<PetscType>;
|
| /petsc/src/sys/tests/output/ |
| H A D | ex2_1_alt.out | 1 [0]PETSC ERROR: Caught signal number 4 Illegal instruction: Likely due to memory corruption
|
| /petsc/src/dm/impls/plex/tests/output/ |
| H A D | ex35_tet.out | 5 Estimated memory 60096
|
| H A D | ex35_tet_interp.out | 5 Estimated memory 190496
|
| H A D | ex35_quad.out | 6 Estimated memory 3328
|
| H A D | ex35_hex.out | 6 Estimated memory 26096
|
| H A D | ex35_hex_interp.out | 6 Estimated memory 73496
|
| H A D | ex35_tri_interp.out | 6 Estimated memory 7728
|