Lines Matching refs:any
32 Fig. {any}`fig_gcc_streams` plots the total memory bandwidth achieved and the speedup for runs on a…
51 …non-uniform memory access (**NUMA**), meaning the memory latency or bandwidth for any particular c…
149 Fig. {any}`fig_streams` shows the achieved bandwidth for gcc, icc, icx, and nvc using MPI and OpenM…
159 Note the two dips in the performance with OpenMP and gcc using binding in Fig. {any}`fig_gcc_stream…
165 Fig. {any}`fig_icc_streams` shows the performance with the icc compiler. Note that the icc compiler…
181 Fig. {any}`fig_icx_streams` shows the performance with the icx compiler.
198 The results are displayed in Fig. {any}`fig_icc_O1_streams`; sure enough, the results now match tha…
207 Next we display the STREAMS results using gcc with parallel efficiency instead of speedup in {any}`…
220 - We do not have any explanation why the improvement in speedup for gcc, icx, and nvc slows down be…
225 Fig. {any}`fig_m2_gcc_streams` provides the results. Based on the plateau in the middle of the plot…
243 … and the time for the matrix-vector products. These are displayed in Fig. {any}`fig_gamg`. The run…
267 Fig. {any}`fig_m2_gamg` provides the results. The performance is better than predicted by the STREA…
282 We plot the speedup in Fig. {any}`fig_gamg_server` and parallel efficiency in {any}`fig_gamg_server…
320 In {any}`fig_gamg_server_pe_streams`, we plot the parallel efficiency of the linear solve and the S…
324 in {any}`fig_m2_gamg_server_shared_speedup`.