| #
d8e47b63
|
| 17-Sep-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2024-09-02/rebased-fix-conversion-warnings' into 'main'
Compiler finds (and forbid) casts from higher precision integers to lower
See merge request petsc/petsc!7806
|
| #
6497c311
|
| 25-Aug-2024 |
Barry Smith <bsmith@mcs.anl.gov> |
Add to CI compilers flags '-Wconversion', '-Wno-sign-conversion', '-Wno-float-conversion', '-Wno-implicit-float-conversion']
Also fix the code to repository to compile cleanly with these flags in th
Add to CI compilers flags '-Wconversion', '-Wno-sign-conversion', '-Wno-float-conversion', '-Wno-implicit-float-conversion']
Also fix the code to repository to compile cleanly with these flags in the CI
show more ...
|
| #
9dd11ecf
|
| 25-Aug-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2023-08-17/header-guard-check' into 'main'
Check header guards
See merge request petsc/petsc!6822
|
| #
a4963045
|
| 18-Aug-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Convert all header guards to pragma once
|
| #
7bd9bc37
|
| 25-Sep-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'stefanozampini/sf-fast-bcast' into 'main'
PetscSF: optimize SFALLGATHERV for the one-to-all pattern
See merge request petsc/petsc!5663
|
| #
66100624
|
| 21-Sep-2022 |
Stefano Zampini <stefano.zampini@gmail.com> |
PetscSF: optimize SFALLGATHERV for the one-to-all pattern
this happens with ML models using data parallelism having replicated local parameters Add test
|
| #
061e922f
|
| 22-Sep-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2022-09-21/2-bike-2-shed' into 'main'
Feature: Bicycle Storage Facility 2
See merge request petsc/petsc!5661
|
| #
6524c165
|
| 21-Sep-2022 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Transform all header-guards into ifndefs to make clang-format ignore them for preprocessor indentation
|
| #
58d68138
|
| 23-Aug-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2022-08-21/clang-format-source' into 'main'
format repository with clang-format
See merge request petsc/petsc!5541
|
| #
9371c9d4
|
| 22-Aug-2022 |
Satish Balay <balay@mcs.anl.gov> |
clang-format: convert PETSc sources to comply with clang-format
|
| #
7a560c3d
|
| 25-Jan-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
1c0ae101
|
| 25-Jan-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/fix-sfallgather-bug' into 'release'
SF: fix a bug in SFAllgather(v) when using non-gpu-aware MPI
See merge request petsc/petsc!4749
|
| #
9319200a
|
| 14-Sep-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
SF: fix a bug in sfallgather/v when using non-gpu-aware MPI
|
| #
b0d1f38a
|
| 22-Mar-2020 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/jczhang/feature-sf-pack-direct'
Directly pass root/leafdata to MPI in SF when possible
See merge request petsc/petsc!2506
|
| #
00816365
|
| 08-Jan-2020 |
Junchao Zhang <jczhang@mcs.anl.gov> |
No need to get mtypes in PetscSFXxxEnd(). The cuda call is not cheap.
|
| #
cd620004
|
| 05-Dec-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Refactor SF packing
1) Separate out local communication from remote communication 2) Directly pass root/leafdata to MPI when suitable
|
| #
203a8786
|
| 29-Nov-2019 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/feature-sf-do-pack-on-gpu' into 'master'
Add support to do pack/unpack on GPU and do MPI on CPU
See merge request petsc/petsc!2205
|
| #
855db38d
|
| 16-Oct-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Support data on device but no gpu-aware MPI for sf(all)gather(v)
|
| #
c1acdb04
|
| 28-Sep-2019 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/jczhang/feature-sf-on-gpu'
Add GPU-aware VecScatter/PetscSF
See merge request petsc/petsc!1995
|
| #
eb02082b
|
| 25-Sep-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Added SF GPU support
|
| #
b23bfdef
|
| 13-Aug-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Update pack/unpack routines to do packing/unpacking for all neighbors in at most two routines
One is used to pack data in self to self communication; The second is used for remote communication. So
Update pack/unpack routines to do packing/unpacking for all neighbors in at most two routines
One is used to pack data in self to self communication; The second is used for remote communication. So that on GPU, we can use at most two kernels to do packing/packing for all neighbors instead of multiple kernels
show more ...
|
| #
05393080
|
| 25-Jul-2019 |
Karl Rupp <me@karlrupp.net> |
Merge branch 'jczhang/sf-more-opts' [PR #1567]
* jczhang/sf-more-opts: Add more optimizations in SF and use it as the default for VecScatter.
|
| #
9d1c8add
|
| 23-Jul-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
SF: Partially fix a bug when overlapped SF communications have same rootdata or leafdata on some ranks
Now we use two keys (rootdata, leafdata) to identify a pending communication. But that is still
SF: Partially fix a bug when overlapped SF communications have same rootdata or leafdata on some ranks
Now we use two keys (rootdata, leafdata) to identify a pending communication. But that is still not enough for cases where communications have same rootdata and leafdata on some ranks. Currently We error out on these cases. See src/vec/is/sf/examples/tutorials/ex2.c for various cases we can handle and we can not handle.
show more ...
|
| #
dd5b3ca6
|
| 19-Jun-2019 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Add patterned SF graphs and use x as roots and y as leaves in x to y vecscatter
|