History log of /petsc/src/vec/is/sf/impls/basic/allgatherv/sfallgatherv.h (Results 1 – 24 of 24)
Revision Date Author Comments
# d8e47b63 17-Sep-2024 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2024-09-02/rebased-fix-conversion-warnings' into 'main'

Compiler finds (and forbid) casts from higher precision integers to lower

See merge request petsc/petsc!7806


# 6497c311 25-Aug-2024 Barry Smith <bsmith@mcs.anl.gov>

Add to CI compilers flags '-Wconversion', '-Wno-sign-conversion', '-Wno-float-conversion', '-Wno-implicit-float-conversion']

Also fix the code to repository to compile cleanly with these flags in th

Add to CI compilers flags '-Wconversion', '-Wno-sign-conversion', '-Wno-float-conversion', '-Wno-implicit-float-conversion']

Also fix the code to repository to compile cleanly with these flags in the CI

show more ...


# 9dd11ecf 25-Aug-2023 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2023-08-17/header-guard-check' into 'main'

Check header guards

See merge request petsc/petsc!6822


# a4963045 18-Aug-2023 Jacob Faibussowitsch <jacob.fai@gmail.com>

Convert all header guards to pragma once


# 7bd9bc37 25-Sep-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/sf-fast-bcast' into 'main'

PetscSF: optimize SFALLGATHERV for the one-to-all pattern

See merge request petsc/petsc!5663


# 66100624 21-Sep-2022 Stefano Zampini <stefano.zampini@gmail.com>

PetscSF: optimize SFALLGATHERV for the one-to-all pattern

this happens with ML models using data parallelism having replicated local parameters
Add test


# 061e922f 22-Sep-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jacobf/2022-09-21/2-bike-2-shed' into 'main'

Feature: Bicycle Storage Facility 2

See merge request petsc/petsc!5661


# 6524c165 21-Sep-2022 Jacob Faibussowitsch <jacob.fai@gmail.com>

Transform all header-guards into ifndefs to make clang-format ignore them for preprocessor indentation


# 58d68138 23-Aug-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2022-08-21/clang-format-source' into 'main'

format repository with clang-format

See merge request petsc/petsc!5541


# 9371c9d4 22-Aug-2022 Satish Balay <balay@mcs.anl.gov>

clang-format: convert PETSc sources to comply with clang-format


# 7a560c3d 25-Jan-2022 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release'


# 1c0ae101 25-Jan-2022 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/fix-sfallgather-bug' into 'release'

SF: fix a bug in SFAllgather(v) when using non-gpu-aware MPI

See merge request petsc/petsc!4749


# 9319200a 14-Sep-2021 Junchao Zhang <jczhang@mcs.anl.gov>

SF: fix a bug in sfallgather/v when using non-gpu-aware MPI


# b0d1f38a 22-Mar-2020 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/jczhang/feature-sf-pack-direct'

Directly pass root/leafdata to MPI in SF when possible

See merge request petsc/petsc!2506


# 00816365 08-Jan-2020 Junchao Zhang <jczhang@mcs.anl.gov>

No need to get mtypes in PetscSFXxxEnd(). The cuda call is not cheap.


# cd620004 05-Dec-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Refactor SF packing

1) Separate out local communication from remote communication
2) Directly pass root/leafdata to MPI when suitable


# 203a8786 29-Nov-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/feature-sf-do-pack-on-gpu' into 'master'

Add support to do pack/unpack on GPU and do MPI on CPU

See merge request petsc/petsc!2205


# 855db38d 16-Oct-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Support data on device but no gpu-aware MPI for sf(all)gather(v)


# c1acdb04 28-Sep-2019 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/jczhang/feature-sf-on-gpu'

Add GPU-aware VecScatter/PetscSF

See merge request petsc/petsc!1995


# eb02082b 25-Sep-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Added SF GPU support


# b23bfdef 13-Aug-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Update pack/unpack routines to do packing/unpacking for all neighbors in at most two routines

One is used to pack data in self to self communication; The second is used for remote communication.
So

Update pack/unpack routines to do packing/unpacking for all neighbors in at most two routines

One is used to pack data in self to self communication; The second is used for remote communication.
So that on GPU, we can use at most two kernels to do packing/packing for all neighbors instead of multiple kernels

show more ...


# 05393080 25-Jul-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'jczhang/sf-more-opts' [PR #1567]

* jczhang/sf-more-opts:
Add more optimizations in SF and use it as the default for VecScatter.


# 9d1c8add 23-Jul-2019 Junchao Zhang <jczhang@mcs.anl.gov>

SF: Partially fix a bug when overlapped SF communications have same rootdata or leafdata on some ranks

Now we use two keys (rootdata, leafdata) to identify a pending communication. But that is still

SF: Partially fix a bug when overlapped SF communications have same rootdata or leafdata on some ranks

Now we use two keys (rootdata, leafdata) to identify a pending communication. But that is still not
enough for cases where communications have same rootdata and leafdata on some ranks. Currently We
error out on these cases. See src/vec/is/sf/examples/tutorials/ex2.c for various cases we can handle
and we can not handle.

show more ...


# dd5b3ca6 19-Jun-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Add patterned SF graphs and use x as roots and y as leaves in x to y vecscatter