Skip to content

Releases: pmodels/mpich

v4.0.1

23 Feb 18:10
19e4faf
Compare
Choose a tag to compare

Changes in 4.0.1

  • Multiple fixes for NVIDIA/PGI HPC Compilers support

  • Fix ch4:ofi:gni provider capability set

  • Fix MPI_SESSION_INIT "thread_level" info hint

  • Fix build on macOS with --disable-shared

  • Fix QMPI function definitions

  • Fix support for "host" info hint in MPI_COMM_SPAWN[_MULTIPLE]

  • Fix manpage generation

  • Add missing MPI_F_sync_reg function

  • Add missing const to MPI_Psend_init buffer argument

  • Make Python 3 optional in configure script

  • Remove -Wl,flat_namespace from compile wrappers by default (macOS only)

  • Update UCX module to v1.12.0

  • Update yaksa module to support latest Ampere compute capability

v4.0

21 Jan 21:10
9d9d208
Compare
Choose a tag to compare

Changes in 4.0

  • All MPI-4 APIs have been implemented. Major MPI-4 features include MPI
    sessions, partitioned point-to-point communications, events in the MPI tool
    information interface, large-count functions, persistent collectives,
    MPI_Comm_idup_with_info, MPI_Isendrecv and MPI_Isendrecv_replace,
    MPI_Info_get_string, MPI_Comm_split_type with new split_type --
    MPI_COMM_TYPE_HW_GUIDED and MPI_COMM_TYPE_HW_UNGUIDED.

  • Add QMPI (experimental) support.

  • Add MPIX_Delete_error_{class,code,string}.

  • MPI_Info objects can be accessed before MPI_Init{_thread}.

  • Generate C API interface functions including man page notes and error
    checking using Python scripts.

  • Generate Fortran (mpif.h, mpi_f08) bindings using Python scripts.

  • Generate collective entrance functions and generate per-algorithm tests.

  • Support explicit --without-cuda configure option.

  • Drop support for UCX version < 1.7.0.

  • Configure now optionally require Python 3 (when F08 is enabled).

  • Multi-NIC support in ch4:ofi.

  • Default to ch4:ofi when configure doesn't have a clear choice. Add message
    block at the end of configure to advise user.

  • Multiple VCI is fully implemented including the active message fallback paths.

  • Extend IPC to support non-contig datatypes.

  • Add AMD GPU support using HIP.

  • Add generic RNDV callback mechanism with active messages.

  • Refactor ch4 dynamic process functions.

  • Avoid building MPL and hwloc multiple times.

  • Fix MPIX_Query_cuda_support.

  • Many bug fixes and code clean-ups.

v4.0rc3

11 Jan 22:13
c41ba6d
Compare
Choose a tag to compare
v4.0rc3 Pre-release
Pre-release
tagging 'v4.0rc3'

v4.0rc2

07 Jan 19:17
bc0a035
Compare
Choose a tag to compare
v4.0rc2 Pre-release
Pre-release
tagging 'v4.0rc2'

v3.4.3

16 Dec 18:20
7aeecca
Compare
Choose a tag to compare
tagging 'v3.4.3'

v4.0a2

11 Jun 14:18
Compare
Choose a tag to compare
v4.0a2 Pre-release
Pre-release
tagging 'v4.0a2'

v3.4.2

28 May 14:49
de06592
Compare
Choose a tag to compare
tagging 'v3.4.2'

MPICH 4.0a1

14 Mar 20:47
e414249
Compare
Choose a tag to compare
MPICH 4.0a1 Pre-release
Pre-release
v4.0a1

tagging 'v4.0a1'

MPICH 3.4.1

14 Mar 20:49
f3f380e
Compare
Choose a tag to compare
v3.4.1

tagging 'v3.4.1'

v3.4

25 Jul 17:38
c91dae4
Compare
Choose a tag to compare

Changes in 3.4

  • ch4 replaces ch3 as the default device configuration. If no network
    module is specified at configuration-time, MPICH will search the
    user environment in order to select one to build. The user will be
    prompted to choose if no preferred network library is detected.

  • Add support for Yaksa datatype engine (default in ch4).

  • Add support for GPU buffers (CUDA, Level Zero) in pt2pt,
    collectives, and one-sided communication.

  • Add support for XPMEM.

  • Add support for multiple virtual communication interfaces for more
    efficient MPI_THREAD_MULTIPLE (experimental).

  • Add DAOS ADIO driver to ROMIO (contributed by Intel).

  • Add Quobyte ADIO driver to ROMIO (contributed by Quobyte).

  • Add support for Arm compiler toolchain

  • Add support for NVIDIA HPC compilers

  • Add support for flang/f18 Fortran compiler

  • Add support for AddressSanitizer and UndefinedBehaviorSanitizer to
    debug configuration

  • Remove mxm, llc, and portals4 netmods from ch3.

  • Remove support for logical reduction operations on floating point
    types.

  • Remove MPIX_Mutex interfaces.

  • Further improvements to ch4 business card exchange: extra
    long address support and fixes for PMIx integration.

  • Un-inline non-critical ch4 code for improved build times.

  • Fix several test program bugs.

  • Fix several static analysis and compiler warnings.

  • Change the signature of MPID_Init to include requested and provided
    thread levels.