Releases: pmodels/mpich
v4.0.1
Changes in 4.0.1
-
Multiple fixes for NVIDIA/PGI HPC Compilers support
-
Fix ch4:ofi:gni provider capability set
-
Fix MPI_SESSION_INIT "thread_level" info hint
-
Fix build on macOS with --disable-shared
-
Fix QMPI function definitions
-
Fix support for "host" info hint in MPI_COMM_SPAWN[_MULTIPLE]
-
Fix manpage generation
-
Add missing MPI_F_sync_reg function
-
Add missing const to MPI_Psend_init buffer argument
-
Make Python 3 optional in configure script
-
Remove -Wl,flat_namespace from compile wrappers by default (macOS only)
-
Update UCX module to v1.12.0
-
Update yaksa module to support latest Ampere compute capability
v4.0
Changes in 4.0
-
All MPI-4 APIs have been implemented. Major MPI-4 features include MPI
sessions, partitioned point-to-point communications, events in the MPI tool
information interface, large-count functions, persistent collectives,
MPI_Comm_idup_with_info, MPI_Isendrecv and MPI_Isendrecv_replace,
MPI_Info_get_string, MPI_Comm_split_type with new split_type --
MPI_COMM_TYPE_HW_GUIDED and MPI_COMM_TYPE_HW_UNGUIDED. -
Add QMPI (experimental) support.
-
Add MPIX_Delete_error_{class,code,string}.
-
MPI_Info objects can be accessed before MPI_Init{_thread}.
-
Generate C API interface functions including man page notes and error
checking using Python scripts. -
Generate Fortran (mpif.h, mpi_f08) bindings using Python scripts.
-
Generate collective entrance functions and generate per-algorithm tests.
-
Support explicit --without-cuda configure option.
-
Drop support for UCX version < 1.7.0.
-
Configure now optionally require Python 3 (when F08 is enabled).
-
Multi-NIC support in ch4:ofi.
-
Default to ch4:ofi when configure doesn't have a clear choice. Add message
block at the end of configure to advise user. -
Multiple VCI is fully implemented including the active message fallback paths.
-
Extend IPC to support non-contig datatypes.
-
Add AMD GPU support using HIP.
-
Add generic RNDV callback mechanism with active messages.
-
Refactor ch4 dynamic process functions.
-
Avoid building MPL and hwloc multiple times.
-
Fix MPIX_Query_cuda_support.
-
Many bug fixes and code clean-ups.
v4.0rc3
v4.0rc2
v3.4.3
v4.0a2
v3.4.2
MPICH 4.0a1
v4.0a1 tagging 'v4.0a1'
MPICH 3.4.1
v3.4.1 tagging 'v3.4.1'
v3.4
Changes in 3.4
-
ch4 replaces ch3 as the default device configuration. If no network
module is specified at configuration-time, MPICH will search the
user environment in order to select one to build. The user will be
prompted to choose if no preferred network library is detected. -
Add support for Yaksa datatype engine (default in ch4).
-
Add support for GPU buffers (CUDA, Level Zero) in pt2pt,
collectives, and one-sided communication. -
Add support for XPMEM.
-
Add support for multiple virtual communication interfaces for more
efficient MPI_THREAD_MULTIPLE (experimental). -
Add DAOS ADIO driver to ROMIO (contributed by Intel).
-
Add Quobyte ADIO driver to ROMIO (contributed by Quobyte).
-
Add support for Arm compiler toolchain
-
Add support for NVIDIA HPC compilers
-
Add support for flang/f18 Fortran compiler
-
Add support for AddressSanitizer and UndefinedBehaviorSanitizer to
debug configuration -
Remove mxm, llc, and portals4 netmods from ch3.
-
Remove support for logical reduction operations on floating point
types. -
Remove MPIX_Mutex interfaces.
-
Further improvements to ch4 business card exchange: extra
long address support and fixes for PMIx integration. -
Un-inline non-critical ch4 code for improved build times.
-
Fix several test program bugs.
-
Fix several static analysis and compiler warnings.
-
Change the signature of MPID_Init to include requested and provided
thread levels.