1.18.17

fhieber released this 24 May 12:08

· 511 commits to main since this release

8835331

[1.18.17]

Changed

Updated to MXNet 1.2
Use of the new LayerNormalization operator to save GPU memory.

[1.18.16]

Fixed

Removed summation of gradient arrays when logging gradients.
This clogged the memory on the primary GPU device over time when many checkpoints were done.
Gradient histograms are now logged to Tensorboard separated by device.

Assets 2