1.18.17

@fhieber fhieber released this 24 May 12:08
· 511 commits to main since this release
8835331

[1.18.17]

Changed

  • Updated to MXNet 1.2.
  • Switched to the new LayerNormalization operator to save GPU memory.

[1.18.16]

Fixed

  • Removed the summation of gradient arrays when logging gradients.
    The summation gradually filled up memory on the primary GPU device when many checkpoints were written.
    Gradient histograms are now logged to TensorBoard separately for each device.
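
The per-device logging described in the fix above can be pictured roughly as follows. This is a minimal, framework-free sketch and not the project's actual code: the helper names `histogram` and `per_device_histograms`, the tag format `<param>_grad/dev<N>`, and the use of plain Python lists in place of GPU arrays are all assumptions made for illustration. The point it shows is that each device's gradient is summarized on its own, so nothing is summed across devices before logging.

```python
from typing import Dict, List, Sequence


def histogram(values: Sequence[float], num_bins: int = 4) -> List[int]:
    """Count values into num_bins equal-width bins spanning [min, max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: put everything in the first bin.
        return [len(values)] + [0] * (num_bins - 1)
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        # Clamp the index so that v == hi lands in the last bin.
        idx = min(int((v - lo) / width), num_bins - 1)
        counts[idx] += 1
    return counts


def per_device_histograms(
    grads: Dict[str, List[Sequence[float]]], num_bins: int = 4
) -> Dict[str, List[int]]:
    """Build one histogram per (parameter, device) pair.

    `grads` maps a parameter name to a list holding one gradient per device
    (hypothetical layout). No cross-device sum is computed.
    """
    out: Dict[str, List[int]] = {}
    for name, device_grads in grads.items():
        for device_id, grad in enumerate(device_grads):
            # Hypothetical tag format, one entry per device.
            out["%s_grad/dev%d" % (name, device_id)] = histogram(grad, num_bins)
    return out


if __name__ == "__main__":
    example = {"w": [[0.0, 1.0, 2.0, 4.0], [1.0, 1.0]]}
    for tag, counts in sorted(per_device_histograms(example).items()):
        print(tag, counts)
```

In a real training loop, each resulting entry would be handed to whatever histogram-logging call the logging backend provides, under its own per-device tag.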