SparseML v1.0.1 Patch Release
This is a patch release for 1.0.0 that contains the following changes:
- Quantized ONNX graph folding has been fixed so that an extra quant/dequant pair is no longer added into the residuals for BERT-style models. The extra pair was causing an accuracy drop of up to 1% after exporting to ONNX.