Skip to content
This repository was archived by the owner on Jun 3, 2025. It is now read-only.

SparseML v1.0.1 Patch Release

Choose a tag to compare

@jeanniefinks jeanniefinks released this 13 Jul 11:04
0fa9f72

This is a patch release for 1.0.0 that contains the following changes:

  • Quantized ONNX graph folding resolution that prevents and extra quant/dequant pair being added into the residuals for BERT style models. This was causing an accuracy drop after exporting to ONNX of up to 1% and is now fixed.