This repository was archived by the owner on Jun 3, 2025. It is now read-only.

SparseML v1.0.1 Patch Release

jeanniefinks released this 13 Jul 11:04

0fa9f72

This is a patch release for 1.0.0 that contains the following changes:

Resolved a quantized ONNX graph folding issue that added an extra quant/dequant pair into the residuals for BERT-style models. This caused an accuracy drop of up to 1% after exporting to ONNX and is now fixed.
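To illustrate why a stray quant/dequant pair on a residual path can cost accuracy, here is a minimal pure-Python sketch. It is not the SparseML export code: `fake_quant`, the sample values, and both scales are hypothetical, chosen only to show that a second rounding step with a different scale adds error on top of the first.

```python
def fake_quant(x, scale, zero_point=0, qmin=-128, qmax=127):
    """Quantize x to the int8 range and dequantize it again (one Q/DQ pair)."""
    q = round(x / scale) + zero_point
    q = max(qmin, min(qmax, q))  # clamp to the int8 range
    return (q - zero_point) * scale


# Hypothetical residual activations and scales (assumptions for illustration).
values = [0.013, 0.27, -0.41, 0.88, -0.95]
scale_a = 1.0 / 127  # scale of the intended Q/DQ pair
scale_b = 1.5 / 127  # scale of the unintended extra Q/DQ pair

once = [fake_quant(v, scale_a) for v in values]   # intended graph
twice = [fake_quant(v, scale_b) for v in once]    # graph with the extra pair

err_once = max(abs(v - q) for v, q in zip(values, once))
err_twice = max(abs(v - q) for v, q in zip(values, twice))

print(f"max error with one Q/DQ pair:  {err_once:.6f}")
print(f"max error with two Q/DQ pairs: {err_twice:.6f}")
```

With these inputs the second pair increases the worst-case error, which is the kind of drift that can accumulate across the residual connections of a deep model.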
