Release SparseZoo v1.7.0 · neuralmagic/sparsezoo

New Features:

Download support for LLMs (#379, #436)
Functionality to support channelwise quantization analysis (#441)
Chunked downloads for improved handling of large files (#446, #471)
SparseZoo Model Additions:
- For various NL tasks, including chat, instruction tuning, code generation, summarization, question answering, and arithmetic reasoning:
  - Sparsified and baseline: Llama 7B (view) | Mistral 7B (view)
- For code generation:
  - Sparsified and baseline: CodeLlama 7B (view)

Deployment directories are directly downloaded as tar.gz and subsequently unzipped, enabling faster downloads. (#389)
Timestamps are appended with "nightly" are to improve analytics aggregation. (#406)
Reference to LICENSE-NEURALMAGIC removed and consolidated model license attributions into a single file. (#400, #409)
Legacy sparsezoo.analyze functionality has been renamed to sparsezoo.analyze_v1 for clarity. (#460)
The ability to overwrite existing files during download was added, resulting in auto-correction of file corruption errors for model downloads. (#453)

Reporting issues resolved within GitHub Actions. (#382)
Manual kickoff of GitHub Actions workflows implemented. (#382)
Multiple names can now be passed for registering a value in the RegistryMixin class. (#385)
Incorrect shape computations with the ONNX Runtime no longer result in incorrect FLOPs calculations through the analyze functionality. (#408)
Processing file paths during analysis no longer results in analyze pathways crashing for certain paths. (#425)
sparsezoo.model.download now works as intended, where it would previously not download all of the files necessary for Transformers-based models. (#422)
Handling by raising TypeErrors instead of ValueErrors improved. (#427)
Instructions within the sparsezoo.analyze command-line tool were corrected. (#433)
SparseZoo's analysis functionality now results in correct values and no longer crashes when analyzing LLMs. (#421, #461, #462, #463, #438)
SparseZoo models can correctly access metrics.yaml files, whereas before these files were not available. (#431)
SparseZoo now handles external data for smaller models correctly, where smaller LLMs could previously fail to download correctly. (#443)
Handling dictionary values for SparseZoo model objects previously resulted in crashes for LLM downloads. (#448)
Reloading previous sparsezoo.analyze results no longer results in serialization errors for channelwise quantized models. (#455)
An external data bug was resolved that had resulted in LLMs not downloading properly. (#468)