Skip to content

SparseZoo v1.7.0

Compare
Choose a tag to compare
@jeanniefinks jeanniefinks released this 15 Mar 01:18
0b58962

New Features:

  • Download support for LLMs (#379, #436)
  • Functionality to support channelwise quantization analysis (#441)
  • Chunked downloads for improved handling of large files (#446, #471)
  • SparseZoo Model Additions:
    • For various NL tasks, including chat, instruction tuning, code generation, summarization, question answering, and arithmetic reasoning:
      • Sparsified and baseline: Llama 7B (view) | Mistral 7B (view)
    • For code generation:
      • Sparsified and baseline: CodeLlama 7B (view)

Changes:

  • Deployment directories are directly downloaded as tar.gz and subsequently unzipped, enabling faster downloads. (#389)
  • Timestamps are appended with "nightly" are to improve analytics aggregation. (#406)
  • Reference to LICENSE-NEURALMAGIC removed and consolidated model license attributions into a single file. (#400, #409)
  • Legacy sparsezoo.analyze functionality has been renamed to sparsezoo.analyze_v1 for clarity. (#460)
  • The ability to overwrite existing files during download was added, resulting in auto-correction of file corruption errors for model downloads. (#453)

Resolved Issues:

  • Reporting issues resolved within GitHub Actions. (#382)
  • Manual kickoff of GitHub Actions workflows implemented. (#382)
  • Multiple names can now be passed for registering a value in the RegistryMixin class. (#385)
  • Incorrect shape computations with the ONNX Runtime no longer result in incorrect FLOPs calculations through the analyze functionality. (#408)
  • Processing file paths during analysis no longer results in analyze pathways crashing for certain paths. (#425)
  • sparsezoo.model.download now works as intended, where it would previously not download all of the files necessary for Transformers-based models. (#422)
  • Handling by raising TypeErrors instead of ValueErrors improved. (#427)
  • Instructions within the sparsezoo.analyze command-line tool were corrected. (#433)
  • SparseZoo's analysis functionality now results in correct values and no longer crashes when analyzing LLMs. (#421, #461, #462, #463, #438)
  • SparseZoo models can correctly access metrics.yaml files, whereas before these files were not available. (#431)
  • SparseZoo now handles external data for smaller models correctly, where smaller LLMs could previously fail to download correctly. (#443)
  • Handling dictionary values for SparseZoo model objects previously resulted in crashes for LLM downloads. (#448)
  • Reloading previous sparsezoo.analyze results no longer results in serialization errors for channelwise quantized models. (#455)
  • An external data bug was resolved that had resulted in LLMs not downloading properly. (#468)

Known Issues:

  • None