Add LLama 3.2 1B and 3B lightweight models descriptions #130

jsmp · 2024-09-27T08:56:06Z

This adds ModelConfigurations for the new Llama 3.2 lightweight models with 1B and 3B parameters, and bumps the Xcode project dependency on swift-transformers so that it can use the new Sequence processor, required for Llama v3.2.

Llama 3.2 introduces two new lightweight versions with 1B and 3B parameters, which speeds the inference considerably when compared to the 3.1 8B version (on my M1 16Gb, it goes from 12 tokens/sec to 66 tokens/sec on the 1B model).

awni · 2024-09-28T13:23:18Z

Looks good! Can you run the formatting hook then we can merge it?

awni

Thank you!

jsmp added 3 commits September 27, 2024 09:46

Add LLama 3.2 1B and 3B lightweight models

fe9b312

Add LLama 3.2 1B and 3B lightweight models

b754c71

Merge branch 'main' of github.com:jsmp/mlx-swift-examples

7f070f5

Fix formatting

cd7d018

awni approved these changes Sep 30, 2024

View reviewed changes

awni merged commit 169650a into ml-explore:main Sep 30, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add LLama 3.2 1B and 3B lightweight models descriptions #130

Add LLama 3.2 1B and 3B lightweight models descriptions #130

jsmp commented Sep 27, 2024

awni commented Sep 28, 2024

awni left a comment

Add LLama 3.2 1B and 3B lightweight models descriptions #130

Add LLama 3.2 1B and 3B lightweight models descriptions #130

Conversation

jsmp commented Sep 27, 2024

awni commented Sep 28, 2024

awni left a comment

Choose a reason for hiding this comment