Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add LLama 3.2 1B and 3B lightweight models descriptions #130

Merged
merged 4 commits into from
Sep 30, 2024

Conversation

jsmp
Copy link
Contributor

@jsmp jsmp commented Sep 27, 2024

This adds ModelConfigurations for the new Llama 3.2 lightweight models with 1B and 3B parameters, and bumps the Xcode project dependency on swift-transformers so that it can use the new Sequence processor, required for Llama v3.2.

Llama 3.2 introduces two new lightweight versions with 1B and 3B parameters, which speeds the inference considerably when compared to the 3.1 8B version (on my M1 16Gb, it goes from 12 tokens/sec to 66 tokens/sec on the 1B model).

@awni
Copy link
Member

awni commented Sep 28, 2024

Looks good! Can you run the formatting hook then we can merge it?

Copy link
Member

@awni awni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

@awni awni merged commit 169650a into ml-explore:main Sep 30, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants