TALNT: Teach an LLM New Tricks

(NOTE: This is a work in progress and requires some adjustments. It is also unproven.)

Typically, adding a new token to a model, whether a word or some sort of special token/action, has required non-trivial retraining or finetuning before the model learns how to use it. This utility adds a token to a HuggingFace Transformers model and tokenizer using a description of the new token instead. It relies on the observation that the sum of the embeddings of the tokens in a definition often shows a high cosine similarity to the embedding of the token being defined (and that embeddings, to some extent, follow algebraic rules). The sum of the embeddings of the tokens in the definition/description is used to initialize the new token's entry in the transformer's token embedding table, giving the model a better starting point for using the token. Some finetuning will still be necessary to train the final linear layer, but the hope is that it requires only a small handful of examples.
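The initialization described above can be sketched roughly as follows. This is a minimal illustration, not the code in `TALNT.py`: the helper name `add_token_from_description` and its signature are hypothetical, and it assumes a standard HuggingFace Transformers model and tokenizer exposing `add_tokens`, `resize_token_embeddings`, and `get_input_embeddings`.

```python
import torch


def add_token_from_description(model, tokenizer, new_token, description):
    """Add `new_token` and initialize its input embedding from `description`.

    Hypothetical helper for illustration only; not the API of TALNT.py.
    Returns the id assigned to the new token.
    """
    # Tokenize the description BEFORE adding the new token, so it is encoded
    # purely with the existing vocabulary.
    desc_ids = tokenizer(description, add_special_tokens=False)["input_ids"]
    if not desc_ids:
        raise ValueError("description produced no tokens")

    # Register the token and grow the embedding table by one entry.
    if tokenizer.add_tokens([new_token]) == 0:
        raise ValueError(f"{new_token!r} is already in the vocabulary")
    model.resize_token_embeddings(len(tokenizer))
    new_id = tokenizer.convert_tokens_to_ids(new_token)

    # Overwrite the freshly added entry with the sum of the embeddings of the
    # description's tokens.
    weight = model.get_input_embeddings().weight
    index = torch.tensor(desc_ids, device=weight.device)
    with torch.no_grad():
        weight[new_id] = weight[index].sum(dim=0)
    return new_id
```

With a real checkpoint, one would presumably load the model and tokenizer with `AutoModelForCausalLM.from_pretrained` and `AutoTokenizer.from_pretrained`, then call the helper with the new token and a short description of it. Only the input embedding is initialized here; as noted above, some finetuning is still expected for the final linear layer.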

0xLienid/TALNT
