Hello!

We are working with LLaMA, which is not supported in the old version of Adapter-Transformers, so we decided to use the beta (the Adapters branch). The issue is that it is unclear how to load the LLaMA language-modeling head: by default, the model is loaded without any head, which is a problem. That approach made a lot of sense in the BERT era, but it is less intuitive now that people want to fine-tune language models with their original head on.

Here's what we tried: adding a causal LM head with add_causal_lm_head, but it seems that this way the added head is randomly initialized.

Could you help us get the original LLaMA head weights?
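The exact snippet is not reproduced in the thread, but the attempt described above presumably looked roughly like the following sketch (AutoAdapterModel and the head name "lm_head" are illustrative choices; the checkpoint name is taken from the reply below):

from adapters import AutoAdapterModel

# Load the model without any prediction head, then attach a new causal LM head.
model = AutoAdapterModel.from_pretrained("meta-llama/Llama-2-7b-hf")
model.add_causal_lm_head("lm_head")  # this head is freshly created, i.e. randomly initialized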
Hello @Guitaricet, yes, you are correct: add_causal_lm_head adds a randomly initialized language-modeling head. What you can do here is use the original transformers class and initialize it for adapters afterwards:
import adapters
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
adapters.init(model)
This should solve the problem (let me know if it does not).
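From there, the usual adapter training workflow should work on top of the pretrained head; here is a minimal sketch, assuming a LoRA setup (the adapter name and LoRA hyperparameters are arbitrary examples, not something prescribed in this thread):

import adapters
from adapters import LoRAConfig
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
adapters.init(model)  # adds adapter support in place; the original lm_head weights are kept

# Add a LoRA adapter and train only its weights (the base model stays frozen).
model.add_adapter("llama_lora", config=LoRAConfig(r=8, alpha=16))
model.train_adapter("llama_lora")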