readme example does not work for quantization pissa model #31

chuangzhidan · 2024-12-30T07:13:52Z

from trl import SFTTrainer
from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
MODEL_ID = "PiSSA-Llama-2-7b-hf-r128"
residual_model = AutoModelForCausalLM.from_pretrained(MODEL_ID,device_map="auto")
model = PeftModel.from_pretrained(residual_model, MODEL_ID, subfolder = "pissa_init", is_trainable=True)
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
dataset = load_dataset("imdb", split="train[:1%]") # Only use 1% of the dataset
trainer = SFTTrainer(
model=peft_model,
train_dataset=dataset,
dataset_text_field="text",
max_seq_length=128,
tokenizer=tokenizer,
)
trainer.train()
peft_model.save_pretrained("pissa-llama-2-7b-ft")

this example does not work for quantization model like

fxmeng/PiSSA-Llama-2-7B-r16-4bit-5iter, "fxmeng/PiSSA-Qwen2-7B-4bit-r128-5iter" and so on.

chuangzhidan · 2024-12-30T08:24:52Z

model = PeftModel.from_pretrained(residual_model, MODEL_ID, subfolder = "pissa_init", is_trainable=True) is a typo? should be：
peft_model = PeftModel.from_pretrained(residual_model, MODEL_ID, subfolder = "pissa_init", is_trainable=True)

because
trainer = SFTTrainer(
model=peft_model,
train_dataset=dataset,
dataset_text_field="text",
max_seq_length=128,
tokenizer=tokenizer,
)

also,those 2 parameters doesn't work anymore with newest trl-0.13.0:
dataset_text_field="text",
max_seq_length=128,

so ,how to change the code acccordingly?

chuangzhidan changed the title ~~does not work for quantization pissa model~~ readme example does not work for quantization pissa model Jan 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme example does not work for quantization pissa model #31

readme example does not work for quantization pissa model #31

chuangzhidan commented Dec 30, 2024 •

edited

Loading

chuangzhidan commented Dec 30, 2024 •

edited

Loading

readme example does not work for quantization pissa model #31

readme example does not work for quantization pissa model #31

Comments

chuangzhidan commented Dec 30, 2024 • edited Loading

chuangzhidan commented Dec 30, 2024 • edited Loading

chuangzhidan commented Dec 30, 2024 •

edited

Loading

chuangzhidan commented Dec 30, 2024 •

edited

Loading