
Bump transformers from 4.33.1 to 4.38.1 #1576

Closed
Wanted to merge 1 commit from the dependabot/pip/transformers-4.38.1 branch

Conversation


dependabot[bot] (Contributor) commented on behalf of github on Feb 26, 2024

Bumps transformers from 4.33.1 to 4.38.1.

Release notes

Sourced from transformers' releases.

v4.38.1

Fix eager attention in Gemma!

TLDR:

-        attn_output = attn_output.reshape(bsz, q_len, self.hidden_size)
+        attn_output = attn_output.view(bsz, q_len, -1)
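
For context, a hedged note beyond the release text: in Gemma the projection dimension num_heads * head_dim need not equal hidden_size, so reshaping the eager-attention output to hidden_size breaks, while -1 lets view() infer the correct last dimension. A minimal PyTorch sketch with assumed, Gemma-7B-like dimensions (16 heads of dim 256, hidden_size 3072):

import torch

# Illustrative values: num_heads * head_dim (4096) != hidden_size (3072).
bsz, q_len, num_heads, head_dim, hidden_size = 1, 4, 16, 256, 3072
attn_output = torch.randn(bsz, q_len, num_heads, head_dim)

# Old code: RuntimeError, since each token has 16 * 256 = 4096 elements, not 3072.
# attn_output.reshape(bsz, q_len, hidden_size)

# Fixed code: -1 infers num_heads * head_dim = 4096.
out = attn_output.view(bsz, q_len, -1)
print(out.shape)  # torch.Size([1, 4, 4096])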

v4.38: Gemma, Depth Anything, Stable LM; Static Cache, HF Quantizer, AQLM

New model additions

💎 Gemma 💎

Gemma is a new open-source language model series from Google AI that comes in 2B and 7B variants. The release includes pre-trained and instruction fine-tuned versions, and you can use them via AutoModelForCausalLM, GemmaForCausalLM, or the pipeline interface!

Read more about it in the Gemma release blog post: https://hf.co/blog/gemma

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b", device_map="auto", torch_dtype=torch.float16
)

input_text = "Write me a poem about Machine Learning."
# device_map="auto" places the model on GPU, so move the inputs to "cuda" too.
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids)
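
generate() returns token IDs, so you would decode them to read the poem; the pipeline interface mentioned above offers a one-line alternative. A minimal sketch continuing the snippet above (max_new_tokens is an assumed, illustrative parameter):

print(tokenizer.decode(outputs[0], skip_special_tokens=True))

# The same via the high-level pipeline interface:
from transformers import pipeline

generator = pipeline("text-generation", model="google/gemma-2b", device_map="auto")
print(generator("Write me a poem about Machine Learning.", max_new_tokens=64)[0]["generated_text"])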

You can use the model with Flash Attention 2, SDPA, the static cache, and the quantization API for further optimizations; sketches for each follow below.

  • Flash Attention 2
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b", device_map="auto", torch_dtype=torch.float16,
    attn_implementation="flash_attention_2",
)
input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids)

... (truncated)
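
The release notes are truncated above, but the remaining optimizations follow the same pattern. A hedged sketch of SDPA, the new static KV cache, and 4-bit quantization, assuming the standard 4.38-era transformers APIs (attn_implementation="sdpa", generation_config.cache_implementation, and BitsAndBytesConfig):

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# SDPA: use PyTorch's scaled_dot_product_attention kernel.
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b", device_map="auto", torch_dtype=torch.float16,
    attn_implementation="sdpa",
)

# Static KV cache (new in 4.38): pre-allocated, so generate() plays well with torch.compile.
model.generation_config.cache_implementation = "static"

# Quantization API: load the weights in 4-bit via bitsandbytes.
quantized_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b",
    device_map="auto",
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
)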

Commits
  • a085774 Release: v4.38.1
  • 2f54e0b [Gemma] Fix eager attention (#29187)
  • 08ab54a [ gemma] Adds support for Gemma 💎 (#29167)
  • 2de9314 [Maskformer] safely get backbone config (#29166)
  • 476957b 🚨 Llama: update rope scaling to match static cache changes (#29143)
  • 7a4bec6 Release: 4.38.0
  • ee3af60 Add support for fine-tuning CLIP-like models using contrastive-image-text exa...
  • 0996a10 Revert low cpu mem tie weights (#29135)
  • 15cfe38 [Core tokenization] add_dummy_prefix_space option to help with latest is...
  • efdd436 FIX [PEFT / Trainer ] Handle better peft + quantized compiled models (#29...
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependabot[bot] added the dependencies (Pull requests that update a dependency file) and python (Pull requests that update Python code) labels on Feb 26, 2024
dependabot[bot] requested a review from a team on February 26, 2024 05:39
dependabot[bot] force-pushed the dependabot/pip/transformers-4.38.1 branch from aeee718 to f83bed8 on February 28, 2024 14:16
sakoush (Member) commented on Feb 29, 2024

@dependabot recreate

Bumps [transformers](https://github.com/huggingface/transformers) from 4.33.1 to 4.38.1.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](huggingface/transformers@v4.33.1...v4.38.1)

---
updated-dependencies:
- dependency-name: transformers
  dependency-type: direct:development
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
dependabot[bot] force-pushed the dependabot/pip/transformers-4.38.1 branch from f83bed8 to 83e2f12 on February 29, 2024 15:27
dependabot[bot] (Contributor, Author) commented on behalf of github on Mar 1, 2024

Superseded by #1605.

dependabot[bot] closed this on Mar 1, 2024
dependabot[bot] deleted the dependabot/pip/transformers-4.38.1 branch on March 1, 2024 14:31