Does the current IA3 include the two additional loss terms? #930
Vincent-Li-9701 asked this question in Q&A. Answered by SumanthRH.
I was going through the source code of IA3, but it doesn't seem to contain the two additional loss terms proposed by the authors: an unlikelihood loss (L_UL) and a length-normalized loss (L_LN). Am I missing anything here? Could someone point me to where they are implemented?
Answered by SumanthRH on Sep 14, 2023
Hi, IA3 ("Infused Adapter by Inhibiting and Amplifying Inner Activations") is simply the PEFT method that adds additional trainable parameters. You seem to be referring to the loss functions in the full T-Few recipe. For those, you should have a look at the official implementation from the authors.
Yes, that's right! Custom loss functions don't seem to fit the goal of the PEFT library (which is why it only implements IA3, not the full training recipe in T-Few), so I don't think this is coming anytime soon. I will still cc @younesbelkada for a final word.
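For anyone who wants the extra terms in their own training loop, here is a minimal sketch of how they could be computed outside PEFT. It follows the loss definitions as I understand them from the T-Few paper and is not taken from the authors' code or from PEFT. All function names are hypothetical, and it assumes you already have the per-token log-probabilities of the correct answer choice and of each incorrect choice (each strictly below 0).

```python
import math

def length_normalized_score(token_logprobs):
    """Mean per-token log-probability of one answer choice."""
    return sum(token_logprobs) / len(token_logprobs)

def lm_loss(correct_logprobs):
    """Standard LM loss on the correct choice: negative mean token log-prob."""
    return -length_normalized_score(correct_logprobs)

def unlikelihood_loss(incorrect_logprobs):
    """Average of -log(1 - p) over every token of every incorrect choice.

    Assumes each log-prob is strictly negative (p < 1), otherwise log(0) fails.
    """
    total, count = 0.0, 0
    for choice in incorrect_logprobs:
        for lp in choice:
            # -expm1(lp) equals 1 - exp(lp), computed stably for small |lp|
            total += math.log(-math.expm1(lp))
            count += 1
    return -total / count

def length_normalized_loss(correct_logprobs, incorrect_logprobs):
    """Cross-entropy over the choices' length-normalized scores."""
    scores = [length_normalized_score(correct_logprobs)]
    scores += [length_normalized_score(c) for c in incorrect_logprobs]
    # log-sum-exp with max subtraction for numerical stability
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return -(scores[0] - log_z)

def t_few_style_loss(correct_logprobs, incorrect_logprobs):
    """Unweighted sum of the three terms (assumed combination)."""
    return (
        lm_loss(correct_logprobs)
        + unlikelihood_loss(incorrect_logprobs)
        + length_normalized_loss(correct_logprobs, incorrect_logprobs)
    )
```

In a real setup these would be tensor operations on the model's logits so that gradients flow; the plain-Python version is only meant to show what each term measures. The IA3 adapter from PEFT can stay unchanged, since only the loss computed on the model outputs differs.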