First of all, I'd like to express my appreciation for the outstanding work you've done on this project. Your efforts are truly commendable!
I recently read the SigLIP paper, in which the authors propose replacing the conventional softmax-based contrastive loss with a pairwise sigmoid loss, and show that it can improve learning, particularly at smaller batch sizes. Given the benefits reported in the paper, I'm curious whether there are any plans to integrate this approach into the repository.
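For reference, the core of the loss is simple on a single device. Here's a minimal sketch of the pairwise sigmoid loss as described in the paper (the name `sigmoid_loss` is mine; it assumes L2-normalized features plus the learnable `logit_scale`/`logit_bias` from the paper):

```python
import torch
import torch.nn.functional as F

def sigmoid_loss(image_feats, text_feats, logit_scale, logit_bias):
    # Pairwise logits between every image and every text in the batch.
    logits = logit_scale * image_feats @ text_feats.t() + logit_bias
    n = logits.size(0)
    # Label +1 on the diagonal (matching pairs), -1 everywhere else.
    labels = 2 * torch.eye(n, device=logits.device) - 1
    # -log sigmoid(label * logit), summed over all pairs, averaged per image.
    return -F.logsigmoid(labels * logits).sum() / n
```

The harder part is the distributed/chunked formulation from the paper, which shifts features between devices instead of gathering them all.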
I had this about 80% of the way there, but hadn't debugged the isend/irecv code that does the neighbour shifting (the paper's authors use jax.lax.ppermute for this, which has no direct PyTorch primitive).
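For anyone curious, the shift itself is conceptually a ring permute; something like the following sketch plays the role of jax.lax.ppermute (`neighbour_shift` is a hypothetical helper, not the PR code; it assumes the default process group and same-shape tensors on every rank):

```python
import torch
import torch.distributed as dist

def neighbour_shift(tensor, direction=1):
    # Ring-shift: send our tensor `direction` ranks forward and receive
    # the tensor arriving from `direction` ranks behind us.
    rank = dist.get_rank()
    world_size = dist.get_world_size()
    send_to = (rank + direction) % world_size
    recv_from = (rank - direction) % world_size
    recv_buf = torch.empty_like(tensor)
    ops = [
        dist.P2POp(dist.isend, tensor.contiguous(), send_to),
        dist.P2POp(dist.irecv, recv_buf, recv_from),
    ]
    # batch_isend_irecv pairs the sends/recvs so the ring doesn't deadlock.
    for req in dist.batch_isend_irecv(ops):
        req.wait()
    return recv_buf
```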
I'll see if I can get it into a state where it does something soonish... and then push a PR for others to look at, help polish/test, etc
@rom1504 @fabiozappo what I have is here #634. I tried working on the distributed part a bit more, but I'm not sure I've got it behaving properly... it seems to either not converge, or converge very poorly vs the non-distributed version... currently trying a new autograd.Function approach which exchanges the grads in the opposite direction...
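Roughly, the autograd.Function idea looks like this (a sketch only, not the code in the PR; it reuses the hypothetical `neighbour_shift` helper from the snippet above):

```python
import torch

class NeighbourExchange(torch.autograd.Function):
    # Forward shifts features one rank forward around the ring; backward
    # routes the incoming grad one rank back, so each rank's features
    # receive the gradient computed against them on the neighbouring rank.
    @staticmethod
    def forward(ctx, tensor):
        return neighbour_shift(tensor, direction=1)

    @staticmethod
    def backward(ctx, grad_output):
        # The grad w.r.t. our input was produced on the rank we sent to,
        # so pull it back by shifting in the opposite direction.
        return neighbour_shift(grad_output, direction=-1)
```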