Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Database of generated embeddings #30

Open
gabrielctn opened this issue May 1, 2023 · 2 comments
Open

Database of generated embeddings #30

gabrielctn opened this issue May 1, 2023 · 2 comments
Assignees

Comments

@gabrielctn
Copy link

Hi, thanks so much for your work !

I was wondering, using Ankh, have you generated the embeddings on uniref50 or another database and made it available somewhere by any chance ? It would be awesome and time saving ! Both in float16 or float64. (float16 would be easier to store, and I think it is as performant as float64 on downstream tasks ?)

Thanks

@agemagician
Copy link
Owner

Hi,

Thanks for your interest.

We have not generated the embedding for uniref50. As you know, it would require a lot of computing and storage.
As our models are highly optimized, users could use them quickly to extract the embedding for their own use cases.

Could you please share your own use-case and what are the benefits for you to have access to the whole uniref50 embedding?

I have tagged our product owner @wafaaashraf to follow-up on this request.

@gabrielctn
Copy link
Author

Thanks for your answer.
I asked because it would save us some time and computing, but I might as well try generating embeddings on the fly.

My use-case is for research, I need all the embeddings of uniref50 because I use them for downstream tasks as input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants