Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update huggingface dataset #46

Open
rmaguire31 opened this issue Sep 4, 2024 · 1 comment
Open

Update huggingface dataset #46

rmaguire31 opened this issue Sep 4, 2024 · 1 comment

Comments

@rmaguire31
Copy link

rmaguire31 commented Sep 4, 2024

Hi

Really excellent work collating this benchmark (& all your excellent contributions to mutation effect prediction). My team and I have found the data really useful, and we're super excited to see the benchmark continue to grow. The data processing and collation of a wide range of assays is particularly useful.

Are you planning adding ProteinGym v1.0 as a new version to huggingface/datasets, in a similar manner to ProteinGym v0.1 (https://huggingface.co/datasets/OATML-Markslab/ProteinGym)? This is how we have currently been downloading ProteinGym, and would prefer to use the huggingface datasets interface if possible.

Kind regards
Russell

@pascalnotin
Copy link
Contributor

Hi @rmaguire31 -- thank you for the kind words!

We had pushed an updated version to HF that seemed to have remained private for a while. I just made it public here. Please let me know if it works for you / if you have any suggestions for improvement.

Kind regards,
Pascal

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants
@pascalnotin @rmaguire31 and others