You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.
Some datasets on the Dynabench NLI task were accidentally deleted on 03/10. As a result, the files and metrics were also removed from S3. Tracking progress on recovery and mitigation steps:
Leaderboard:
snli-test
mnli-test-mismatched
mnli-test-matched
anli-r1-test
anli-r2-test
anli-r3-test
Non Leaderboard
superglue-winogender
mnli-dev-mismatched
mnli-dev-matched
snli-dev
hans
nli-stress-test
anli-r1-dev
anli-r2-dev
anli-r3-dev
Next Steps
This was caused by confusion with the "add dataset" interface, where previous datasets looked like part of the submission form. Some steps to prevent future incidents:
UX improvements: Pop-up warning when someone tries to delete a dataset, or header above the datasets marking a clear separation from the submission form.
Enable bucket versions: Versioned S3 buckets would ensure deleted items are backed up for X days.
This process also uncovered several UX bugs, eg. successful dataset upload still sent an error message.
The text was updated successfully, but these errors were encountered:
One of the key errors here that was confusing was: if you don't assign a round to a dataset after uploading, it eventually fails to update the scores for that dataset in the DB. This happened before too see #835 , maybe we should maybe change the default round for dataset uploads to something non zero
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Some datasets on the Dynabench NLI task were accidentally deleted on 03/10. As a result, the files and metrics were also removed from S3. Tracking progress on recovery and mitigation steps:
Leaderboard:
Non Leaderboard
Next Steps
This was caused by confusion with the "add dataset" interface, where previous datasets looked like part of the submission form. Some steps to prevent future incidents:
This process also uncovered several UX bugs, eg. successful dataset upload still sent an error message.
The text was updated successfully, but these errors were encountered: